| Caching and database scaling in distributed shared-nothing information retrieval systems |
| Full text |
Pdf
(1.07 MB)
|
| Source
|
International Conference on Management of Data
archive
Proceedings of the 1993 ACM SIGMOD international conference on Management of data
table of contents
Washington, D.C., United States
Pages: 129 - 138
Year of Publication: 1993
ISBN:0-89791-592-5
Also published in ...
|
|
Authors
|
|
Anthony Tomasic
|
Stanford University, Department of Computer Science, Margaret Jacks Hall, Stanford, CA
|
|
Hector Garcia-Molina
|
Stanford University, Department of Computer Science, Margaret Jacks Hall, Stanford, CA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 4, Downloads (12 Months): 30, Citation Count: 3
|
|
|
ABSTRACT
A common class of existing information retrieval system provides access to abstracts. For example Stanford University, through its FOLIO system, provides access to the INSPECT database of abstracts of the literature on physics, computer science, electrical engineering, etc. In this paper this database is studied by using a trace-driven simulation. We focus on physical index design, inverted index caching, and database scaling in a distributed shared-nothing system. All three issues are shown to have a strong effect on response time and throughput. Database scaling is explored in two ways. One way assumes an “optimal” configuration for a single host and then linearly scales the database by duplicating the host architecture as needed. The second way determines the optimal number of hosts given a fixed database size.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
 |
3
|
J. K. Cringean , R. England , G. A. Manson , P. Willett, Parallel text searching in serial files using a processor farm, Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval, p.429-453, September 05-07, 1990, Brussels, Belgium
[doi> 10.1145/96749.98249]
|
| |
4
|
S. DeFazio and J. Hull. Toward servicing textual database transactions on symmetric shared memory multiprocessors. In Proceedings of the intern=tional Workshop on High Performance Transaction Systems, Asilomar, 1991.
|
| |
5
|
P. A. Erarath. Page Indzzing Jot Teztual Inyorma#ion Retrieval Systems. PhD thesis, University of illinois at Urbane-Champaign, October 1983.
|
 |
6
|
|
| |
7
|
W. B. Frt#kes and R. Baeza-Yates. Information Retrieval: Data Structures and Algorithms. Prentice-Hall, 1992.
|
| |
8
|
|
| |
9
|
1%. Jain. The Art of Computer Systems Performance Analysis. John Wiley and Sons, New York, 1991.
|
| |
10
|
B.-S. Jeong and E. Omleclnskl. Inverted file partitioning schemes for a shared-everytl#ing multlprocessor. Tech#cal Report GIT-CC-92/39, Georgia Institute of Technology, College of Computing, 1992.
|
 |
11
|
|
| |
12
|
|
| |
13
|
A. Tome#ic and H. Garcia-Mollna. Caching and database scaling in distributed shared-nothlng information retrieval systems. T~c2xrdcal Report STAN-CS-92-14#6, Stanford University, December 1992.
|
| |
14
|
|
CITED BY 3
|
|
|
|
|
Paricia Correia Saraiva , Edleno Silva de Moura , Novio Ziviani , Wagner Meira , Rodrigo Fonseca , Berthier Riberio-Neto, Rank-preserving two-level caching for scalable search engines, Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, p.51-58, September 2001, New Orleans, Louisiana, United States
|
|
|
|
|