ACM Home Page
Please provide us with feedback. Feedback
Caching and database scaling in distributed shared-nothing information retrieval systems
Full text PdfPdf (1.07 MB)
Source International Conference on Management of Data archive
Proceedings of the 1993 ACM SIGMOD international conference on Management of data table of contents
Washington, D.C., United States
Pages: 129 - 138  
Year of Publication: 1993
ISBN:0-89791-592-5
Also published in ...
Authors
Anthony Tomasic  Stanford University, Department of Computer Science, Margaret Jacks Hall, Stanford, CA
Hector Garcia-Molina  Stanford University, Department of Computer Science, Margaret Jacks Hall, Stanford, CA
Sponsors
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
SIGART: ACM Special Interest Group on Artificial Intelligence
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 30,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/170035.170063
What is a DOI?

ABSTRACT

A common class of existing information retrieval system provides access to abstracts. For example Stanford University, through its FOLIO system, provides access to the INSPECT database of abstracts of the literature on physics, computer science, electrical engineering, etc. In this paper this database is studied by using a trace-driven simulation. We focus on physical index design, inverted index caching, and database scaling in a distributed shared-nothing system. All three issues are shown to have a strong effect on response time and throughput. Database scaling is explored in two ways. One way assumes an “optimal” configuration for a single host and then linearly scales the database by duplicating the host architecture as needed. The second way determines the optimal number of hosts given a fixed database size.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
3
 
4
S. DeFazio and J. Hull. Toward servicing textual database transactions on symmetric shared memory multiprocessors. In Proceedings of the intern=tional Workshop on High Performance Transaction Systems, Asilomar, 1991.
 
5
P. A. Erarath. Page Indzzing Jot Teztual Inyorma#ion Retrieval Systems. PhD thesis, University of illinois at Urbane-Champaign, October 1983.
6
 
7
W. B. Frt#kes and R. Baeza-Yates. Information Retrieval: Data Structures and Algorithms. Prentice-Hall, 1992.
 
8
 
9
1%. Jain. The Art of Computer Systems Performance Analysis. John Wiley and Sons, New York, 1991.
 
10
B.-S. Jeong and E. Omleclnskl. Inverted file partitioning schemes for a shared-everytl#ing multlprocessor. Tech#cal Report GIT-CC-92/39, Georgia Institute of Technology, College of Computing, 1992.
11
 
12
 
13
A. Tome#ic and H. Garcia-Mollna. Caching and database scaling in distributed shared-nothlng information retrieval systems. T~c2xrdcal Report STAN-CS-92-14#6, Stanford University, December 1992.
 
14


Collaborative Colleagues:
Anthony Tomasic: colleagues
Hector Garcia-Molina: colleagues