The effectiveness of GlOSS for the text database discovery problem
Source: Proceedings of the 1994 ACM SIGMOD International Conference on Management of Data
Minneapolis, Minnesota, United States
Pages: 126 - 137  
Year of Publication: 1994
ISBN: 0-89791-639-5
Authors
Luis Gravano  Stanford University, Computer Science Dept., Margaret Jacks Hall, Stanford, CA
Héctor García-Molina  Stanford University, Computer Science Dept., Margaret Jacks Hall, Stanford, CA
Anthony Tomasic  Stanford University, Computer Science Dept., Margaret Jacks Hall, Stanford, CA and Princeton University, Department of Computer Science
Sponsors
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
SIGART: ACM Special Interest Group on Artificial Intelligence
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
DOI: http://doi.acm.org/10.1145/191839.191869
ABSTRACT

The popularity of on-line document databases has led to a new problem: finding which text databases (out of many candidate choices) are the most relevant to a user. Identifying the relevant databases for a given query is the text database discovery problem. The first part of this paper presents a practical solution based on estimating the result size of a query and a database. The method is termed GlOSS (Glossary-of-Servers Server). The second part of this paper evaluates the effectiveness of GlOSS based on a trace of real user queries. In addition, we analyze the storage cost of our approach.
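
The abstract does not spell out how the result size is estimated, so the following is only a minimal sketch of one plausible estimator, not the paper's definitive method. It assumes that a GlOSS-style server stores, for each database, the total number of documents and a per-term document frequency, and that query terms occur independently of one another within a database. The names `DatabaseSummary`, `estimate_result_size`, and `rank_databases` are hypothetical and chosen for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class DatabaseSummary:
    """Hypothetical per-database statistics kept by the discovery server."""
    name: str
    num_docs: int              # total number of documents in the database
    doc_freq: Dict[str, int]   # term -> number of documents containing it


def estimate_result_size(summary: DatabaseSummary, terms: List[str]) -> float:
    """Estimate how many documents match a conjunctive (AND) query.

    Assumption: terms are independent, so the expected number of matches
    is num_docs multiplied by the product of each term's document fraction.
    """
    if summary.num_docs == 0:
        return 0.0
    estimate = float(summary.num_docs)
    for term in terms:
        estimate *= summary.doc_freq.get(term, 0) / summary.num_docs
    return estimate


def rank_databases(
    summaries: List[DatabaseSummary], terms: List[str]
) -> List[Tuple[str, float]]:
    """Return (database name, estimate) pairs with a nonzero estimate,
    ordered from the largest estimated result size to the smallest."""
    scored = [(s.name, estimate_result_size(s, terms)) for s in summaries]
    scored = [(name, est) for name, est in scored if est > 0]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

Under these assumptions the server never contacts the databases at query time; it needs only the stored summaries, which is also what makes the storage cost of the approach worth analyzing.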


