| The effectiveness of GIOSS for the text database discovery problem |
| Full text |
Pdf
(1.36 MB)
|
| Source
|
International Conference on Management of Data
archive
Proceedings of the 1994 ACM SIGMOD international conference on Management of data
table of contents
Minneapolis, Minnesota, United States
Pages: 126 - 137
Year of Publication: 1994
ISBN:0-89791-639-5
Also published in ...
|
|
Authors
|
|
Luis Gravano
|
Stanford University, Computer Science Dept., Margaret Jacks Hall, Stanford, CA
|
|
Héctor García-Molina
|
Stanford University, Computer Science Dept., Margaret Jacks Hall, Stanford, CA
|
|
Anthony Tomasic
|
Stanford University, Computer Science Dept., Margaret Jacks Hall, Stanford, CA and Princeton University, Department of Computer Science
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 2, Downloads (12 Months): 35, Citation Count: 53
|
|
|
ABSTRACT
The popularity of on-line document databases has led to a new problem: finding which text databases (out of many candidate choices) are the most relevant to a user. Identifying the relevant databases for a given query is the text database discovery problem. The first part of this paper presents a practical solution based on estimating the result size of a query and a database. The method is termed GlOSS—Glossary of Servers Server. The second part of this paper evaluates the effectiveness of GlOSS based on a trace of real user queries. In addition, we analyze the storage cost of our approach.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Michael F. Schwartz, Alan Emtage, Brewster Kahle, and B. Cliford Neuman. A comparison of INTERNET resource discovery approaches. Computer Systems, 5(4), 1992.
|
| |
3
|
|
| |
4
|
Tim Berners-Lee, Robert Cailliau, Jean-F. Croft, and Bernd Pollermann. World-Wide Web: The Information Universe. Electronic Networking: Research, Applications and Policy, 1(2), 1992.
|
| |
5
|
Steve Foster. About the Veronica service, November 1992. Message posted in comp. ~nfosystems. gopher.
|
| |
6
|
B. Clifford Neuman. The Prospero File System: A global file system based on the Virtual System model. Computer Systems, 5(4), 1992.
|
| |
7
|
Brewster Kahle and Art Medlar. An information system for corporate users: Wide Area Information Servers. Technical Report TMC199, Thinking Machines Corporation, April 1991.
|
| |
8
|
Jim Fullton, Archie Warnock, et al. Release notes for freeWAIS 0.2, October 1993.
|
| |
9
|
Michael F. Schwartz. A scalable, non-hierarchical resource discovery mechanism based on probabilistic protocols. Technical Report CU-CS-474-90, Dept. of Computer Science, University of Colorado at Boulder, June 1990.
|
| |
10
|
|
| |
11
|
Peter B. Danzig, Shih-Hao Li, and Katia Obraczka. Distributed indexing of autonomous INTERNET services. Computer Systems, 5(4), 1992.
|
 |
12
|
Peter B. Danzig , Jongsuk Ahn , John Noll , Katia Obraczka, Distributed indexing: a scalable mechanism for distributed information retrieval, Proceedings of the 14th annual international ACM SIGIR conference on Research and development in information retrieval, p.220-229, October 13-16, 1991, Chicago, Illinois, United States
[doi> 10.1145/122860.122883]
|
| |
13
|
Patricia Simpson and Rafael Alonso. Querying a network of autonomous databases. Technical Report CS-TR-202-89, Dept. of Computer Science, Princeton University, January 1989.
|
| |
14
|
Daniel Barbar# and Chris Clifton. Information Brokers: Sharing knowledge in a heterogeneous distributed system. Technical Report MITL-TR-31-92, Matsushita Information Technology Laboratory, October 1992.
|
| |
15
|
Joann J. OrdilIe and Barton P. Miller. Distributed active catalogs and meta-data caching in descriptive name services. Technical Report #1118, University of Wisconsin-Madison, November 1992.
|
| |
16
|
Chris Weider and Simon Spero. Architecture of the WHOIS++ Index Service, October 1993. Working draft.
|
| |
17
|
Ran Giladi and Peretz Shoval. Routing queries in a network of databases driven by a meta knowledgebase. In Proceedings of the International Workshop on Next Generation Informatwn Technologies and Systems, June 1993.
|
| |
18
|
Mark A. Sheldon , Andrzej Duda , Ron Weiss , James W. O'Toole, Jr. , David K. Gifford, Content routing for distributed information servers, Proceedings of the 4th international conference on extending database technology on Advances in database technology, p.109-122, May 1994, Cambridge, United Kingdom
|
| |
19
|
Alice Y. Chamis. Selection of online databases using switching vocabularies. Journal of the American Society for Information Sc,ence, 39(3), 1988.
|
| |
20
|
|
 |
21
|
|
CITED BY 53
|
|
|
|
|
|
|
|
James C. French , Allison L. Powell , Jamie Callan , Charles L. Viles , Travis Emmitt , Kevin J. Prey , Yun Mou, Comparing the performance of database selection algorithms, Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, p.238-245, August 15-19, 1999, Berkeley, California, United States
|
|
|
|
|
|
Carl Lagoze , David Fielding , Sandra Payette, Making global digital libraries work: collection services, connectivity regions, and collection views, Proceedings of the third ACM conference on Digital libraries, p.134-143, June 23-26, 1998, Pittsburgh, Pennsylvania, United States
|
|
|
James C. French , Allison L. Powell , Walter R. Creighton, III, Efficient searching in distributed digital libraries, Proceedings of the third ACM conference on Digital libraries, p.283-284, June 23-26, 1998, Pittsburgh, Pennsylvania, United States
|
|
|
Michelle Baldonado , Chen-Chuan K. Chang , Luis Gravano , Andreas Paepcke, Metadata for digital libraries: architecture and design rationale, Proceedings of the second ACM international conference on Digital libraries, p.47-56, July 23-26, 1997, Philadelphia, Pennsylvania, United States
|
|
|
Allison L. Powell , James C. French , Jamie Callan , Margaret Connell , Charles L. Viles, The impact of database selection on distributed searching, Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval, p.232-239, July 24-28, 2000, Athens, Greece
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
James P. Callan , Zhihong Lu , W. Bruce Croft, Searching distributed collections with inference networks, Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval, p.21-28, July 09-13, 1995, Seattle, Washington, United States
|
|
|
Wendy Chang , Gholamhosein Sheikholeslami , Aidong Zhang , Tanveer F. Syeda-Mahmood, Efficient resource selection in distributed visual information systems, Proceedings of the fifth ACM international conference on Multimedia, p.203-213, November 09-13, 1997, Seattle, Washington, United States
|
|
|
|
|
|
Jian Xu , Yinyan Cao , Ee-Peng Lim , Wee-Keong Ng, Database selection techniques for routing bibliographic queries, Proceedings of the third ACM conference on Digital libraries, p.264-274, June 23-26, 1998, Pittsburgh, Pennsylvania, United States
|
|
|
Naomi Dushay , James C. French , Carl Lagoze, Using query mediators for distributed searching in federated digital libraries, Proceedings of the fourth ACM conference on Digital libraries, p.171-178, August 11-14, 1999, Berkeley, California, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
James C. French , Allison L. Powell , Charles L. Viles , Travis Emmitt , Kevin J. Prey, Evaluating database selection techniques: a testbed and experiment, Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval, p.121-129, August 24-28, 1998, Melbourne, Australia
|
|
|
Luo Si , Rong Jin , Jamie Callan , Paul Ogilvie, A language modeling framework for resource selection and results merging, Proceedings of the eleventh international conference on Information and knowledge management, November 04-09, 2002, McLean, Virginia, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Andreas Paepcke , Steve B. Cousins , Hector Garcia-Molina , Scott W. Hassan , Steven P. Ketchpel , Martin Röscheisen , Terry Winograd, Using Distributed Objects for Digital Library Interoperability, Computer, v.29 n.5, p.61-68, May 1996
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jack G. Conrad , Xi S. Guo , Peter Jackson , Monem Meziou, Database selection using actual physical and acquired logical collection resources in a massive domain-specific operational environment, Proceedings of the 28th international conference on Very Large Data Bases, p.71-82, August 20-23, 2002, Hong Kong, China
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|