ACM Home Page
Please provide us with feedback. Feedback
Efficient and effective metasearch for text databases incorporating linkages among documents
Full text PdfPdf (245 KB)
Source International Conference on Management of Data archive
Proceedings of the 2001 ACM SIGMOD international conference on Management of data table of contents
Santa Barbara, California, United States
Pages: 187 - 198  
Year of Publication: 2001
ISBN:1-58113-332-4
Also published in ...
Authors
Clement Yu  Dept. of CS, U. of Illinois at Chicago, Chicago, IL
Weiyi Meng  Dept. of CS, SUNY at Binghamton, Binghamton, NY
Wensheng Wu  Dept. of Computer Science, UIUC
King-Lup Liu  School of CSTIS, DePaul University, Chicago, IL
Sponsor
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 44,   Citation Count: 11
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/375663.375684
What is a DOI?

ABSTRACT

Linkages among documents have a significant impact on the importance of documents, as it can be argued that important documents are pointed to by many documents or by other important documents. Metasearch engines can be used to facilitate ordinary users for retrieving information from multiple local sources (text databases). There is a search engine associated with each database. In a large-scale metasearch engine, the contents of each local database is represented by a representative. Each user query is evaluated against he set of representatives of all databases in order to determine the appropriate databases (search engines) to search (invoke) In previous word, the linkage information between documents has not been utilized in determining the appropriate databases to search. In this paper, such information is employed to determine the degree of relevance of a document with respect to a given query. Specifically, the importance (rank) of each document as determined by the linkages is integrated in each database representative to facilitate the selection of databases for each given query. We establish a necessary and sufficient condition to rank databases optimally, while incorporating the linkage information. A method is provided to estimate the desired quantities stated in the necessary and sufficient condition. The estimation method runs in time linearly proportional to the number of query terms. Experimental results are provided to demonstrate the high retrieval effectiveness of the method.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
4
 
5
Y. Fan, and S. Gauch. Adaptive Agents for Information Gathering from Multiple, Distributed Information Sources. 1999 AAAI Symposium on Intelligent Agents in Cyberspace, Stanford University, March 1999.
 
6
 
7
 
8
A. Howe, and D. Dreilinger. SavvySearch: A Meta-Search Engine that Learns Which Search Engines to Query. AI Magazine, 18(2), 1997.
9
 
10
B. Kahle, and A. Medlar. An Information System for Corporate Users: Wide Area information Servers. Technical Report TMC199, Thinking Machine Corporation, April 1991.
 
11
T. Kirk, A. Levy, Y. Sagiv, and D. Srivastava. The Information Manifold. AAAI Spring Symposium on Information Gathering in Distributed Heterogeneous Environments. 1995.
12
 
13
 
14
15
 
16
 
17
U. Manber, and P. Bigot. The Search Broker. USENIX Symposium on Internet Technologies and Systems (NSITS'97), Monterey, California, 1997, pp. 231-239.
 
18
 
19
 
20
W. Meng, C. Yu, and K. Liu. Building Efficient and Effective Metasearch Engines, Technical Report, Dept. of CS, SUNY at Binghamton, 2000.
 
21
L. Page, S. Brin, R. Motwani, and Terry Winograd. The PageRank Citation Ranking: Bring Order to the Web. Technical Report, Stanford University, 1998.
22
 
23
 
24
E. Selberg, and O. Etzioni. The MetaCrawler Architecture forResource Aggregation on the Web. IEEE Expert, 1997.
25
26
27
28
29
 
30
 
31
 
32
33
 
34

CITED BY  11

Collaborative Colleagues:
Clement Yu: colleagues
Weiyi Meng: colleagues
Wensheng Wu: colleagues
King-Lup Liu: colleagues