ACM Home Page
Please provide us with feedback. Feedback
Compiling document collections from the Internet
Full text PdfPdf (684 KB)
Source ACM SIGIR Forum archive
Volume 34 ,  Issue 2  (Fall 2000) table of contents
Pages: 9 - 14  
Year of Publication: 2000
ISSN:0163-5840
Author
V. Kluev  The Core and Information Technology Center, The University of Aizu, Tsuruga, Ikki-machi Aizu-Wakamatsu City, Fukushima, 965-8580, Japan
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 0,   Downloads (12 Months): 5,   Citation Count: 2
Additional Information:

abstract   cited by   index terms  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/381258.381264
What is a DOI?

ABSTRACT

Presently domain specific search engines are becoming popular because they offer greater accuracy, when compared to general purpose search engines. In this study, a method for collecting domain specific documents from the net was developed for the purpose of improving search results. The main thrust of our approach is to use several metrics to estimate the relevance of every automatically discovered document by a crawler regarding a topic of interest. This type of search resulted in two important findings. First, the time required for manual analysis of document content by the crawler was significantly reduced; second, the content quality of selected documents was improved. These results suggest that the rough estimation of precision and recall calculated in this study offer great promise.