ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
A knowledge-based search engine powered by wikipedia
Full text PdfPdf (407 KB)
Source
Conference on Information and Knowledge Management archive
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management table of contents
Lisbon, Portugal
SESSION: Semantic IR (IR) table of contents
Pages: 445-454  
Year of Publication: 2007
ISBN:978-1-59593-803-9
Authors
David N. Milne  University of Waikato, Hamilton, New Zealand
Ian H. Witten  University of Waikato, Hamilton, New Zealand
David M. Nichols  University of Waikato, Hamilton, New Zealand
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 23,   Downloads (12 Months): 240,   Citation Count: 11
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1321440.1321504
What is a DOI?

ABSTRACT

This paper describes Koru, a new search interface that offers effective domain-independent knowledge-based information retrieval. Koru exhibits an understanding of the topics of both queries and documents. This allows it to (a) expand queries automatically and (b) help guide the user as they evolve their queries interactively. Its understanding is mined from the vast investment of manual effort and judgment that is Wikipedia. We show how this open, constantly evolving encyclopedia can yield inexpensive knowledge structures that are specifically tailored to expose the topics, terminology and semantics of individual document collections. We conducted a detailed user study with 12 participants and 10 topics from the 2005 TREC HARD track, and found that Koru and its underlying knowledge base offers significant advantages over traditional keyword search. It was capable of lending assistance to almost every query issued to it; making their entry more efficient, improving the relevance of the documents they return, and narrowing the gap between expert and novice seekers.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Allan, J. (2005) HARD Track overview in TREC 2005 high accuracy retrieval from documents. Proc TREC-2005.
 
2
 
3
 
4
 
5
 
6
 
7
FAO (1995) Agrovoc Multilingual Agricultural Thesaurus, Food and Agricultural Organization of the United Nations.
8
 
9
Gabrilovich, E. and Markovitch, S. (2005) Feature Generation for Text Categorization Using World Knowledge. Proc. Int Joint Conf on Artificial Intelligence, pp 1048--1053.
 
10
Gabrilovich, E. and Markovitch, S. (2006) Overcoming the Brittleness bottleneck using Wikipedia: enhancing text categorization with encyclopedic knowledge. Proc. American Association for Artificial Intelligence, pp. 1301--1306.
 
11
 
12
 
13
14
15
 
16
Milne, D. and Witten, I. H. (2007) Extracting Corpus Specific Knowledge Bases from Wikipedia, Working Paper 03/2007, Department of Computer Science, University of Waikato.
 
17
18
 
19
Shiri, A. and Revie, C. (2005) Usability and user perceptions of a thesaurus-enhanced search interface. Journal of Documentation 61(5), 640--656.
 
20
 
21

CITED BY  11

Collaborative Colleagues:
David N. Milne: colleagues
Ian H. Witten: colleagues
David M. Nichols: colleagues