| A knowledge-based search engine powered by wikipedia |
| Full text |
Pdf
(407 KB)
|
Source
|
Conference on Information and Knowledge Management
archive
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
table of contents
Lisbon, Portugal
SESSION: Semantic IR (IR)
table of contents
Pages 445-454
Year of Publication: 2007
ISBN:978-1-59593-803-9
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 23, Downloads (12 Months): 267, Citation Count: 8
|
|
|
ABSTRACT
This paper describes Koru, a new search interface that offers effective domain-independent knowledge-based information retrieval. Koru exhibits an understanding of the topics of both queries and documents. This allows it to (a) expand queries automatically and (b) help guide the user as they evolve their queries interactively. Its understanding is mined from the vast investment of manual effort and judgment that is Wikipedia. We show how this open, constantly evolving encyclopedia can yield inexpensive knowledge structures that are specifically tailored to expose the topics, terminology and semantics of individual document collections. We conducted a detailed user study with 12 participants and 10 topics from the 2005 TREC HARD track, and found that Koru and its underlying knowledge base offers significant advantages over traditional keyword search. It was capable of lending assistance to almost every query issued to it; making their entry more efficient, improving the relevance of the documents they return, and narrowing the gap between expert and novice seekers.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Allan, J. (2005) HARD Track overview in TREC 2005 high accuracy retrieval from documents. Proc TREC-2005.
|
| |
2
|
|
| |
3
|
|
| |
4
|
|
| |
5
|
|
| |
6
|
|
| |
7
|
FAO (1995) Agrovoc Multilingual Agricultural Thesaurus, Food and Agricultural Organization of the United Nations.
|
 |
8
|
|
| |
9
|
Gabrilovich, E. and Markovitch, S. (2005) Feature Generation for Text Categorization Using World Knowledge. Proc. Int Joint Conf on Artificial Intelligence, pp 1048--1053.
|
| |
10
|
Gabrilovich, E. and Markovitch, S. (2006) Overcoming the Brittleness bottleneck using Wikipedia: enhancing text categorization with encyclopedic knowledge. Proc. American Association for Artificial Intelligence, pp. 1301--1306.
|
| |
11
|
|
| |
12
|
|
| |
13
|
|
 |
14
|
Rila Mandala , Takenobu Tokunaga , Hozumi Tanaka, Combining multiple evidence from different types of thesaurus for query expansion, Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, p.191-197, August 15-19, 1999, Berkeley, California, United States
[doi> 10.1145/312624.312677]
|
 |
15
|
|
| |
16
|
Milne, D. and Witten, I. H. (2007) Extracting Corpus Specific Knowledge Bases from Wikipedia, Working Paper 03/2007, Department of Computer Science, University of Waikato.
|
| |
17
|
|
 |
18
|
|
| |
19
|
Shiri, A. and Revie, C. (2005) Usability and user perceptions of a thesaurus-enhanced search interface. Journal of Documentation 61(5), 640--656.
|
| |
20
|
|
| |
21
|
|
CITED BY 9
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Huan Wang , Xing Jiang , Liang-Tien Chia , Ah-Hwee Tan, Ontology enhanced web image retrieval: aided by wikipedia & spreading activation theory, Proceeding of the 1st ACM international conference on Multimedia information retrieval, October 30-31, 2008, Vancouver, British Columbia, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|