| Toward an improved concept-based information retrieval system |
| Full text |
Pdf
(150 KB)
|
| Source
|
Annual ACM Conference on Research and Development in Information Retrieval
archive
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
table of contents
New Orleans, Louisiana, United States
Pages: 384 - 385
Year of Publication: 2001
ISBN:1-58113-331-6
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 4, Downloads (12 Months): 54, Citation Count: 4
|
|
|
ABSTRACT
This paper presents a novel information retrieval system that includes 1) the addition of concepts to facilitate the identification of the correct word sense, 2) a natural language query interface, 3) the inclusion of weights and penalties for proper nouns that build upon the Okapi weighting scheme, and 4) a term clustering technique that exploits the spatial proximity of search terms in a document to further improve the performance. The effectiveness of the system is validated by experimental results.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Clifford Weinstein, Young-Suk Lee, Stephanie Seneff, Dinesh Tummala, Beth Carlson, John T. Lynch, Jung-Taik Hwang, Linda Kukolich. "Automated English-Korean Translation for Enhanced Coalition Communications," The Lincoln Laboratory Journal 10(1). pp. 35-60, 1997.
|
| |
2
|
|
| |
3
|
G. Miller. "WordNet: An On-Line Lexical Database," International Journal of Lexicography. Vol. 3, No. 4, 1990.
|
| |
4
|
|
| |
5
|
S.E. Robertson, S. Walker, S. Jones, M.M. Hancock-Beaulieu, M. Gatford. "Okapi at TREC-3," in The 3rd Text Retrieval Conference (TREC-3) ed. by D.K. Harman,. NIST Special Pub. 500-225, 1995.
|
| |
6
|
|
| |
7
|
Young-Suk Lee, Clifford Weinstein. "An Integrated Approach to English-Korean Translation and Translingual Information Access." Proceedings of the C-STAR Workshop. Schwetzingen, Germany, September 1999.
|
INDEX TERMS
Primary Classification:
H.
Information Systems
H.3
INFORMATION STORAGE AND RETRIEVAL
H.3.3
Information Search and Retrieval
Subjects:
Retrieval models
Additional Classification:
H.
Information Systems
H.3
INFORMATION STORAGE AND RETRIEVAL
H.3.3
Information Search and Retrieval
Subjects:
Search process
H.5
INFORMATION INTERFACES AND PRESENTATION (I.7)
H.5.2
User Interfaces (D.2.2, H.1.2, I.3.6)
Subjects:
Natural language
General Terms:
Design,
Experimentation,
Human Factors,
Performance
Keywords:
Roget's thesaurus,
WordNet,
brill tegger,
concept,
information retrieval,
word sense disambiguation
|