ACM Home Page
Please provide us with feedback. Feedback
Semantic term matching in axiomatic approaches to information retrieval
Full text PdfPdf (231 KB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Seattle, Washington, USA
SESSION: Semantics table of contents
Pages: 115 - 122  
Year of Publication: 2006
ISBN:1-59593-369-7
Authors
Hui Fang  University of Illinois at Urbana-Champaign
ChengXiang Zhai  University of Illinois at Urbana-Champaign
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 20,   Downloads (12 Months): 181,   Citation Count: 6
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1148170.1148193
What is a DOI?

ABSTRACT

A common limitation of many retrieval models, including the recently proposed axiomatic approaches, is that retrieval scores are solely based on exact (i.e., syntactic) matching of terms in the queries and documents, without allowing distinct but semantically related terms to match each other and contribute to the retrieval score. In this paper, we show that semantic term matching can be naturally incorporated into the axiomatic retrieval model through defining the primitive weighting function based on a semantic similarity function of terms. We define several desirable retrieval constraints for semantic term matching and use such constraints to extend the axiomatic model to directly support semantic term matching based on the mutual information of terms computed on some document set. We show that such extension can be efficiently implemented as query expansion. Experiment results on several representative data sets show that, with mutual information computed over the documents in either the target collection for retrieval or an external collection such as the Web, our semantic expansion consistently and substantially improves retrieval accuracy over the baseline axiomatic retrieval model. As a pseudo feedback method, our method also outperforms a state-of-the-art language modeling feedback method.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
3
4
5
6
7
8
 
9
 
10
Y. Jing and W. B. Croft. An association thesaurus for information retreival. In Proceedings of RIAO 1994.
 
11
M. Lesk. Word-word associations in document retrieval systems. American Documentation 20:27--38, 1969.
12
13
 
14
R. Mandala, T. Tokunaga, H. Tanaka, A. Okumura, and K. Satoh. Ad hoc retrieval experiments using wordnet and automatically constructed thesauri.In Proceedings of the Seventh Text REtrieval Conference (TREC-7), pages 475--481, 1998.
15
16
 
17
 
18
H. J. Peat and P. Willett. The limitations of term co-occurence data for query expansion in document retrieval systems. Journal of the american society for information science 42(5): 378--383, 1991.
19
20
 
21
J. Rocchio. Relevance feedback in information retrieval. In The SMART Retrieval System: Experiments in Automatic Document Processing pages 313--323. Prentice-Hall Inc., 1971.
 
22
 
23
 
24
A. F. Smeaton and C. J. van Rijsbergen. The retrieval effects of query expansion on feedback document retrieval system. The Computer Journal 26(3): 239--246, 1983.
 
25
 
26
 
27
E. M. Voorhees. Overview of the trec 2004 robust retrieval track. In Proceedings of the Thirteenth Text REtrieval Conference (TREC2004), 2005.
 
28
E. M. Voorhees. Overview of the trec 2005 robust retrieval track. In Proceedings of the Fourteenth Text REtrieval Conference (TREC2005), 2006.
29
 
30
C. Zhai and J. Lafferty. Model-based feedback in the KL-divergence retrieval model. In Tenth International Conference on Information and Knowledge Management (CIKM 2001), pages 403--410,2001.
31


Collaborative Colleagues:
Hui Fang: colleagues
ChengXiang Zhai: colleagues