|
ABSTRACT
A common limitation of many retrieval models, including the recently proposed axiomatic approaches, is that retrieval scores are solely based on exact (i.e., syntactic) matching of terms in the queries and documents, without allowing distinct but semantically related terms to match each other and contribute to the retrieval score. In this paper, we show that semantic term matching can be naturally incorporated into the axiomatic retrieval model through defining the primitive weighting function based on a semantic similarity function of terms. We define several desirable retrieval constraints for semantic term matching and use such constraints to extend the axiomatic model to directly support semantic term matching based on the mutual information of terms computed on some document set. We show that such extension can be efficiently implemented as query expansion. Experiment results on several representative data sets show that, with mutual information computed over the documents in either the target collection for retrieval or an external collection such as the Web, our semantic expansion consistently and substantially improves retrieval accuracy over the baseline axiomatic retrieval model. As a pseudo feedback method, our method also outperforms a state-of-the-art language modeling feedback method.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
Jing Bai , Dawei Song , Peter Bruza , Jian-Yun Nie , Guihong Cao, Query expansion using term relationships in language models for information retrieval, Proceedings of the 14th ACM international conference on Information and knowledge management, October 31-November 05, 2005, Bremen, Germany
[doi> 10.1145/1099554.1099725]
|
 |
3
|
|
 |
4
|
|
 |
5
|
|
 |
6
|
|
 |
7
|
Jianfeng Gao , Ming Zhou , Jian-Yun Nie , Hongzhao He , Weijun Chen, Resolving query translation ambiguity using a decaying co-occurrence model and syntactic dependence relations, Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, August 11-15, 2002, Tampere, Finland
[doi> 10.1145/564376.564409]
|
 |
8
|
Jianfeng Gao , Jian-Yun Nie , Endong Xun , Jian Zhang , Ming Zhou , Changning Huang, Improving query translation for cross-language information retrieval using statistical models, Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, p.96-104, September 2001, New Orleans, Louisiana, United States
[doi> 10.1145/383952.383966]
|
| |
9
|
|
| |
10
|
Y. Jing and W. B. Croft. An association thesaurus for information retreival. In Proceedings of RIAO 1994.
|
| |
11
|
M. Lesk. Word-word associations in document retrieval systems. American Documentation 20:27--38, 1969.
|
 |
12
|
|
 |
13
|
Akira Maeda , Fatiha Sadat , Masatoshi Yoshikawa , Shunsuke Uemura, Query term disambiguation for Web cross-language information retrieval using a search engine, Proceedings of the fifth international workshop on on Information retrieval with Asian languages, p.25-32, September 30-October 01, 2000, Hong Kong, China
[doi> 10.1145/355214.355218]
|
| |
14
|
R. Mandala, T. Tokunaga, H. Tanaka, A. Okumura, and K. Satoh. Ad hoc retrieval experiments using wordnet and automatically constructed thesauri.In Proceedings of the Seventh Text REtrieval Conference (TREC-7), pages 475--481, 1998.
|
 |
15
|
|
 |
16
|
|
| |
17
|
|
| |
18
|
H. J. Peat and P. Willett. The limitations of term co-occurence data for query expansion in document retrieval systems. Journal of the american society for information science 42(5): 378--383, 1991.
|
 |
19
|
|
 |
20
|
|
| |
21
|
J. Rocchio. Relevance feedback in information retrieval. In The SMART Retrieval System: Experiments in Automatic Document Processing pages 313--323. Prentice-Hall Inc., 1971.
|
| |
22
|
|
| |
23
|
|
| |
24
|
A. F. Smeaton and C. J. van Rijsbergen. The retrieval effects of query expansion on feedback document retrieval system. The Computer Journal 26(3): 239--246, 1983.
|
| |
25
|
|
| |
26
|
|
| |
27
|
E. M. Voorhees. Overview of the trec 2004 robust retrieval track. In Proceedings of the Thirteenth Text REtrieval Conference (TREC2004), 2005.
|
| |
28
|
E. M. Voorhees. Overview of the trec 2005 robust retrieval track. In Proceedings of the Fourteenth Text REtrieval Conference (TREC2005), 2006.
|
 |
29
|
|
| |
30
|
C. Zhai and J. Lafferty. Model-based feedback in the KL-divergence retrieval model. In Tenth International Conference on Information and Knowledge Management (CIKM 2001), pages 403--410,2001.
|
 |
31
|
|
|