|
ABSTRACT
In information retrieval, queries can fail to find documents due to mismatch in terminology. Query expansion is a well-known technique addressing this problem, where additional query terms are automatically chosen from highly ranked documents, and it has been shown to be effective at improving query performance. However, current techniques for query expansion use fixed values for key parameters, determined by tuning on test collections. In this paper we show that these parameters may not be generally applicable, and more significantly that the assumption that the same parameter settings can be used for all queries is invalid. Using detailed experiments with two test collections, we demonstrate that new methods for choosing parameters must be found. However, our experiments also demonstrate that there is considerable further scope for improvement to effectiveness through better query expansion.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Arampatzis, A. & van der Weide, T. (2001), "Document filtering as an adaptive and temporally-dependent process".
|
| |
2
|
|
 |
3
|
|
 |
4
|
Bodo Billerbeck , Falk Scholer , Hugh E. Williams , Justin Zobel, Query expansion using associated queries, Proceedings of the twelfth international conference on Information and knowledge management, November 03-08, 2003, New Orleans, LA, USA
[doi> 10.1145/956863.956866]
|
 |
5
|
|
| |
6
|
Buckley, C., Salton, G., Allan, J. & Singhal, A (1994), Automatic query expansion using SMART: TREC 3, in "Text REtrieval Conference".
|
 |
7
|
|
 |
8
|
|
| |
9
|
|
| |
10
|
|
| |
11
|
|
 |
12
|
Keiichiro Hoashi , Kazunori Matsumoto , Naomi Inoue , Kazuo Hashimoto, Query expansion method based on word contribution (poster abstract), Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, p.303-304, August 15-19, 1999, Berkeley, California, United States
[doi> 10.1145/312624.312733]
|
 |
13
|
|
| |
14
|
|
 |
15
|
|
 |
16
|
|
 |
17
|
Rila Mandala , Takenobu Tokunaga , Hozumi Tanaka, Combining multiple evidence from different types of thesaurus for query expansion, Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, p.191-197, August 15-19, 1999, Berkeley, California, United States
[doi> 10.1145/312624.312677]
|
 |
18
|
|
| |
19
|
Page, L., Brin, S., Motwani, R. & Winograd, T. (1998), The pagerank citation ranking: Bringing order to the web, Technical report, Stanford Digital Library Technologies Project.
|
 |
20
|
|
| |
21
|
|
| |
22
|
Robertson, S. E. & Walker, S. (1999), Okapi/Keenbow at TREC-8, in "The Eighth Text REtrieval Conference (TREC-8)", NIST Special Publication 500-264, Gaithersburg, MD, pp. 151--161.
|
| |
23
|
Robertson, S. E. & Walker, S. (2000), Microsoft Cambridge at TREC-9: Filtering Track, in "The Ninth Text RE-trieval Conference (TREC-9)", NIST Special Publication 500-249, Gaithersburg, MD, pp. 361--368.
|
| |
24
|
Robertson, S. E., Walker, S., Hancock-Beaulieu, M., Gull, A. & Lau, M. (1992), Okapi at TREC, in "Text RETrieval Conference", pp. 21--30.
|
| |
25
|
Rocchio, J. J. (1971), Relevance feedback in information retrieval, in E. Ide & G. Salton, eds, "The Smart Retrieval System --- Experiments in Automatic Document Processing", Prentice-Hall, Englewood, Cliffs, New Jersey, pp. 313--323.
|
 |
26
|
|
| |
27
|
|
| |
28
|
|
 |
29
|
|
| |
30
|
|
| |
31
|
|
| |
32
|
|
| |
33
|
Voorhees, E. M. & Harman, D. K. (1999), Overview of the Eighth Text REtrieval Conference (TREC-8), in E. M. Voorhees & D. K. Harman, eds, "The Eighth Text REtrieval Conference (TREC 8)", National Institute of Standards and Technology Special Publication 500-249, Gaithersburg, MD, pp. 1--23.
|
| |
34
|
Voorhees, E. M. & Harman, D. K. (2000), Overview of the Ninth Text REtrieval Conference (TREC-9), in E. M. Voorhees & D. K. Harman, eds, "The Ninth Text REtrieval Conference (TREC 9)", National Institute of Standards and Technology Special Publication 500-249, Gaithersburg, MD, pp. 1--14.
|
| |
35
|
|
 |
36
|
|
CITED BY 12
|
|
|
|
|
|
|
|
|
|
|
Min Song , Il Yeol Song , Robert B. Allen , Zoran Obradovic, Keyphrase extraction-based query expansion in digital libraries, Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries, June 11-15, 2006, Chapel Hill, NC, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Ralf Schenkel , Tom Crecelius , Mouna Kacimi , Sebastian Michel , Thomas Neumann , Josiane X. Parreira , Gerhard Weikum, Efficient top-k querying over social-tagging networks, Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, July 20-24, 2008, Singapore, Singapore
|
|
|
|
|
|
Tom Crecelius , Mouna Kacimi , Sebastian Michel , Thomas Neumann , Josiane Xavier Parreira , Ralf Schenkel , Gerhard Weikum, Making SENSE: socially enhanced search and exploration, Proceedings of the VLDB Endowment, v.1 n.2, August 2008
|
|