ACM Home Page
Please provide us with feedback. Feedback
Questioning query expansion: an examination of behaviour and parameters
Full text PdfPdf (601 KB)
Source ACM International Conference Proceeding Series; Vol. 52 archive
Proceedings of the 15th Australasian database conference - Volume 27 table of contents
Dunedin, New Zealand
Pages: 69 - 76  
Year of Publication: 2004
Authors
Bodo Billerbeck  RMIT University, Melbourne, Australia
Justin Zobel  RMIT University, Melbourne, Australia
Publisher
Australian Computer Society, Inc.  Darlinghurst, Australia, Australia
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 38,   Citation Count: 12
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

In information retrieval, queries can fail to find documents due to mismatch in terminology. Query expansion is a well-known technique addressing this problem, where additional query terms are automatically chosen from highly ranked documents, and it has been shown to be effective at improving query performance. However, current techniques for query expansion use fixed values for key parameters, determined by tuning on test collections. In this paper we show that these parameters may not be generally applicable, and more significantly that the assumption that the same parameter settings can be used for all queries is invalid. Using detailed experiments with two test collections, we demonstrate that new methods for choosing parameters must be found. However, our experiments also demonstrate that there is considerable further scope for improvement to effectiveness through better query expansion.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Arampatzis, A. & van der Weide, T. (2001), "Document filtering as an adaptive and temporally-dependent process".
 
2
3
4
5
 
6
Buckley, C., Salton, G., Allan, J. & Singhal, A (1994), Automatic query expansion using SMART: TREC 3, in "Text REtrieval Conference".
7
8
 
9
 
10
 
11
12
13
 
14
15
16
17
18
 
19
Page, L., Brin, S., Motwani, R. & Winograd, T. (1998), The pagerank citation ranking: Bringing order to the web, Technical report, Stanford Digital Library Technologies Project.
20
 
21
 
22
Robertson, S. E. & Walker, S. (1999), Okapi/Keenbow at TREC-8, in "The Eighth Text REtrieval Conference (TREC-8)", NIST Special Publication 500-264, Gaithersburg, MD, pp. 151--161.
 
23
Robertson, S. E. & Walker, S. (2000), Microsoft Cambridge at TREC-9: Filtering Track, in "The Ninth Text RE-trieval Conference (TREC-9)", NIST Special Publication 500-249, Gaithersburg, MD, pp. 361--368.
 
24
Robertson, S. E., Walker, S., Hancock-Beaulieu, M., Gull, A. & Lau, M. (1992), Okapi at TREC, in "Text RETrieval Conference", pp. 21--30.
 
25
Rocchio, J. J. (1971), Relevance feedback in information retrieval, in E. Ide & G. Salton, eds, "The Smart Retrieval System --- Experiments in Automatic Document Processing", Prentice-Hall, Englewood, Cliffs, New Jersey, pp. 313--323.
26
 
27
 
28
29
 
30
 
31
 
32
 
33
Voorhees, E. M. & Harman, D. K. (1999), Overview of the Eighth Text REtrieval Conference (TREC-8), in E. M. Voorhees & D. K. Harman, eds, "The Eighth Text REtrieval Conference (TREC 8)", National Institute of Standards and Technology Special Publication 500-249, Gaithersburg, MD, pp. 1--23.
 
34
Voorhees, E. M. & Harman, D. K. (2000), Overview of the Ninth Text REtrieval Conference (TREC-9), in E. M. Voorhees & D. K. Harman, eds, "The Ninth Text REtrieval Conference (TREC 9)", National Institute of Standards and Technology Special Publication 500-249, Gaithersburg, MD, pp. 1--14.
 
35
36

CITED BY  12

Collaborative Colleagues:
Bodo Billerbeck: colleagues
Justin Zobel: colleagues