ABSTRACT
Query-expansion is an effective Relevance Feedback technique for improving performance in Information Retrieval. In general, query-expansion methods select terms from the complete contents of relevant documents. One problem with this approach is that terms unrelated to document relevance can enter the modified query simply because they occur in the relevant documents and have a favourable distribution in the document collection. Motivated by the hypothesis that query-expansion terms should be sought only from the most relevant areas of a document, this investigation explores the use of document summaries in query-expansion, considering both context-independent standard summaries and query-biased summaries. Experimental results using the Okapi BM25 probabilistic retrieval model with the TREC-8 ad hoc retrieval task show that query-expansion using document summaries can be considerably more effective than full-document expansion. The paper also presents a novel approach to term-selection that separates the choice of relevant documents from the selection of a pool of potential expansion terms. This technique, too, is shown to be more effective than standard methods.
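As an illustration of the general idea only, the following minimal Python sketch draws candidate expansion terms from query-biased summaries rather than from full documents. It is not the paper's implementation: the sentence scoring (counting query-term occurrences), the function names, and the use of a Robertson/Sparck Jones style relevance weight multiplied by r as the term-selection score are all assumptions made for this example, and the counts are clamped so the logarithm stays defined when a term is frequent in the collection but rare in the summaries.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lower-case the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def query_biased_summary(document, query_terms, max_sentences=1):
    """Keep the sentences with the most query-term occurrences (assumed scoring)."""
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", document.strip()) if s]
    scored = []
    for position, sentence in enumerate(sentences):
        overlap = sum(1 for t in tokenize(sentence) if t in query_terms)
        scored.append((-overlap, position, sentence))
    scored.sort()
    # Restore document order among the selected sentences.
    selected = sorted(scored[:max_sentences], key=lambda item: item[1])
    return " ".join(sentence for _, _, sentence in selected)


def relevance_weight(r, n, R, N):
    """Robertson/Sparck Jones style weight with 0.5 smoothing.

    r: feedback summaries containing the term, n: collection documents
    containing it, R: number of feedback documents, N: collection size.
    The max(..., 0) clamps are an assumption of this sketch, needed because
    r is counted on summaries while n is counted on full documents.
    """
    numerator = (r + 0.5) * (max(N - n - R + r, 0) + 0.5)
    denominator = (max(n - r, 0) + 0.5) * (R - r + 0.5)
    return math.log(numerator / denominator)


def expansion_terms(query, feedback_docs, collection, top_k=3, summary_sentences=1):
    """Rank non-query terms found in the summaries of the feedback documents."""
    query_terms = set(tokenize(query))
    N = len(collection)
    R = len(feedback_docs)

    doc_freq = Counter()
    for doc in collection:
        doc_freq.update(set(tokenize(doc)))

    summary_freq = Counter()
    for doc in feedback_docs:
        summary = query_biased_summary(doc, query_terms, summary_sentences)
        summary_freq.update(set(tokenize(summary)))

    scored = []
    for term, r in summary_freq.items():
        if term in query_terms:
            continue
        score = r * relevance_weight(r, doc_freq[term], R, N)
        scored.append((score, term))
    # Highest score first; ties broken alphabetically for determinism.
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [term for _, term in scored[:top_k]]
```

With full-document expansion, any term in a relevant document is a candidate. Here only terms that survive into a summary are scored, which is the restriction the hypothesis above argues for.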