|
ABSTRACT
Query expansion methods have been studied for a long time - with debatable success in many instances. In this paper we present a probabilistic query expansion model based on a similarity thesaurus which was constructed automatically. A similarity thesaurus reflects domain knowledge about the particular collection from which it is constructed. We address the two important issues with query expansion: the selection and the weighting of additional search terms. In contrast to earlier methods, our queries are expanded by adding those terms that are most similar to the concept of the query, rather than selecting terms that are similar to the query terms. Our experiments show that this kind of query expansion results in a notable improvement in the retrieval effectiveness when measured using both recall-precision and usefulness.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
Cro 90
|
|
 |
Cro 92
|
|
| |
Dee 90
|
Deerwester, S., Dumais, S.T., Fumas, G.W., Landauer, T.K., Harshman R., Indexing by latent semantic analysis, J. of the ASIS, 41 (6): 391-407, 1990.
|
| |
Ekm 92
|
|
| |
Fre 91
|
|
 |
Gre 92
|
|
| |
Han 92
|
|
 |
Har92
|
|
| |
Les 69
|
Lesk, M.E., Word-word association in document retrieval systems, American Documentation, 20(1): 27-38, 1969.
|
| |
Min 72
|
Minker, J., Wilson, G.A., Zirnmerman, B.I-I., An evaluation of query expansion by the addition of clustered terms for a document retrieval system, Information Storage and Retrieval, 8(6): 329-48, 1972.
|
| |
Pea 91
|
Peat, H.J., Willett, P., The limitations of term co-occurrence data for query expansion in document retrieval systems, J. of the ASIS, 42(5): 378-83, 1991.
|
| |
Qiu 92
|
Qiu, Y., ISIR: an integrated system for information retrieval, Proc. 14th 1R Colloquium, British Computer Society, Lancaster, 1992.
|
| |
Rug 92
|
|
| |
Sal 71
|
Salton, G., Experiments in automatic thesaurus construction for information retrieval, Information Processing 71, 1: 115-123, 1971.
|
| |
Sal 75
|
Salton, G., Yang, C.S., Yu, C.T., A theory of term importance in automatic text analysis, J. of the ASIS, 26(1): 33-44, 1975.
|
| |
Sal 80
|
Salton, G., Automatic term class construction using relevance-a summary of work in automatic pseudoclassification, Information Processing & Management, 16(1): 1-15, 1980.
|
| |
Sal 88
|
|
| |
Sal 90
|
Salton, G., Buckley, C.: Improving Retrieval Performance by Relevance Feedback. J. of the ASIS, 41(4): 288-297, 1990.
|
| |
Sch 92
|
Sch~iuble, P., Knaus, D., The various roles of information structures, 16. Jahrestagung der Gesellschaft ffir Klassifikation, Dortmund, 1992.
|
| |
Sme 83
|
Smeaton, A.F., van Rijsbergen, C.J., The retrieval effects of query expansion on a feedback document retrieval system, The Computer Journal, 26(3): 239-46, 1983.
|
| |
Spa 71
|
Sparck-Jones, K., Barber, E.B., What makes an automatic keyword classification effective? J. of the ASIS, 18." 166-175, 1971.
|
 |
Spa 91
|
|
| |
Wad 88
|
|
 |
Won 87
|
|
CITED BY 111
|
|
|
|
|
Rila Mandala , Takenobu Tokunaga , Hozumi Tanaka, Combining multiple evidence from different types of thesaurus for query expansion, Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, p.191-197, August 15-19, 1999, Berkeley, California, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Makoto Nakashima , Keizo Sato , Yanhua Qu , Tetsuro Ito, Browsing-based conceptual information retrieval incorporating dictionary term relations, keyword association, and a user's interest, Journal of the American Society for Information Science and Technology, v.54 n.1, p.16-28, January 2003
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jing Bai , Dawei Song , Peter Bruza , Jian-Yun Nie , Guihong Cao, Query expansion using term relationships in language models for information retrieval, Proceedings of the 14th ACM international conference on Information and knowledge management, October 31-November 05, 2005, Bremen, Germany
|
|
|
|
|
|
Giannis Varelas , Epimenidis Voutsakis , Paraskevi Raftopoulou , Euripides G.M. Petrakis , Evangelos E. Milios, Semantic similarity methods in wordNet and their application to information retrieval on the web, Proceedings of the 7th annual ACM international workshop on Web information and data management, November 04-04, 2005, Bremen, Germany
|
|
|
|
|
|
|
|
|
|
|
|
Hang Cui , Ji-Rong Wen , Jian-Yun Nie , Wei-Ying Ma, Probabilistic query expansion using query logs, Proceedings of the 11th international conference on World Wide Web, May 07-11, 2002, Honolulu, Hawaii, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Dmitri Roussinov , Leon J. Zhao , Weiguo Fan, Mining context specific similarity relationships using the world wide web, Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, p.499-506, October 06-08, 2005, Vancouver, British Columbia, Canada
|
|
|
Min Song , Il Yeol Song , Robert B. Allen , Zoran Obradovic, Keyphrase extraction-based query expansion in digital libraries, Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries, June 11-15, 2006, Chapel Hill, NC, USA
|
|
|
|
|
|
|
|
|
|
|
|
Hemant Joshi , Shinya Ito , Santhosh Kanala , Sangeetha Hebbar , Coskun Bayrak, Concept set extraction with user session context, Proceedings of the 45th annual southeast regional conference, March 23-24, 2007, Winston-Salem, North Carolina
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Patrick Ruch , Imad Tbahriti , Julien Gobeill , Alan R. Aronson, Argumentative feedback: a linguistically-motivated term expansion for information retrieval, Proceedings of the COLING/ACL on Main conference poster sessions, p.675-682, July 17-18, 2006, Sydney, Australia
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Anthony Ventresque , Sylvie Cazalens , Philippe Lamarre , Patrick Valduriez, Dealing with P2P semantic heterogeneity through query expansion and interpretation, Proceedings of the 2008 international workshop on Data management in peer-to-peer systems, p.3-10, March 25-25, 2008, Nantes, France
|
|
|
Apostol (Paul) Natsev , Alexander Haubold , Jelena Tešić , Lexing Xie , Rong Yan, Semantic concept-based query expansion and re-ranking for multimedia retrieval, Proceedings of the 15th international conference on Multimedia, September 25-29, 2007, Augsburg, Germany
|
|
|
|
|
|
|
|
|
|
|
|
Marcos Antonio Vaz Salles , Jens-Peter Dittrich , Shant Kirakos Karakashian , Olivier René Girard , Lukas Blunschi, iTrails: pay-as-you-go information integration in dataspaces, Proceedings of the 33rd international conference on Very large data bases, September 23-27, 2007, Vienna, Austria
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|