| Classifying search engine queries using the web as background knowledge |
| Full text |
Pdf
(160 KB)
|
| Source
|
ACM SIGKDD Explorations Newsletter
archive
Volume 7 , Issue 2 (December 2005)
table of contents
Pages: 117 - 122
Year of Publication: 2005
ISSN:1931-0145
|
|
Authors
|
|
David Vogel
|
A.I. Insight, Inc., Orlando, Florida
|
|
Steffen Bickel
|
Humboldt-Universität zu Berlin, Berlin, Germany
|
|
Peter Haider
|
Humboldt-Universität zu Berlin, Berlin, Germany
|
|
Rolf Schimpfky
|
Humboldt-Universität zu Berlin, Berlin, Germany
|
|
Peter Siemen
|
Humboldt-Universität zu Berlin, Berlin, Germany
|
|
Steve Bridges
|
MEDai, Inc., Orlando, Florida
|
|
Tobias Scheffer
|
Humboldt-Universität zu Berlin, Berlin, Germany
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 6, Downloads (12 Months): 99, Citation Count: 8
|
|
|
ABSTRACT
The performance of search engines crucially depends on their ability to capture the meaning of a query most likely intended by the user. We study the problem of mapping a search engine query to those nodes of a given subject taxonomy that characterize its most likely meanings. We describe the architecture of a classification system that uses a web directory to identify the subject context that the query terms are frequently used in. Based on its performance on the classification of 800,000 example queries recorded from MSN search, the system received the Runner-Up Award for Query Categorization Performance of the KDD Cup 2005.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
 |
2
|
|
 |
3
|
|
| |
4
|
P. Bruza and S. Dennis. Query reformulation on the internet: Empirical data and the hyperindex search engine. Proceedings of the Conference on Computer-Assisted Information Searching on Internet, 1997.
|
 |
5
|
Douglass R. Cutting , David R. Karger , Jan O. Pedersen , John W. Tukey, Scatter/Gather: a cluster-based approach to browsing large document collections, Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval, p.318-329, June 21-24, 1992, Copenhagen, Denmark
[doi> 10.1145/133160.133214]
|
| |
6
|
|
| |
7
|
A. Doan, J. Madhavan, P. Domingos, and A. Halevy. Ontology matching: A machine learning approach. In S. Staab and R. Studer, editors, Handbook on Ontologies in Information Systems. Springer-Verlag, 2003.
|
 |
8
|
|
| |
9
|
J. Gonzalo, F. Verdejo, I. Chugur, and J. Cigarran. Indexing with wordnet synsets can improve text retrieval. In Proceedings of the Annual Meeting of the Association for Computational Linguistics and International Conference on Computational Linguistics, 1998.
|
| |
10
|
D. Hosmer and S. Lemeshow. Applied logistic regression. New York, Wiley, 1989.
|
| |
11
|
D. W. J. Stefanowski. Carrot2 and language properties in web search results clustering. In Proceedings of the International Atlantic Web Intelligence Conference, 2003.
|
| |
12
|
|
| |
13
|
H. Schutze and J. Pederson. Information retrieval based on word senses. In Proceedings of the Annual Symposium on Document Analysis and Information Retrieval, 1994.
|
 |
14
|
|
| |
15
|
M. K. Y. Wang. C4-2: Combining link and contents in clustering web search results to improve information interpretation. In Proceedings of the International Conference on Database and Expert Systems Applications, 2002.
|
| |
16
|
|
| |
17
|
D. Zhang and W. Lee. Learning to integrate web taxonomies. Web Semantics, 2(2):131--151, 2004.
|
CITED BY 8
|
|
Dou Shen , Jian-Tao Sun , Qiang Yang , Zheng Chen, Building bridges for web query classification, Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, August 06-11, 2006, Seattle, Washington, USA
|
|
|
Dou Shen , Rong Pan , Jian-Tao Sun , Jeffrey Junfeng Pan , Kangheng Wu , Jie Yin , Qiang Yang, Query enrichment for web-query classification, ACM Transactions on Information Systems (TOIS), v.24 n.3, p.320-352, July 2006
|
|
|
|
|
|
Andrei Z. Broder , Marcus Fontoura , Evgeniy Gabrilovich , Amruta Joshi , Vanja Josifovski , Tong Zhang, Robust classification of rare queries using web knowledge, Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, July 23-27, 2007, Amsterdam, The Netherlands
|
|
|
Andrei Z. Broder , Peter Ciccolo , Marcus Fontoura , Evgeniy Gabrilovich , Vanja Josifovski , Lance Riedel, Search advertising using web relevance feedback, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
|
|
|
Jian Hu , Gang Wang , Fred Lochovsky , Jian-tao Sun , Zheng Chen, Understanding user's query intent with wikipedia, Proceedings of the 18th international conference on World wide web, April 20-24, 2009, Madrid, Spain
|
|
|
Evgeniy Gabrilovich , Andrei Broder , Marcus Fontoura , Amruta Joshi , Vanja Josifovski , Lance Riedel , Tong Zhang, Classifying search queries using the Web as a source of knowledge, ACM Transactions on the Web (TWEB), v.3 n.2, p.1-28, April 2009
|
|
|
|
|