| Context-aware query classification |
Source
|
Annual ACM Conference on Research and Development in Information Retrieval
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Boston, MA, USA
SESSION: Classification and clustering
Pages 3-10
Year of Publication: 2009
ISBN: 978-1-60558-483-6
|
|
Authors
|
|
Huanhuan Cao
|
University of Science and Technology of China, Hefei, China
|
|
Derek Hao Hu
|
Hong Kong University of Science and Technology, Hong Kong, China
|
|
Dou Shen
|
Microsoft Corporation, Redmond, USA
|
|
Daxin Jiang
|
Microsoft Research Asia, Beijing, China
|
|
Jian-Tao Sun
|
Microsoft Research Asia, Beijing, China
|
|
Enhong Chen
|
University of Science and Technology of China, Hefei, China
|
|
Qiang Yang
|
Hong Kong University of Science and Technology, Hong Kong, China
|
|
| Bibliometrics |
Downloads (6 Weeks): 140, Downloads (12 Months): 417, Citation Count: 0
|
|
|
ABSTRACT
Understanding users' search intent expressed through their search queries is crucial to Web search and online advertisement. Web query classification (QC) has been widely studied for this purpose. Most previous QC algorithms classify individual queries without considering their context information. However, as exemplified by the well-known query "jaguar", many Web queries are short and ambiguous, and their real meanings are uncertain without context information. In this paper, we incorporate context information into the problem of query classification by using conditional random field (CRF) models. In our approach, we use neighboring queries and their corresponding clicked URLs (Web pages) in search sessions as the context information. We perform extensive experiments on real-world search logs and validate the effectiveness and efficiency of our approach. We show that we can improve the F1 score by 52% as compared to other state-of-the-art baselines.
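The idea of labeling a query using its neighbors in a session can be illustrated with a small sketch. The code below is not the authors' model: it only shows Viterbi decoding over a linear chain, the standard inference step for a linear-chain CRF, with hand-picked scores. The label set, the queries, the local scores, and the `transition` function are all hypothetical, and clicked URLs are left out for brevity.

```python
# Toy linear-chain decoding over one search session.
# Every label, query, and score here is a made-up illustration,
# not a value or feature taken from the paper.

LABELS = ["Autos", "Animals"]

# Hypothetical local scores: how well each label fits the query text alone.
session = [
    ("bmw dealers", {"Autos": 2.0, "Animals": -1.0}),
    ("jaguar", {"Autos": 0.1, "Animals": 0.2}),  # ambiguous on its own
]


def transition(prev_label, label):
    """Hypothetical pairwise score: adjacent queries tend to share a topic."""
    return 1.0 if prev_label == label else -1.0


def classify_alone(local):
    """Context-free baseline: pick the label with the best local score."""
    return max(LABELS, key=lambda lab: local[lab])


def viterbi(session):
    """Return the highest-scoring label sequence for the whole session."""
    # best[label] = (score of the best path ending in label, that path)
    best = {lab: (session[0][1][lab], [lab]) for lab in LABELS}
    for _, local in session[1:]:
        new_best = {}
        for lab in LABELS:
            score, path = max(
                ((best[p][0] + transition(p, lab), best[p][1]) for p in LABELS),
                key=lambda t: t[0],
            )
            new_best[lab] = (score + local[lab], path + [lab])
        best = new_best
    return max(best.values(), key=lambda t: t[0])[1]


context_free = [classify_alone(local) for _, local in session]
context_aware = viterbi(session)
print(context_free)
print(context_aware)
```

With these toy scores, the context-free baseline labels "jaguar" as Animals, while decoding the session jointly labels it Autos because the preceding query is about cars. In a trained CRF the local and transition scores would be learned from labeled sessions rather than set by hand.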