ACM Home Page
Please provide us with feedback. Feedback
Classifying search queries using the Web as a source of knowledge
Full text PdfPdf (787 KB)
Source
ACM Transactions on the Web (TWEB) archive
Volume 3 ,  Issue 2  (April 2009) table of contents
Article No. 5  
Year of Publication: 2009
ISSN:1559-1131
Authors
Evgeniy Gabrilovich  Yahoo Research, Santa Clara, CA
Andrei Broder  Yahoo Research, Santa Clara, CA
Marcus Fontoura  PUC-Rio, Rio de Janeiro, Brazil
Amruta Joshi  UCLA, Los Angeles, CA
Vanja Josifovski  Yahoo Research, Santa Clara, CA
Lance Riedel  Yahoo Research, Santa Clara, CA
Tong Zhang  Rutgers University, Piscataway, NJ
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 60,   Downloads (12 Months): 357,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1513876.1513877
What is a DOI?

ABSTRACT

We propose a methodology for building a robust query classification system that can identify thousands of query classes, while dealing in real time with the query volume of a commercial Web search engine. We use a pseudo relevance feedback technique: given a query, we determine its topic by classifying the Web search results retrieved by the query. Motivated by the needs of search advertising, we primarily focus on rare queries, which are the hardest from the point of view of machine learning, yet in aggregate account for a considerable fraction of search engine traffic. Empirical evaluation confirms that our methodology yields a considerably higher classification accuracy than previously reported. We believe that the proposed methodology will lead to better matching of online ads to rare queries and overall to a better user experience.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
4
5
6
7
 
8
Duda, R. and Hart, P. 1973. Pattern Classification and Scene Analysis. John Wiley and Sons, New York, NY.
 
9
Efthimiadis, E. and Biron, P. 1994. UCLA-Okapi at TREC-2: Query expansion experiments. In Proceedings of the Text REtrieval Conference (TREC-2). National Institute of Standards and Technology (NIST).
 
10
11
12
 
13
14
 
15
16
 
17
Kowalczyk, P., Zukerman, I., and Niemann, M. 2004. Analyzing the effect of query class on document retrieval performance. In Proceedings of the Australian Conference on Artificial Intelligence. Springer, 550--561.
18
19
 
20
 
21
McCallum, A. and Nigam, K. 1998. A comparison of event models for naive Bayes text classification. In AAAI/ICML Workshop on Learning for Text Categorization. 41--48.
22
 
23
 
24
Robertson, S., Walker, S., Jones, S., Hancock-Beaulieu, M., and Gatford, M. 1995. Okapi at TREC-3. In Proceedings of the Text REtrieval Conference (TREC-3). NIST, Gaithersburg, MD.
 
25
Rocchio, J. 1971. Relevance feedback in information retrieval. In The SMART Retrieval System: Experiments in Automatic Document Processing. Prentice Hall, Englewood Cliffs, NJ, 313--323.
 
26
Sahami, M., Mittal, V., Baluja, S., and Rowley, H. 2004. The happy searcher: Challenges in web information retrieval. In Proceedings of the 8th Pacific Rim International Conference on Artificial Intelligence. Springer-Verlag.
 
27
 
28
Salton, G. and Buckley, C. 1990. Improving retrieval performance by relevance feedback. J. Am. Soc. Inform. Sci. 41, 4, 288--297.
 
29
Santner, T. and Duffy, D. 1989. The Statistical Analysis of Discrete Data. Springer-Verlag.
30
31
32
33
34
 
35
36
 
37
 
38

Collaborative Colleagues:
Evgeniy Gabrilovich: colleagues
Andrei Broder: colleagues
Marcus Fontoura: colleagues
Amruta Joshi: colleagues
Vanja Josifovski: colleagues
Lance Riedel: colleagues
Tong Zhang: colleagues