ACM Home Page
Please provide us with feedback. Feedback
Varying approaches to topical web query classification
Full text PdfPdf (260 KB)
Source
Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Amsterdam, The Netherlands
POSTER SESSION: Posters table of contents
Pages: 783 - 784  
Year of Publication: 2007
ISBN:978-1-59593-597-7
Authors
Steven M. Beitzel  Telcordia Technologies, Piscataway, NJ
Eric C. Jensen  Illinois Institute of Technology, Chicago, IL
Abdur Chowdhury  Illinois Institute of Technology, Chicago, IL
Ophir Frieder  Illinois Institute of Technology, Chicago, IL
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 14,   Downloads (12 Months): 114,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1277741.1277907
What is a DOI?

ABSTRACT

Topical classification of web queries has drawn recent interest because of the promise it offers in improving retrieval effectiveness and efficiency. However, much of this promise depends on whether classification is performed before or after the query is used to retrieve documents. We examine two previously unaddressed issues in query classification: pre versus post-retrieval classification effectiveness and the effect of training explicitly from classified queries versus bridging a classifier trained using a document taxonomy. Bridging classifiers map the categories of a document taxonomy onto those of a query classification problem to provide sufficient training data. We find that training classifiers explicitly from manually classified queries outperforms the bridged classifier by 48% in F1 score. Also, a pre-retrieval classifier using only the query terms performs merely 11% worse than the bridged classifier which requires snippets from retrieved documents.




Collaborative Colleagues:
Steven M. Beitzel: colleagues
Eric C. Jensen: colleagues
Abdur Chowdhury: colleagues
Ophir Frieder: colleagues