ACM Home Page
Please provide us with feedback. Feedback
Determining the user intent of web search engine queries
Full text PdfPdf (195 KB)
Source
International World Wide Web Conference archive
Proceedings of the 16th international conference on World Wide Web table of contents
Banff, Alberta, Canada
POSTER SESSION: Search table of contents
Pages: 1149 - 1150  
Year of Publication: 2007
ISBN:978-1-59593-654-7
Authors
Bernard J. Jansen  Pennsylvania State University
Danielle L. Booth  Pennsylvania State University
Amanda Spink  Queensland University of Technology
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 23,   Downloads (12 Months): 226,   Citation Count: 8
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1242572.1242739
What is a DOI?

ABSTRACT

Determining the user intent of Web searches is a difficult problem due to the sparse data available concerning the searcher. In this paper, we examine a method to determine the user intent underlying Web search engine queries. We qualitatively analyze samples of queries from seven transaction logs from three different Web search engines containing more than five million queries. From this analysis, we identified characteristics of user queries based on three broad classifications of user intent. The classifications of informational, navigational, and transactional represent the type of content destination the searcher desired as expressed by their query. We implemented our classification algorithm and automatically classified a separate Web search engine transaction log of over a million queries submitted by several hundred thousand users. Our findings show that more than 80% of Web queries are informational in nature, with about 10% each being navigational and transactional. In order to validate the accuracy of our algorithm, we manually coded 400 queries and compared the classification to the results from our algorithm. This comparison showed that our automatic classification has an accuracy of 74%. Of the remaining 25% of the queries, the user intent is generally vague or multi-faceted, pointing to the need to for probabilistic classification. We illustrate how knowledge of searcher intent might be used to enhance future Web search engines.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Baeza-Yates, R., Calderón-Benavides, L. and González-Caro, C. 2006. The Intention Behind Web Queries. In Proceedings of STRING PROCESSING AND INFORMATION RETRIEVAL (SPIRE 2006). Glasgow, Scotland, 98--109.
2
 
3
 
4
 
5
6
7

CITED BY  8

Collaborative Colleagues:
Bernard J. Jansen: colleagues
Danielle L. Booth: colleagues
Amanda Spink: colleagues