ACM Home Page
Please provide us with feedback. Feedback
Click chain model in web search
Full text PdfPdf (1.20 MB)
Source
International World Wide Web Conference archive
Proceedings of the 18th international conference on World wide web table of contents
Madrid, Spain
SESSION: Data mining/session: click models table of contents
Pages 11-20  
Year of Publication: 2009
ISBN:978-1-60558-487-4
Authors
Fan Guo  Carnegie Mellon University, Pittsburgh, PA, USA
Chao Liu  Microsoft Research Redmond, Redmond, WA, USA
Anitha Kannan  Microsoft Research Search Labs, Mountain View, CA, USA
Tom Minka  Microsoft Research Cambridge, Cambridge, United Kingdom
Michael Taylor  Microsoft Research Cambridge, Cambridge, United Kingdom
Yi-Min Wang  Microsoft Research Redmond, Redmond, WA, USA
Christos Faloutsos  Carnegie Mellon University, Pittsburgh, PA, USA
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 46,   Downloads (12 Months): 208,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1526709.1526712
What is a DOI?

ABSTRACT

Given a terabyte click log, can we build an efficient and effective click model? It is commonly believed that web search click logs are a gold mine for search business, because they reflect users' preference over web documents presented by the search engine. Click models provide a principled approach to inferring user-perceived relevance of web documents, which can be leveraged in numerous applications in search businesses. Due to the huge volume of click data, scalability is a must.

We present the click chain model (CCM), which is based on a solid, Bayesian framework. It is both scalable and incremental, perfectly meeting the computational challenges imposed by the voluminous click logs that constantly grow. We conduct an extensive experimental study on a data set containing 8.8 million query sessions obtained in July 2008 from a commercial search engine. CCM consistently outperforms two state-of-the-art competitors in a number of metrics, with over 9.7% better log-likelihood, over 6.2% better click perplexity and much more robust (up to 30%) prediction of the first and the last clicked position.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
3
4
5
6
7
8
9
10
 
11
12
 
13
14
15
16
17

Collaborative Colleagues:
Fan Guo: colleagues
Chao Liu: colleagues
Anitha Kannan: colleagues
Tom Minka: colleagues
Michael Taylor: colleagues
Yi-Min Wang: colleagues
Christos Faloutsos: colleagues