ACM Home Page
Please provide us with feedback. Feedback
An experimental comparison of click position-bias models
Full text PdfPdf (4.51 MB)
Source
Web Search and Web Data Mining archive
Proceedings of the international conference on Web search and web data mining table of contents
Palo Alto, California, USA
SESSION: Ranking table of contents
Pages 87-94  
Year of Publication: 2008
ISBN:978-1-59593-927-9
Authors
Nick Craswell  Microsoft Research, Cambridge UK
Onno Zoeter  Microsoft Research, Cambridge UK
Michael Taylor  Microsoft Research, Cambridge UK
Bill Ramsey  Microsoft Research, Redmond USA
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 47,   Downloads (12 Months): 286,   Citation Count: 14
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1341531.1341545
What is a DOI?

ABSTRACT

Search engine click logs provide an invaluable source of relevance information, but this information is biased. A key source of bias is presentation order: the probability of click is influenced by a document's position in the results page. This paper focuses on explaining that bias, modelling how probability of click depends on position. We propose four simple hypotheses about how position bias might arise. We carry out a large data-gathering effort, where we perturb the ranking of a major search engine, to see how clicks are affected. We then explore which of the four hypotheses best explains the real-world position effects, and compare these to a simple logistic regression model. The data are not well explained by simple position models, where some users click indiscriminately on rank 1 or there is a simple decay of attention over ranks. A ‘cascade' model, where users view results from top to bottom and leave as soon as they see a worthwhile document, is our best explanation for position bias in early ranks


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
Georges Dupret, Vanessa Murdock, and Benjamin Piwowarski. Web search engine evaluation using click-through data and a user model. In Proceedings of the Workshop on Query Log Analysis (WWW)2007.
 
4
Georges Dupret, Benjamin Piwowarski, Carlos A. Hurtado, and Marcelo Mendoza. A statistical model of query log generation. In String Processing and Information Retrieval, 13th International Conference, SPIRE 2006 pages 217--228, 2006.
5
6
 
7
 
8
F. Radlinski and T. Joachims. Minimally invasive randomization for collecting unbiased preferences from clickthrough logs. In Conference of the Association for the Advancement of Artificial Intelligence (AAAI) pages 1406--1412, 2006.
9

CITED BY  14

Collaborative Colleagues:
Nick Craswell: colleagues
Onno Zoeter: colleagues
Michael Taylor: colleagues
Bill Ramsey: colleagues