ACM Home Page
Please provide us with feedback. Feedback
A large-scale study of automated web search traffic
Full text PdfPdf (849 KB)
Source AIRWeb; Vol. 295 archive
Proceedings of the 4th international workshop on Adversarial information retrieval on the web table of contents
Beijing, China
SESSION: Usage analysis table of contents
Pages 1-8  
Year of Publication: 2008
ISBN:978-1-60558-159-0
Authors
Greg Buehrer  Microsoft Live Labs, One Microsoft Way, Redmond, WA
Jack W. Stokes  Microsoft Research, One Microsoft Way, Redmond, WA
Kumar Chellapilla  Microsoft Live Labs, One Microsoft Way, Redmond, WA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 83,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1451983.1451985
What is a DOI?

ABSTRACT

As web search providers seek to improve both relevance and response times, they are challenged by the ever-increasing tax of automated search query traffic. Third party systems interact with search engines for a variety of reasons, such as monitoring a website's rank, augmenting online games, or possibly to maliciously alter click-through rates. In this paper, we investigate automated traffic in the query stream of a large search engine provider. We define automated traffic as any search query not generated by a human in real time. We first provide examples of different categories of query logs generated by bots. We then develop many different features that distinguish between queries generated by people searching for information, and those generated by automated processes. We categorize these features into two classes, either an interpretation of the physical model of human interactions, or as behavioral patterns of automated interactions. We believe these features formulate a basis for a production-level query stream classifier.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
Click Quality Team. How Fictitious Clicks Occur in Third-Party Click Fraud Audit Reports, Google, Inc, 2006.
 
4
5
6
7
8
 
9
10
 
11
A. Tuzhilin. The Lane's Gifts v. Google Report.
 
12
Weka. http://www.cs.waikato.ac.nz/~ml/weka/
 
13


Collaborative Colleagues:
Greg Buehrer: colleagues
Jack W. Stokes: colleagues
Kumar Chellapilla: colleagues