ACM Home Page
Please provide us with feedback. Feedback
Query-log mining for detecting spam
Full text PdfPdf (180 KB)
Source AIRWeb; Vol. 295 archive
Proceedings of the 4th international workshop on Adversarial information retrieval on the web table of contents
Beijing, China
SESSION: Usage analysis table of contents
Pages 17-20  
Year of Publication: 2008
ISBN:978-1-60558-159-0
Authors
Carlos Castillo  Yahoo! Research Labs, Barcelona, Spain
Claudio Corsi  University of Pisa, Italy
Debora Donato  Yahoo! Research Labs, Barcelona, Spain
Paolo Ferragina  University of Pisa, Italy
Aristides Gionis  Yahoo! Research Labs, Barcelona, Spain
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 44,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1451983.1451987
What is a DOI?

ABSTRACT

Every day millions of users search for information on the web via search engines, and provide implicit feedback to the results shown for their queries by clicking or not onto them. This feedback is encoded in the form of a query log that consists of a sequence of search actions, one per user query, each describing the following information: (i) terms composing a query, (ii) documents returned by the search engine, (iii) documents that have been clicked, (iv) the rank of those documents in the list of results, (v) date and time of the search action/click, (vi) an anonymous identifier for each session, and more.

In this work, we investigate the idea of characterizing the documents and the queries belonging to a given query log with the goal of improving algorithms for detecting spam, both at the document level and at the query level.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
3
4
 
5
D. Fetterly. Adversarial information retrieval: The manipulation of web content. ACM Computing Reviews, July 2007.
 
6
} Z. Gyöngyi and H. Garcia-Molina. Web spam taxonomy. In Procs AIRWEb, 39--47, 2005.
7
8
9
10
11
12
 
13
M. Szummer and T. Jaakkola. Partially labeled classification with markov random walks. In Advances in Neural Information Processing Systems, volume 14, 2001.
 
14
15
 
16
17

Collaborative Colleagues:
Carlos Castillo: colleagues
Claudio Corsi: colleagues
Debora Donato: colleagues
Paolo Ferragina: colleagues
Aristides Gionis: colleagues