ACM Home Page
Please provide us with feedback. Feedback
Web spam identification through content and hyperlinks
Full text PdfPdf (233 KB)
Source AIRWeb; Vol. 295 archive
Proceedings of the 4th international workshop on Adversarial information retrieval on the web table of contents
Beijing, China
SESSION: General table of contents
Pages 41-44  
Year of Publication: 2008
ISBN:978-1-60558-159-0
Authors
Jacob Abernethy  University of California, Berkeley, CA
Olivier Chapelle  Yahoo! Research, Santa Clara, CA
Carlos Castillo  Yahoo! Research, Barcelona, Spain
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 73,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1451983.1451994
What is a DOI?

ABSTRACT

We present an algorithm, witch, that learns to detect spam hosts or pages on the Web. Unlike most other approaches, it simultaneously exploits the structure of the Web graph as well as page contents and features. The method is efficient, scalable, and provides state-of-the-art accuracy on a standard Web spam benchmark.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Graph Labeling Workshop. http://graphlab.lip6.fr/, 2007.
 
2
Web Spam Challenge. http://webspam.lip6.fr/, 2007.
 
3
J. Abernethy, O. Chapelle, and C. Castillo. WITCH: A new approach to web spam detection. Technical Report 2008--001, Yahoo! Research, 2008.
 
4
M. Belkin, P. Niyogi, and V. Sindhwani. On manifold regularization. In Proceedings of the Tenth International Workshop on Artifical Intelligence and Statistics (AISTATS), 2005.
5
6
7
8
 
9
Z. Gyöngyi and H. Garcia-Molina. Web spam taxonomy. In First International Workshop on Adversarial Information Retrieval on the Web, pages 39--47, Chiba, Japan, 2005.
 
10
11
 
12
V. Krishnan and R. Raj. Web spam detection with anti-trust rank. In ACM SIGIR workshop on Adversarial Information Retrieval on the Web, 2006.
13
 
14
 
15
V. Vapnik. Statistical Learning Theory. John Wiley & Sons Inc, 1998.
16
17


Collaborative Colleagues:
Jacob Abernethy: colleagues
Olivier Chapelle: colleagues
Carlos Castillo: colleagues