ACM Home Page
Please provide us with feedback. Feedback
Social spam detection
Full text PdfPdf (1.13 MB)
Source ACM International Conference Proceeding Series archive
Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web table of contents
Madrid, Spain
SESSION: Social spam table of contents
Pages 41-48  
Year of Publication: 2009
ISBN:978-1-60558-438-6
Authors
Benjamin Markines  Indiana University, Bloomington, Indiana and Institute for Scientific Interchange Foundation, Torino, Italy
Ciro Cattuto  Institute for Scientific Interchange Foundation, Torino, Italy
Filippo Menczer  Indiana University, Bloomington, Indiana and Institute for Scientific Interchange Foundation, Torino, Italy
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 46,   Downloads (12 Months): 134,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1531914.1531924
What is a DOI?

ABSTRACT

The popularity of social bookmarking sites has made them prime targets for spammers. Many of these systems require an administrator's time and energy to manually filter or remove spam. Here we discuss the motivations of social spam, and present a study of automatic detection of spammers in a social tagging system. We identify and analyze six distinct features that address various properties of social spam, finding that each of these features provides for a helpful signal to discriminate spammers from legitimate users. These features are then used in various machine learning algorithms for classification, achieving over 98% accuracy in detecting social spammers with 2% false positives. These promising results provide a new baseline for future efforts on social spam. We make our dataset publicly available to the research community.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
3
 
4
 
5
 
6
7
 
8
J. Chevalier and P. Gramme. RANK for spam detection ECML - Discovery Challenge. In Proc. Europ. Conf. on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD), 2008.
 
9
A. Gkanogiannis and T. Kalamboukis. A novel supervised learning algorithm and its use for spam detection in social bookmarking systems. In Proc. Europ. Conf. on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD), 2008.
 
10
 
11
 
12
T. Hammond, T. Hannay, B. Lund, and J. Scott. Social Bookmarking Tools (I): A General Review. D-Lib Magazine, 11(4), April 2005.
 
13
 
14
A. Hotho, R. Jäschke, C. Schmitz, and G. Stumme. Information retrieval in folksonomies: Search and ranking. In The Semantic Web: Research and Applications, vol. 4011 of LNAI, pages 411--426. Springer, 2006.
 
15
C. Kim and K.-B. Hwang. Naive bayes classifier learning with feature selection for spam detection in social bookmarking. In Proc. Europ. Conf. on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD), 2008.
16
17
 
18
R. Lambiotte and M. Ausloos. Collaborative tagging as a tripartite network. LNCS, 3993:1114, Dec 2005.
19
20
 
21
P. Mika. Ontologies are us: A unified model of social networks and semantics. In Proc. ISWC, vol. 3729 of LNCS, pages 522--536, 2005.
 
22
 
23
Z. Xu, Y. Fu, J. Mao, and D. Su. Towards the semantic web: Collaborative tag suggestions. In Proc. WWW'06 Collaborative Web Tagging Workshop, 2006.

Collaborative Colleagues:
Benjamin Markines: colleagues
Ciro Cattuto: colleagues
Filippo Menczer: colleagues