| Social spam detection |
| Full text |
Pdf
(1.13 MB)
|
| Source
|
ACM International Conference Proceeding Series
archive
Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web
table of contents
Madrid, Spain
SESSION: Social spam
table of contents
Pages 41-48
Year of Publication: 2009
ISBN:978-1-60558-438-6
|
|
Authors
|
|
Benjamin Markines
|
Indiana University, Bloomington, Indiana and Institute for Scientific Interchange Foundation, Torino, Italy
|
|
Ciro Cattuto
|
Institute for Scientific Interchange Foundation, Torino, Italy
|
|
Filippo Menczer
|
Indiana University, Bloomington, Indiana and Institute for Scientific Interchange Foundation, Torino, Italy
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 46, Downloads (12 Months): 134, Citation Count: 0
|
|
|
ABSTRACT
The popularity of social bookmarking sites has made them prime targets for spammers. Many of these systems require an administrator's time and energy to manually filter or remove spam. Here we discuss the motivations of social spam, and present a study of automatic detection of spammers in a social tagging system. We identify and analyze six distinct features that address various properties of social spam, finding that each of these features provides for a helpful signal to discriminate spammers from legitimate users. These features are then used in various machine learning algorithms for classification, achieving over 98% accuracy in detecting social spammers with 2% false positives. These promising results provide a new baseline for future efforts on social spam. We make our dataset publicly available to the research community.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
 |
2
|
Fabricio Benevenuto , Tiago Rodrigues , Virgilio Almeida , Jussara Almeida , Chao Zhang , Keith Ross, Identifying video spammers in online social networks, Proceedings of the 4th international workshop on Adversarial information retrieval on the web, April 22-22, 2008, Beijing, China
[doi> 10.1145/1451983.1451996]
|
 |
3
|
|
| |
4
|
|
| |
5
|
|
| |
6
|
Ciro Cattuto_aff3n2 , Christoph Schmitz , Andrea Baldassarri , Vito D. P. Servedio_aff2n3 , Vittorio Loreto_aff2n3 , Andreas Hotho , Miranda Grahl , Gerd Stumme, Network properties of folksonomies, AI Communications, v.20 n.4, p.245-262, December 2007
|
 |
7
|
|
| |
8
|
J. Chevalier and P. Gramme. RANK for spam detection ECML - Discovery Challenge. In Proc. Europ. Conf. on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD), 2008.
|
| |
9
|
A. Gkanogiannis and T. Kalamboukis. A novel supervised learning algorithm and its use for spam detection in social bookmarking systems. In Proc. Europ. Conf. on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD), 2008.
|
| |
10
|
|
| |
11
|
|
| |
12
|
T. Hammond, T. Hannay, B. Lund, and J. Scott. Social Bookmarking Tools (I): A General Review. D-Lib Magazine, 11(4), April 2005.
|
| |
13
|
|
| |
14
|
A. Hotho, R. Jäschke, C. Schmitz, and G. Stumme. Information retrieval in folksonomies: Search and ranking. In The Semantic Web: Research and Applications, vol. 4011 of LNAI, pages 411--426. Springer, 2006.
|
| |
15
|
C. Kim and K.-B. Hwang. Naive bayes classifier learning with feature selection for spam detection in social bookmarking. In Proc. Europ. Conf. on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD), 2008.
|
 |
16
|
Georgia Koutrika , Frans Adjie Effendi , Zoltán Gyöngyi , Paul Heymann , Hector Garcia-Molina, Combating spam in tagging systems, Proceedings of the 3rd international workshop on Adversarial information retrieval on the web, May 08-08, 2007, Banff, Alberta, Canada
[doi> 10.1145/1244408.1244420]
|
 |
17
|
|
| |
18
|
R. Lambiotte and M. Ausloos. Collaborative tagging as a tripartite network. LNCS, 3993:1114, Dec 2005.
|
 |
19
|
Benjamin Markines , Ciro Cattuto , Filippo Menczer , Dominik Benz , Andreas Hotho , Gerd Stumme, Evaluating similarity measures for emergent semantics of social tagging, Proceedings of the 18th international conference on World wide web, April 20-24, 2009, Madrid, Spain
[doi> 10.1145/1526709.1526796]
|
 |
20
|
|
| |
21
|
P. Mika. Ontologies are us: A unified model of social networks and semantics. In Proc. ISWC, vol. 3729 of LNCS, pages 522--536, 2005.
|
| |
22
|
|
| |
23
|
Z. Xu, Y. Fu, J. Mao, and D. Su. Towards the semantic web: Collaborative tag suggestions. In Proc. WWW'06 Collaborative Web Tagging Workshop, 2006.
|
INDEX TERMS
Primary Classification:
H.
Information Systems
H.3
INFORMATION STORAGE AND RETRIEVAL
H.3.5
On-line Information Services
Additional Classification:
K.
Computing Milieux
K.4
COMPUTERS AND SOCIETY
K.4.2
Social Issues
K.6
MANAGEMENT OF COMPUTING AND INFORMATION SYSTEMS
K.6.5
Security and Protection (D.4.6, K.4.2)
General Terms:
Algorithms,
Design,
Experimentation,
Human Factors,
Performance
Keywords:
annotations,
post,
resource,
social spam,
tag,
tag similarity,
web 2.0
|