| Identifying video spammers in online social networks |
| Full text |
Pdf
(9.56 MB)
|
| Source
|
AIRWeb; Vol. 295
archive
Proceedings of the 4th international workshop on Adversarial information retrieval on the web
table of contents
Beijing, China
SESSION: Social networks
table of contents
Pages 45-52
Year of Publication: 2008
ISBN:978-1-60558-159-0
|
|
Authors
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 19, Downloads (12 Months): 119, Citation Count: 4
|
|
|
ABSTRACT
In many video social networks, including YouTube, users are permitted to post video responses to other users' videos. Such a response can be legitimate or can be a video response spam, which is a video response whose content is not related to the topic being discussed. Malicious users may post video response spam for several reasons, including increase the popularity of a video, marketing advertisements, distribute pornography, or simply pollute the system. In this paper we consider the problem of detecting video spammers. We first construct a large test collection of YouTube users, and manually classify them as either legitimate users or spammers. We then devise a number of attributes of video users and their social behavior which could potentially be used to detect spammers. Employing these attributes, we apply machine learning to provide a heuristic for classifying an arbitrary video as either legitimate or spam. The machine learning algorithm is trained with our test collection. We then show that our approach succeeds at detecting much of the spam while only falsely classifying a small percentage of the legitimate videos as spam. Our results highlight the most important attributes for video response spam detection.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Alexa. http://www.alexa.com.
|
| |
2
|
|
| |
3
|
|
| |
4
|
|
 |
5
|
Carlos Castillo , Debora Donato , Aristides Gionis , Vanessa Murdock , Fabrizio Silvestri, Know your neighbors: web spam detection using the web topology, Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, July 23-27, 2007, Amsterdam, The Netherlands
[doi> 10.1145/1277741.1277814]
|
 |
6
|
Meeyoung Cha , Haewoon Kwak , Pablo Rodriguez , Yong-Yeol Ahn , Sue Moon, I tube, you tube, everybody tubes: analyzing the world's largest user generated content video system, Proceedings of the 7th ACM SIGCOMM conference on Internet measurement, October 24-26, 2007, San Diego, California, USA
[doi> 10.1145/1298306.1298309]
|
| |
7
|
|
| |
8
|
|
 |
9
|
Dennis Fetterly , Mark Manasse , Marc Najork, Spam, damn spam, and statistics: using statistical analysis to locate spam web pages, Proceedings of the 7th International Workshop on the Web and Databases: colocated with ACM SIGMOD/PODS 2004, June 17-18, 2004, Paris, France
[doi> 10.1145/1017074.1017077]
|
 |
10
|
Phillipa Gill , Martin Arlitt , Zongpeng Li , Anirban Mahanti, Youtube traffic characterization: a view from the edge, Proceedings of the 7th ACM SIGCOMM conference on Internet measurement, October 24-26, 2007, San Diego, California, USA
[doi> 10.1145/1298306.1298310]
|
| |
11
|
Luiz Henrique Gomes , Cristiano Cazita , Jussara M. Almeida , Virgílio Almeida , Wagner Meira, Jr., Workload models of spam and legitimate e-mails, Performance Evaluation, v.64 n.7-8, p.690-714, August, 2007
[doi> 10.1016/j.peva.2006.11.001]
|
| |
12
|
Luiz H. Gomes , Fernando D. O. Castro , Virgílio A. F. Almeida , Jussara M. Almeida , Rodrigo B. Almeida , Luis M. A. Bettencourt, Improving spam detection based on structural similarity, Proceedings of the Steps to Reducing Unwanted Traffic on the Internet on Steps to Reducing Unwanted Traffic on the Internet Workshop, p.12-12, July 07, 2005, Cambridge, MA
|
| |
13
|
|
| |
14
|
|
| |
15
|
R. Jain. The Art of Computer Systems Performance Analysis: Techniques for Experimental Design, Measurement, Simulation, and Modeling. John Wiley and Sons, INC, 1991.
|
 |
16
|
|
 |
17
|
Georgia Koutrika , Frans Adjie Effendi , Zoltán Gyöngyi , Paul Heymann , Hector Garcia-Molina, Combating spam in tagging systems, Proceedings of the 3rd international workshop on Adversarial information retrieval on the web, May 08-08, 2007, Banff, Alberta, Canada
[doi> 10.1145/1244408.1244420]
|
 |
18
|
|
| |
19
|
A. Thomason. Blog spam: A review. In Proc. of Conf. on Email and Anti-Spam (CEAS), 2007.
|
| |
20
|
|
| |
21
|
|
| |
22
|
C. Wu, K. Cheng, Q. Zhu, and Y. Wu. Using visual features for anti-spam filtering. In Proc. of IEEE Int'l Conf. on Image Processing (ICIP), 2005.
|
| |
23
|
|
 |
24
|
|
CITED BY 4
|
|
Fabricio Benevenuto , Fernando Duarte , Tiago Rodrigues , Virgilio A.F. Almeida , Jussara M. Almeida , Keith W. Ross, Understanding video interactions in youtube, Proceeding of the 16th ACM international conference on Multimedia, October 26-31, 2008, Vancouver, British Columbia, Canada
|
|
|
Fabrício Benevenuto , Tiago Rodrigues , Virgílio Almeida , Jussara Almeida , Marcos Gonçalves, Detecting spammers and content promoters in online video social networks, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, July 19-23, 2009, Boston, MA, USA
|
|
|
Daniel Hasan Dalip , Marcos André Gonçalves , Marco Cristo , Pável Calado, Automatic quality assessment of content created collaboratively by web communities: a case study of wikipedia, Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries, June 15-19, 2009, Austin, TX, USA
|
|
|
|
|