ACM Home Page
Please provide us with feedback. Feedback
Combating spam in tagging systems: An evaluation
Full text PdfPdf (5.62 MB)
Source
ACM Transactions on the Web (TWEB) archive
Volume 2 ,  Issue 4  (October 2008) table of contents
Article No. 22  
Year of Publication: 2008
ISSN:1559-1131
Authors
Georgia Koutrika  Stanford University, Stanford, CA
Frans Adjie Effendi  Stanford University, Stanford, CA
Zolt´n Gyöngyi  Stanford University, Stanford, CA
Paul Heymann  Stanford University, Stanford, CA
Hector Garcia-Molina  Stanford University, Stanford, CA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 42,   Downloads (12 Months): 480,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1409220.1409225
What is a DOI?

ABSTRACT

Tagging systems allow users to interactively annotate a pool of shared resources using descriptive strings called tags. Tags are used to guide users to interesting resources and help them build communities that share their expertise and resources. As tagging systems are gaining in popularity, they become more susceptible to tag spam: misleading tags that are generated in order to increase the visibility of some resources or simply to confuse users. Our goal is to understand this problem better. In particular, we are interested in answers to questions such as: How many malicious users can a tagging system tolerate before results significantly degrade? What types of tagging systems are more vulnerable to malicious attacks? What would be the effort and the impact of employing a trusted moderator to find bad postings? Can a system automatically protect itself from spam, for instance, by exploiting user tag patterns? In a quest for answers to these questions, we introduce a framework for modeling tagging systems and user tagging behavior. We also describe a method for ranking documents matching a tag based on taggers' reliability. Using our framework, we study the behavior of existing approaches under malicious attacks and the impact of a moderator and our ranking method.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
3spots. http://3spots.blogspot.com/2006/01/all-social-that-can-bookmark.html.
 
2
Adlam, T. 2006. Tag and ping phenomenon. http://www.optiniche.com/blog/174/tag-and-ping/.
3
 
4
CiteULike. http://www.citeulike.org/.
 
5
Control, N. http://asp.net/ajax/control-toolkit/live/NoBot/NoBot.aspx.
 
6
del.icio.us. http://del.icio.us/.
 
7
Diigo. http://www.diigo.com/.
 
8
EbiquityBlogger. 2007 http://ebiquity.umbc.edu/blogger/2007/01/24/tag-spam-on-the-rise.
 
9
Farrell, S. and Lau, T. 2006. Fringe contacts: people tagging for the enterprise. In Proceedings of the Collaborative Web Tagging Workshop in conjunction with the 15th International Conference on the World Wide Web.
 
10
Flickr. url: http://www.flickr.com/.
 
11
12
 
13
 
14
Gyöngyi, Z. and Garcia-Molina, H. 2005. Web spam taxonomy. In Proceedings of the 1st International Workshop on Adversarial Information Retrieval on the Web. 39--47.
 
15
 
16
Henzinger, M. 2000. Link analysis in web information retrieval. IEEE Data Eng. Bull. 23, 3, 3--8.
 
17
John, A. and Seligmann, D. 2006. Collaborative tagging and expertise in the enterprise. In Proceedings of the Collaborative Web Tagging Workshop in conjunction with the 15th International Conference on the World Wide Web.
 
18
Jots. http://www.jots.com/.
19
20
21
 
22
Mathes, A. 2004. Folksonomies—cooperative classification and communication through shared metadata. Computer Mediated Communication, LIS590CMC (Doctoral Seminar), Graduate School of Library and Information Science, University of Illinois Urbana-Champaign.
 
23
Merholz, P. 2004. Metadata for the masses. http://www.adaptivepath.com/ideas/essays/archives/000361.php.
24
 
25
MyWeb. http://myweb2.search.yahoo.com/.
 
26
Ohkura, T., Kiyota, Y., and Nakagawa, H. 2006. Browsing system for weblog articles based on automated folksonomy. In Proceedings of the 15th International Conference on the World Wide Web.
 
27
Rawsugar. http://rawsugar.com/.
 
28
RealTravel, X. http://realtravel.com/.
 
29
Schmitz, P. 2006. Inducing ontology from flickr tags. In Proceedings of the Collaborative Web Tagging Workshop in conjunction with the 15th International Conference on the World Wide Web.
30
 
31
Slideshare. http://slideshare.net/.
 
32
Technorati. http://www.technorati.com/.
 
33
Wasserman, S. and Faust, K. 1994. Social Network Analysis: Methods and Applications. Cambridge University Press, Cambridge, UK.
34
 
35
Xu, Z., Fu, Y., Mao, J., and Su, D. 2006. Towards the semantic web: collaborative tag suggestions. In Proceedings of the Collaborative Web Tagging Workshop in 15th International Conference on the World Wide Web.
 
36
YouTube. http://www.youtube.com/.

Collaborative Colleagues:
Georgia Koutrika: colleagues
Frans Adjie Effendi: colleagues
Zolt´n Gyöngyi: colleagues
Paul Heymann: colleagues
Hector Garcia-Molina: colleagues