ABSTRACT
The debate within the Web community over the optimal means of organizing information often pits formalized classifications against distributed collaborative tagging systems. A number of questions about the nature of collaborative tagging systems remain unanswered, however, including whether coherent categorization schemes can emerge from unsupervised tagging by users. This paper uses data from the social bookmarking site del.icio.us to examine the dynamics of collaborative tagging systems. In particular, we examine whether the frequency distribution of tags for "popular" sites with a long history (many tags and many users) can be described by a power law, a distribution often characteristic of what are considered complex systems. We produce a generative model of collaborative tagging in order to understand the basic dynamics behind tagging, including how a power law distribution of tags could arise. We empirically examine the tagging history of sites to determine how this distribution arises over time and what patterns precede a stable distribution. Lastly, by focusing on the high-frequency tags of a site whose tag distribution has stabilized into a power law, we show how tag co-occurrence networks for a sample domain of tags can be used to analyze the meaning of particular tags given their relationship to other tags.
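One way to picture how a power law distribution of tags could arise is a simple imitation process. The sketch below is not the generative model of this paper, whose details the abstract does not give. It assumes a generic "copy an earlier tag or invent a new one" process, and the names `simulate_tagging`, `loglog_slope`, `p_new`, and `top_k` are hypothetical. It simulates tag assignments for a single site and then estimates the slope of the rank-frequency curve on log-log axes; a roughly straight line with negative slope is the informal signature of a power law.

```python
import math
import random
from collections import Counter


def simulate_tagging(n_posts, p_new=0.05, seed=0):
    """Simulate tag assignments for one site (illustrative assumption only).

    With probability p_new the next assignment introduces a brand-new tag;
    otherwise it copies a uniformly chosen earlier assignment, so a tag is
    reused in proportion to how often it has already been used.
    """
    rng = random.Random(seed)
    history = ["tag0"]          # the first assignment always creates a tag
    next_id = 1
    for _ in range(n_posts - 1):
        if rng.random() < p_new:
            history.append(f"tag{next_id}")
            next_id += 1
        else:
            history.append(rng.choice(history))
    return Counter(history)


def loglog_slope(counts, top_k=25):
    """Least-squares slope of log(frequency) against log(rank).

    Only the top_k most frequent tags are used, mirroring the abstract's
    focus on high-frequency tags. Requires at least two distinct tags.
    """
    freqs = sorted(counts.values(), reverse=True)[:top_k]
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(freq) for freq in freqs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


counts = simulate_tagging(20000)
print("distinct tags:", len(counts))
print("log-log slope of top tags:", loglog_slope(counts))
```

A least-squares fit on log-log axes is only a rough diagnostic, chosen here for brevity; a careful analysis would use a proper estimator and goodness-of-fit test before claiming a power law.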
CITED BY 37
Ralf Schenkel, Tom Crecelius, Mouna Kacimi, Sebastian Michel, Thomas Neumann, Josiane X. Parreira, Gerhard Weikum, Efficient top-k querying over social-tagging networks, Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, July 20-24, 2008, Singapore, Singapore
Markus Heckner, Tanja Neubauer, Christian Wolff, Tree, funny, to_read, google: what are tags supposed to achieve? a comparative analysis of user keywords for different digital resource types, Proceeding of the 2008 ACM workshop on Search in social media, October 30-30, 2008, Napa Valley, California, USA
Kerstin Bischoff, Claudiu S. Firan, Wolfgang Nejdl, Raluca Paiu, Can all tags be used for search?, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
Sheila Kinsella, Adriana Budura, Gleb Skobeltsyn, Sebastian Michel, John G. Breslin, Karl Aberer, From Web 1.0 to Web 2.0 and back -: how did your grandma use to tag?, Proceeding of the 10th ACM workshop on Web information and data management, October 30-30, 2008, Napa Valley, California, USA
Tae-Gil Noh, Seong-Bae Park, Hee-Geun Yoon, Sang-Jo Lee, Se-Young Park, An automatic translation of tags for multimedia contents using folksonomy networks, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, July 19-23, 2009, Boston, MA, USA
Ziyu Guan, Jiajun Bu, Qiaozhu Mei, Chun Chen, Can Wang, Personalized tag recommendation using graph-based ranking on multi-type interrelated objects, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, July 19-23, 2009, Boston, MA, USA
Kerstin Bischoff, Claudiu S. Firan, Wolfgang Nejdl, Raluca Paiu, How do you feel about "dancing queen"?: deriving mood & theme annotations from user tags, Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries, June 15-19, 2009, Austin, TX, USA
Fabian Abel, Matteo Baldoni, Cristina Baroglio, Nicola Henze, Daniel Krause, Viviana Patti, Context-based ranking in folksonomies, Proceedings of the 20th ACM conference on Hypertext and hypermedia, June 29-July 01, 2009, Torino, Italy
Helen Ashman, Michael Antunovic, Christoph Donner, Rebecca Frith, Eric Rebelos, Jan-Felix Schmakeit, Gavin Smith, Mark Truran, Are Clickthroughs Useful for Image Labelling?, Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, p.191-197, September 15-18, 2009