ACM Home Page
Please provide us with feedback. Feedback
Evaluating similarity measures for emergent semantics of social tagging
Full text PdfPdf (978 KB)
Source
International World Wide Web Conference archive
Proceedings of the 18th international conference on World wide web table of contents
Madrid, Spain
SESSION: Semantic/data web/session: mining for semantics table of contents
Pages 641-650  
Year of Publication: 2009
ISBN:978-1-60558-487-4
Authors
Benjamin Markines  Indiana University, Bloomington, IN, USA
Ciro Cattuto  ISI Foundation, Turin, Italy
Filippo Menczer  Indiana University, Bloomington, IN, USA
Dominik Benz  University of Kassel, Kassel, Germany
Andreas Hotho  University of Kassel, Kassel, Germany
Gerd Stumme  University of Kassel, Kassel, Germany
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 82,   Downloads (12 Months): 306,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1526709.1526796
What is a DOI?

ABSTRACT

Social bookmarking systems are becoming increasingly important data sources for bootstrapping and maintaining Semantic Web applications. Their emergent information structures have become known as folksonomies. A key question for harvesting semantics from these systems is how to extend and adapt traditional notions of similarity to folksonomies, and which measures are best suited for applications such as community detection, navigation support, semantic search, user profiling and ontology learning. Here we build an evaluation framework to compare various general folksonomy-based similarity measures, which are derived from several established information-theoretic, statistical, and practical measures. Our framework deals generally and symmetrically with users, tags, and resources. For evaluation purposes we focus on similarity between tags and between resources and consider different methods to aggregate annotations across users. After comparing the ability of several tag similarity measures to predict user-created tag relations, we provide an external grounding by user-validated semantic proxies based on WordNet and the Open Directory Project. We also investigate the issue of scalability. We find that mutual information with distributional micro-aggregation across users yields the highest accuracy, but is not scalable; per-user projection with collaborative aggregation provides the best scalable approach via incremental computations. The results are consistent across resource and tag similarity.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
M. Aurnhammer, P. Hanappe, and S. L. Integrating collaborative tagging and emergent semantics for image retrieval. In Proc. WWW Collaborative Web Tagging Workshop, 2006.
2
 
3
J. Bar-Ilan, S. Shoham, A. Idan, Y. Miller, and A. Shachak. Structured vs. unstructured tagging -- a case study. In Proc. WWW Collaborative Web Tagging Workshop, 2006.
 
4
G. Begelman, P. Keller, and F. Smadja. Automated tag clustering: Improving search and exploration in the tag space. In Proc. WWW Collaborative Web Tagging Workshop, 2006.
 
5
P. Boldi, M. Santini, and S. Vigna. Do your worst to make the best: Paradoxical effects in pagerank incremental computations. Internet Mathematics, 2(3):387--404, 2005.
 
6
7
 
8
 
9
C. Cattuto, A. Baldassarri, V. D. P. Servedio, and V. Loreto. Emergent community structure in social tagging systems. Advances in Complex Systems, 11:597--608, 2008.
 
10
 
11
S. Christiaens. Metadata mechanisms: From ontology to folksonomy ... and back. In Proc. On the Move to Meaningful Internet Systems Workshop, LNCS. Springer, 2006.
 
12
J. Diederich and T. Iofciu. Finding communities of practice from user profiles based on folksonomies. Proc. 1st Intl. Workshop on Building Technology Enhanced Learning Solutions for Communities of Practice, 2006.
13
14
 
15
 
16
H. Halpin, V. Robu, and H. Shepard. The dynamics and semantics of collaborative tagging. In Proc. 1st Semantic Authoring and Annotation Workshop (SAAW), 2006.
 
17
T. Hammond, T. Hannay, B. Lund, and J. Scott. Social Bookmarking Tools (I): A General Review. D-Lib Magazine, 11(4), April 2005.
 
18
Y. Hassan-Montero and V. Herrero-Solana. Improving tag-clouds as visual information retrieval interfaces. In Proc. Intl. Conf. on Multidisciplinary Information Sciences and Technologies, 2006.
 
19
P. Heymann and H. Garcia-Molina. Collaborative creation of communal hierarchical taxonomies in social tagging systems. Technical Report 2006-10, Stanford InfoLab, April 2006.
20
21
 
22
A. Hotho, R. Jaschke, C. Schmitz, and G. Stumme. BibSonomy: A social bookmark and publication sharing system. In Proc. Conceptual Structures Tool Interoperability Workshop at the 14th Intl. Conf. on Conceptual Structures, pages 87--102, 2006.
 
23
A. Hotho, R. Jäschke, C. Schmitz, and G. Stumme. Information retrieval in folksonomies: Search and ranking. In Y. Sure and J. Domingue, editors, The Semantic Web: Research and Applications, volume 4011 of LNAI, pages 411--426, Heidelberg, 2006. Springer.
 
24
 
25
J. J. Jiang and D. W. Conrath. Semantic Similarity based on Corpus Statistics and Lexical Taxonomy. In Proc. Intl. Conf. on Research in Computational Linguistics (ROCLING), 1997.
 
26
27
 
28
 
29
B. Lund, T. Hammond, M. Flack, and T. Hannay. Social Bookmarking Tools (II): A Case Study -- Connotea. D-Lib Magazine, 11(4), April 2005.
 
30
31
32
 
33
B. Markines, L. Stoilova, and F. Menczer. Social bookmarks for collaborative search and recommendation. In Proc. AAAI, 2006.
34
 
35
P. Mika. Ontologies are us: A unified model of social networks and semantics. In Y. Gil, E. Motta, V. R. Benjamins, and M. A. Musen, editors, Proc. Intl. Semantic Web Conf., volume 3729 of LNCS, pages 522--536. Springer, 2005.
36
 
37
K. P. G. A. Mislove and P. Druschel. Exploiting social networks for internet search. In Proc. 5th Workshop on Hot Topics in Networks, Irvine, CA, 2006.
 
38
 
39
P. Resnik. Using Information Content to Evaluate Semantic Similarity in a Taxonomy. In Proc. IJCAI XI, pages 448--453, 1995.
40
 
41
C. Schmitz, A. Hotho, R. Jaschke, and G. Stumme. Kollaboratives wissensmanagement. In T. Pellegrini and A. Blumauer, editors, Semantic Web -- Wege zur vernetzten Wissensgesellschaft, pages 273--290. Springer, 2006.
 
42
C. Schmitz, A. Hotho, R. Jaschke, and G. Stumme. Mining association rules in folksonomies. In Data Science and Classification: Proc. of the 10th IFCS Conf., pages 261--270, Berlin, Heidelberg, 2006. Springer.
 
43
P. Schmitz. Inducing ontology from Flickr tags. In WWW Collaborative Web Tagging Workshop, May 2006.
44
45
 
46
J. Voss. Tagging, folksonomy & co -- renaissance of manual indexing? Technical report, arXiv:cs/0701072, 2007.
47
 
48
Z. Xu, Y. Fu, J. Mao, and D. Su. Towards the semantic web: Collaborative tag suggestions. In Proc. WWW Collaborative Web Tagging Workshop, 2006.
 
49
L. Zhang, X. Wu, and Y. Yu. Emergent semantics from folksonomies: A quantitative study. Journal on Data Semantics VI, 2006.


Collaborative Colleagues:
Benjamin Markines: colleagues
Ciro Cattuto: colleagues
Filippo Menczer: colleagues
Dominik Benz: colleagues
Andreas Hotho: colleagues
Gerd Stumme: colleagues