ABSTRACT
Social bookmarking systems are becoming increasingly important data sources for bootstrapping and maintaining Semantic Web applications. Their emergent information structures have become known as folksonomies. A key question for harvesting semantics from these systems is how to extend and adapt traditional notions of similarity to folksonomies, and which measures are best suited for applications such as community detection, navigation support, semantic search, user profiling and ontology learning. Here we build an evaluation framework to compare various general folksonomy-based similarity measures, which are derived from several established information-theoretic, statistical, and practical measures. Our framework deals generally and symmetrically with users, tags, and resources. For evaluation purposes we focus on similarity between tags and between resources and consider different methods to aggregate annotations across users. After comparing the ability of several tag similarity measures to predict user-created tag relations, we provide an external grounding by user-validated semantic proxies based on WordNet and the Open Directory Project. We also investigate the issue of scalability. We find that mutual information with distributional micro-aggregation across users yields the highest accuracy, but is not scalable; per-user projection with collaborative aggregation provides the best scalable approach via incremental computations. The results are consistent across resource and tag similarity.