ABSTRACT
We describe an approach for extracting semantics for tags (unstructured text labels assigned to resources on the Web) based on each tag's usage patterns. In particular, we focus on the problem of extracting place semantics for tags that are assigned to photos on Flickr, a popular photo-sharing Web site that supports location (latitude/longitude) metadata for photos. We propose the adaptation of two baseline methods, inspired by well-known burst-analysis techniques, for the task; we also describe two novel methods, TagMaps and scale-structure identification. We evaluate the methods on a subset of Flickr data. We show that our scale-structure identification method outperforms existing techniques and that a hybrid approach generates further improvements (achieving 85% precision at 81% recall). The approach and methods described in this work can be used in other domains such as geo-annotated Web pages, where text terms can be extracted and associated with usage patterns.