|
ABSTRACT
In this paper, we describe an approach for mining images of objects (such as touristic sights) from community photo collections in an unsupervised fashion. Our approach relies on retrieving geotagged photos from those web-sites using a grid of geospatial tiles. The downloaded photos are clustered into potentially interesting entities through a processing pipeline of several modalities, including visual, textual and spatial proximity. The resulting clusters are analyzed and are automatically classified into objects and events. Using mining techniques, we then find text labels for these clusters, which are used to again assign each cluster to a corresponding Wikipedia article in a fully unsupervised manner. A final verification step uses the contents (including images) from the selected Wikipedia article to verify the cluster-article assignment. We demonstrate this approach on several urban areas, densely covering an area of over 700 square kilometers and mining over 200,000 photos, making it probably the largest experiment of its kind to date.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Rakesh Agrawal , Tomasz Imieliński , Arun Swami, Mining association rules between sets of items in large databases, Proceedings of the 1993 ACM SIGMOD international conference on Management of data, p.207-216, May 25-28, 1993, Washington, D.C., United States
|
| |
2
|
M. Aurnhammer, P. Hanappe, and L. Steels. Integrating collaborative tagging and emergent semantics for image retrieval. In Collaborative Web Tagging Workshop (WWW'06), 2006.
|
| |
3
|
H. Bay, T. Tuytelaars, and L. Van Gool. Surf: Speeded up robust features. In ECCV'06, 2006.
|
 |
4
|
|
| |
5
|
A. Bosch, A. Zisserman, and X. Muñoz. Scene classification via pLSA. In ECCV'06, 2006.
|
| |
6
|
W. B. Croft and D. J. Harper. Using probabilistic models of document retrieval without relevance information. Journal of Documentation, 35, 1997.
|
| |
7
|
R. Fergus, P. Perona, and A. Zisserman. Object class recognition by unsupervised scale-invariant learning. In CVPR'03, 2003.
|
 |
8
|
|
| |
9
|
M. Goesele, N. Snavely, B. Curless, H. Hoppe, and S. Seitz. Multi-view stereo for community photo collections. In ICCV'07, 2007.
|
| |
10
|
|
 |
11
|
Alexandar Jaffe , Mor Naaman , Tamir Tassa , Marc Davis, Generating summaries and visualization for large collections of geo-referenced photographs, Proceedings of the 8th ACM international workshop on Multimedia information retrieval, October 26-27, 2006, Santa Barbara, California, USA
[doi> 10.1145/1178677.1178692]
|
| |
12
|
|
 |
13
|
Michael S. Lew , Nicu Sebe , Chabane Djeraba , Ramesh Jain, Content-based multimedia information retrieval: State of the art and challenges, ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP), v.2 n.1, p.1-19, February 2006
[doi> 10.1145/1126004.1126005]
|
| |
14
|
|
| |
15
|
J. Matas, O. Chum, M. Urban, and T. Pajdla. Robust wide-baseline stereo from maximally stable extremal regions. In BMVC'02, 2002.
|
| |
16
|
|
| |
17
|
|
| |
18
|
L. Paletta, G. Fritz, C. Seifert, P. Luley, and A. Almer. A mobile vision service for multimedia tourist applications in urban environments. In IEEE Intel. Transp. Syst. Conf., 2006.
|
| |
19
|
J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman. Object retrieval with large vocabularies and fast spatial matching. In CVPR'07, 2007.
|
| |
20
|
T. Quack, H. Bay, and L. Van Gool. Object recognition for the internet of things. In Internet of Things 2008, 2008.
|
| |
21
|
T. Quack, V. Ferrari, and L. Van Gool. Video mining with frequent itemset configurations. In CIVR'06, 2006.
|
| |
22
|
|
| |
23
|
|
| |
24
|
S. Savarese and L. Fei-Fei. 3d generic object categorization, localization and pose estimation. In ICCV'07, 2007.
|
| |
25
|
I. Simon, N. Snavely, and S. M. Seitz. Scene summarization for online image collections. In ICCV'07, 2007.
|
 |
26
|
|
| |
27
|
|
| |
28
|
J. Sivic and A. Zisserman. Video data mining using configurations of viewpoint invariant regions. In CVPR'04, 2004.
|
| |
29
|
J. Sivic and A. Zisserman. Video data mining using configurations of viewpoint invariant regions. In CVPR'04, 2004.
|
| |
30
|
T. Tuytelaars and L. Van Gool. Wide baseline stereo matching based on local, affinely invariant regions. In BMVC'00, 2000.
|
| |
31
|
|
| |
32
|
A. Webb. Statistical Pattern Recognition. Wiley, second edition, 2002.
|
CITED BY 13
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Luc Van Gool , Michael D. Breitenstein , Stephan Gammeter , Helmut Grabner , Till Quack, Mining from large image sets, Proceeding of the ACM International Conference on Image and Video Retrieval, July 08-10, 2009, Santorini, Fira, Greece
|
|
|
|
|
|
|
|
|
Douglas Gray , Igor Kozintsev , Yi Wu , Horst Haussecker, Wikireality: augmenting reality with community driven websites, Proceedings of the 2009 IEEE international conference on Multimedia and Expo, p.1290-1293, June 28-July 03, 2009, New York, NY, USA
|
|
|
Robert Sorschag , Roland Mörzinger , Georg Thallinger, Automatic region of interest detection in tagged images, Proceedings of the 2009 IEEE international conference on Multimedia and Expo, p.1612-1615, June 28-July 03, 2009, New York, NY, USA
|
|
|
Xavier Baró , Sergio Escalera , Petia Radeva , Jordi Vitrià, Visual content layer for scalable object recognition in urban image databases, Proceedings of the 2009 IEEE international conference on Multimedia and Expo, p.1616-1619, June 28-July 03, 2009, New York, NY, USA
|
|
|
Emily Moxley , Jim Kleban , Jiejun Xu , B. S. Manjunath, Not all tags are created equal: learning Flickr tag semantics for global annotation, Proceedings of the 2009 IEEE international conference on Multimedia and Expo, p.1452-1455, June 28-July 03, 2009, New York, NY, USA
|
|
|
E. Chatzilari , S. Nikolopoulos , I. Kompatsiaris , E. Giannakidou , A. Vakali, Leveraging social media for training object detectors, Proceedings of the 16th international conference on Digital Signal Processing, p.232-239, July 05-07, 2009, Santorini, Greece
|
INDEX TERMS
Primary Classification:
H.
Information Systems
H.3
INFORMATION STORAGE AND RETRIEVAL
H.3.1
Content Analysis and Indexing
General Terms:
Algorithms,
Design,
Measurement,
Theory
Keywords:
database,
geo-referenced,
image,
mining,
object recognition,
photo collection,
retrieval,
web
|