ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
World-scale mining of objects and events from community photo collections
Full text PdfPdf (4.35 MB)
Source
Conference On Image And Video Retrieval archive
Proceedings of the 2008 international conference on Content-based image and video retrieval table of contents
Niagara Falls, Canada
SESSION: Objects, events and concepts table of contents
Pages: 47-56  
Year of Publication: 2008
ISBN:978-1-60558-070-8
Authors
Till Quack  kooaba AG, Zurich, Switzerland and ETH Zurich, Zurich, Switzerland
Bastian Leibe  ETH Zurich, Zurich, Switzerland
Luc Van Gool  ETH Zurich, Zurich, Switzerland and K.U. Leuven, Leuven, Belgium
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 22,   Downloads (12 Months): 340,   Citation Count: 13
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1386352.1386363
What is a DOI?

ABSTRACT

In this paper, we describe an approach for mining images of objects (such as touristic sights) from community photo collections in an unsupervised fashion. Our approach relies on retrieving geotagged photos from those web-sites using a grid of geospatial tiles. The downloaded photos are clustered into potentially interesting entities through a processing pipeline of several modalities, including visual, textual and spatial proximity. The resulting clusters are analyzed and are automatically classified into objects and events. Using mining techniques, we then find text labels for these clusters, which are used to again assign each cluster to a corresponding Wikipedia article in a fully unsupervised manner. A final verification step uses the contents (including images) from the selected Wikipedia article to verify the cluster-article assignment. We demonstrate this approach on several urban areas, densely covering an area of over 700 square kilometers and mining over 200,000 photos, making it probably the largest experiment of its kind to date.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
M. Aurnhammer, P. Hanappe, and L. Steels. Integrating collaborative tagging and emergent semantics for image retrieval. In Collaborative Web Tagging Workshop (WWW'06), 2006.
 
3
H. Bay, T. Tuytelaars, and L. Van Gool. Surf: Speeded up robust features. In ECCV'06, 2006.
4
 
5
A. Bosch, A. Zisserman, and X. Muñoz. Scene classification via pLSA. In ECCV'06, 2006.
 
6
W. B. Croft and D. J. Harper. Using probabilistic models of document retrieval without relevance information. Journal of Documentation, 35, 1997.
 
7
R. Fergus, P. Perona, and A. Zisserman. Object class recognition by unsupervised scale-invariant learning. In CVPR'03, 2003.
8
 
9
M. Goesele, N. Snavely, B. Curless, H. Hoppe, and S. Seitz. Multi-view stereo for community photo collections. In ICCV'07, 2007.
 
10
11
 
12
13
 
14
 
15
J. Matas, O. Chum, M. Urban, and T. Pajdla. Robust wide-baseline stereo from maximally stable extremal regions. In BMVC'02, 2002.
 
16
 
17
 
18
L. Paletta, G. Fritz, C. Seifert, P. Luley, and A. Almer. A mobile vision service for multimedia tourist applications in urban environments. In IEEE Intel. Transp. Syst. Conf., 2006.
 
19
J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman. Object retrieval with large vocabularies and fast spatial matching. In CVPR'07, 2007.
 
20
T. Quack, H. Bay, and L. Van Gool. Object recognition for the internet of things. In Internet of Things 2008, 2008.
 
21
T. Quack, V. Ferrari, and L. Van Gool. Video mining with frequent itemset configurations. In CIVR'06, 2006.
 
22
 
23
 
24
S. Savarese and L. Fei-Fei. 3d generic object categorization, localization and pose estimation. In ICCV'07, 2007.
 
25
I. Simon, N. Snavely, and S. M. Seitz. Scene summarization for online image collections. In ICCV'07, 2007.
26
 
27
 
28
J. Sivic and A. Zisserman. Video data mining using configurations of viewpoint invariant regions. In CVPR'04, 2004.
 
29
J. Sivic and A. Zisserman. Video data mining using configurations of viewpoint invariant regions. In CVPR'04, 2004.
 
30
T. Tuytelaars and L. Van Gool. Wide baseline stereo matching based on local, affinely invariant regions. In BMVC'00, 2000.
 
31
 
32
A. Webb. Statistical Pattern Recognition. Wiley, second edition, 2002.

CITED BY  13

Collaborative Colleagues:
Till Quack: colleagues
Bastian Leibe: colleagues
Luc Van Gool: colleagues