ABSTRACT
Can we leverage the community-contributed collections of rich media on the web to automatically generate representative and diverse views of the world's landmarks? We use a combination of context- and content-based tools to generate representative sets of images for location-driven features and landmarks, a common search task. To do so, we use location and other metadata, as well as tags associated with images, and the images' visual features. We present an approach to extracting tags that represent landmarks. We show how to use unsupervised methods to extract representative views and images for each landmark. This approach can potentially scale to provide better search and representation for landmarks worldwide. We evaluate the system in the context of image search using a real-life dataset of 110,000 images from the San Francisco area.