ACM Home Page
Please provide us with feedback. Feedback
Flickr distance
Full text PdfPdf (887 KB)
Source
International Multimedia Conference archive
Proceeding of the 16th ACM international conference on Multimedia table of contents
Vancouver, British Columbia, Canada
SESSION: Best paper session table of contents
Pages 31-40  
Year of Publication: 2008
ISBN:978-1-60558-303-7
Authors
Lei Wu  University of Science and Technology of China, Hefei, China
Xian-Sheng Hua  Microsoft Research Asia, Beijing, China
Nenghai Yu  University of Science and Technology of China, Hefei, China
Wei-Ying Ma  Microsoft Research Asia, Beijing, China
Shipeng Li  Microsoft Research Asia, Beijing, China
Sponsors
ACM: Association for Computing Machinery
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 21,   Downloads (12 Months): 506,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1459359.1459364
What is a DOI?

ABSTRACT

This paper presents Flickr distance, which is a novel measurement of the relationship between semantic concepts (objects, scenes) in visual domain. For each concept, a collection of images are obtained from Flickr, based on which the improved latent topic based visual language model is built to capture the visual characteristic of this concept. Then Flickr distance between different concepts is measured by the square root of Jensen-Shannon (JS) divergence between the corresponding visual language models. Comparing with WordNet, Flickr distance is able to handle far more concepts existing on the Web, and it can scale up with the increase of concept vocabularies. Comparing with Google distance, which is generated in textual domain, Flickr distance is more precise for visual domain concepts, as it captures the visual relationship between the concepts instead of their co-occurrence in text search results. Besides, unlike Google distance, Flickr distance satisfies triangular inequality, which makes it a more reasonable distance metric. Both subjective user study and objective evaluation show that Flickr distance is more coherent to human perception than Google distance. We also design several application scenarios, such as concept clustering and image annotation, to demonstrate the effectiveness of this proposed distance in image related applications.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
S. Borgatti. Netdraw. http://www.analytictech.com/Netdraw/netdraw.htm, 2008.
3
 
4
 
5
6
 
7
8
 
9
T. S. Huang, C. K. Dagli, S. Rajaram, E. Y. Chang, M. I. Mandel, G. E. Poliner, and D. P. W. Ellis. Active learning for interactive multimedia retrieval. In Proc. of the IEEE, 2008.
10
 
11
V. Lavrenko, R. Manmatha, and J. Jeon. A model for learning the semantics of pictures. In Proc. of NIPS'03., 2003.
12
13
14
15
 
16
G. A. Miller and et.al. Wordnet, a lexical database for the english language. Cognition Science Lab, Princeton University, 1995.
17
18
 
19
B. Wang, Z. Li, M. Li, and W.-Y. Ma. Large-scale duplicate detection for web image search. In Proc. of IEEE International Conference on Multimedia & Expo (ICME'06), 2006.
 
20
C. Wang, F. Jing, L. Zhang, and H.-J. Zhang. Content-based image annotation refinement. 2007.
21
 
22
L. Wu, J. Liu, M. Li, and N. Yu. Query oriented subspace shifting for near-duplicate image detection. In Proc. of IEEE International Conference on Multimedia & Expo (ICME'08), 2008.
 
23
J. Yu and Q. Tian. Semantic subspace projection and its application in image retrieval. IEEE Transactions on Circuits and Systems for Video Technology (CSVT), pages 544--548, 2008.


Collaborative Colleagues:
Lei Wu: colleagues
Xian-Sheng Hua: colleagues
Nenghai Yu: colleagues
Wei-Ying Ma: colleagues
Shipeng Li: colleagues