| Dual cross-media relevance model for image annotation |
| Full text |
Pdf
(1.44 MB)
|
Source
|
International Multimedia Conference
archive
Proceedings of the 15th international conference on Multimedia
table of contents
Augsburg, Germany
SESSION: Content 4 - image annotation
table of contents
Pages: 605 - 614
Year of Publication: 2007
ISBN:978-1-59593-702-5
|
|
Authors
|
|
Jing Liu
|
Chinese Academy of Sciences, Beijing, China
|
|
Bin Wang
|
University of Science and Technology of China, Hefei, China
|
|
Mingjing Li
|
Microsoft Research Asia, Beijing, China
|
|
Zhiwei Li
|
Microsoft Research Asia, Beijing, China
|
|
Weiying Ma
|
Microsoft Research Asia, Beijing, China
|
|
Hanqing Lu
|
Chinese Academy of Sciences, Beijing, China
|
|
Songde Ma
|
Chinese Academy of Sciences, Beijing, China
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 10, Downloads (12 Months): 135, Citation Count: 5
|
|
|
ABSTRACT
Image annotation has been an active research topic in recent years due to its potential impact on both image understanding and web image retrieval. Existing relevance-model-based methods perform image annotation by maximizing the joint probability of images and words, which is calculated by the expectation over training images. However, the semantic gap and the dependence on training data restrict their performance and scalability. In this paper, a dual cross-media relevance model (DCMRM) is proposed for automatic image annotation, which estimates the joint probability by the expectation over words in a pre-defined lexicon. DCMRM involves two kinds of critical relations in image annotation. One is the word-to-image relation and the other is the word-to-word relation. Both relations can be estimated by using search techniques on the web data as well as available training data. Experiments conducted on the Corel dataset and a web image dataset demonstrate the effectiveness of the proposed model.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Cilibrasi, R., Vitanyi, P. M. B. Automatic Extraction of Meaning from the Web, Proc. IEEE International Symp. Information Theory, USA, 2006.
|
| |
2
|
Chang, E., et al. CBSA: Content-based Soft Annotation for Multimodal Image Retrieval Using Bayes Point Machines. CirSysVideo, 2003. 13(1): p. 26--38.
|
| |
3
|
Cusano, C., G. Ciocca, and R. Schettini. Image Annotation Using SVM. Proceedings of Internet Imaging, Vol. SPIE 5304. 2004.
|
| |
4
|
|
| |
5
|
Feng S., Manmatha, R., and Laverenko V. Multiple Bernoulli Relevance Models for Image and Video Annotation. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1002--1009, 2004.
|
 |
6
|
Xin Fan , Xing Xie , Zhiwei Li , Mingjing Li , Wei-Ying Ma, Photo-to-search: using multimodal queries to search the web from mobile devices, Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval, November 10-11, 2005, Hilton, Singapore
[doi> 10.1145/1101826.1101851]
|
| |
7
|
Goldstone R. Similarity, Interactive Activation and Mapping. Journal Experimental Psychology: Learning, Memory, and Cognition, 20, 1994, 3--28
|
| |
8
|
|
 |
9
|
|
| |
10
|
Jiang, J., Conrath, D. Semantic Similarity based on Corpus Statistics and Lexical Taxonomy. Proceedings on International Conference on Research in Computational Linguistics, 1997.
|
 |
11
|
|
 |
12
|
|
| |
13
|
|
| |
14
|
Lavrenko, V., Manmatha, R., and Jeon, J. A Model for Learning the Semantics of Pictures. Proceedings of Advance in Neutral Information Processing, 2003.
|
| |
15
|
|
 |
16
|
|
| |
17
|
|
 |
18
|
Changhu Wang , Feng Jing , Lei Zhang , Hong-Jiang Zhang, Image annotation refinement using random walk with restarts, Proceedings of the 14th annual ACM international conference on Multimedia, October 23-27, 2006, Santa Barbara, CA, USA
[doi> 10.1145/1180639.1180774]
|
| |
19
|
|
| |
20
|
Yeh, T., Tollmar, K., Darrell, T. Searching the Web with Mobile Images for Location Recognition. International Conference on Computer Vision and Pattern Recognition, 2004, pp. 76--81.
|
 |
21
|
|
CITED BY 5
|
|
|
|
|
|
|
|
Lei Wu , Xian-Sheng Hua , Nenghai Yu , Wei-Ying Ma , Shipeng Li, Flickr distance, Proceeding of the 16th ACM international conference on Multimedia, October 26-31, 2008, Vancouver, British Columbia, Canada
|
|
|
|
|
|
|
|