ACM Home Page
Please provide us with feedback. Feedback
Scalable Markov model-based image annotation
Full text PdfPdf (343 KB)
Source
Conference On Image And Video Retrieval archive
Proceedings of the 2008 international conference on Content-based image and video retrieval table of contents
Niagara Falls, Canada
SESSION: Subspace learning in content-based image retrieval table of contents
Pages 113-118  
Year of Publication: 2008
ISBN:978-1-60558-070-8
Authors
Changhu Wang  University of Science and Technology of China, Hefei, China
Lei Zhang  Microsoft Research Asia, Beijing, China
Hong-Jiang Zhang  Microsoft Advanced Technology Center, Beijing, China
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 108,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1386352.1386371
What is a DOI?

ABSTRACT

In this paper, we propose a novel Markov model-based formulation for the image annotation problem. In this formulation, we treat image annotation as a graph ranking problem, by defining all possible labels in the lexicon as the states of a Markov chain. To fully utilize the correlation between labels, a query-biased transition matrix is dynamically constructed according to the query image. Based on this formulation, a scalable Markov model-based image annotation (MBIA) algorithm is presented to rank all the possible labels. To be scalable, we adapt search techniques on a Web-scale image set, and make MBIA capable of annotating arbitrary images with unlimited vocabulary. By fully exploring the correlation between labels, MBIA leads to superior performance than standard techniques. Experimental results on the typical Corel dataset and U. Washington dataset show the effectiveness and efficiency of the proposed algorithm.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
Chang, E., Kingshy, G., Sychay, G., and Wu, G. CBSA: content-based soft annotation for multimodal image retrieval using Bayes point machines. IEEE Trans. on CSVT, 13(1):26--38, Jan. 2003.
 
3
Cusano, C., G. Ciocca, and R. Schettini. Image Annotation Using Svm. in Proceedings of Internet imaging IV, Vol. SPIE 5304. 2004.
 
4
5
6
 
7
Lavrenko, V., Manmatha, R., and Jeon, J. "A Model for Learning the Semantics of Pictures," Proc. Advances in Neutral Information Processing Systems, 2003.
 
8
9
 
10
 
11
Y. Mori, H. Takahashi, and R. Oka. Image-to-word transformation based on dividing and vector quantizing images with words. In First International Workshop on Multimedia Intelligent Storage and Retrieval Management, 1999.
12
13
 
14
Wang, C., Jing, F., Zhang, L., and Zhang, H. 2007. Content-based Image Annotation Refinement. In Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR. IEEE Computer Society, Minneapolis, Minnesota, 2007, 1--8.
15

Collaborative Colleagues:
Changhu Wang: colleagues
Lei Zhang: colleagues
Hong-Jiang Zhang: colleagues