ACM Home Page
Please provide us with feedback. Feedback
Semi-supervised topic modeling for image annotation
Full text PdfPdf (867 KB)
Source
International Multimedia Conference archive
Proceedings of the seventeen ACM international conference on Multimedia table of contents
Beijing, China
SESSION: Short papers session 1: content analysis table of contents
Pages 521-524  
Year of Publication: 2009
ISBN:978-1-60558-608-3
Authors
Yuanlong Shao  State Key Laboratory of CAD&CG, Zhejiang University, Hangzhou, Zhejiang, China
Yuan Zhou  State Key Laboratory of CAD&CG, Zhejiang University, Hangzhou, Zhejiang, China
Xiaofei He  State Key Laboratory of CAD&CG, Zhejiang University, Hangzhou, Zhejiang, China
Deng Cai  State Key Laboratory of CAD&CG, Zhejiang University, Hangzhou, Zhejiang, China
Hujun Bao  State Key Laboratory of CAD&CG, Zhejiang University, Hangzhou, Zhejiang, China
Sponsor
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 26,   Downloads (12 Months): 26,   Citation Count: 0
Additional Information:

abstract   references   index terms  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1631272.1631346
What is a DOI?

ABSTRACT

We propose a novel technique for semi-supervised image annotation which introduces a harmonic regularizer based on the graph Laplacian of the data into the probabilistic semantic model for learning latent topics of the images. By using a probabilistic semantic model, we connect visual features and textual annotations of images by their latent topics. Meanwhile, we incorporate the manifold assumption into the model to say that the probabilities of latent topics of images are drawn from a manifold, so that for images sharing similar visual features or the same annotations, their probability distribution of latent topics should also be similar. We create a nearest neighbor graph to model the manifold and propose a regularized EM algorithm to simultaneously learn a generative model and assign probability density of latent topics to images discriminatively. In this way, databases with very few labeled images can be annotated better than previous works.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
K. Barnard, P. Duygulu, D. Forsyth, N. de Freitas, D. M. Blei, and M. I. Jordan. Matching words and pictures. Journal of Machine Learning Research, 3:1107--1135, 2003.
 
2
M. Belkin, P. Niyogi, and V. Sindhwani. Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. Journal of Machine Learning Research, 7:2399--2434, 2006.
 
3
D. M. Blei and M. I. Jordan. Modeling annotated data. In Proc. ACM Int. Conf. on Research and Development in Informaion Retrieval(ACM SIGIR), pages 127--134, 2003.
 
4
D. Cai, Q. Mei, J. Han, and C. Zhai. Modeling hidden topics on document manifold. In Proc. ACM Conf. on Information and knowledge management(CIKM'08), pages 911--920, 2008.
 
5
G. Csurka, C. R. Dance, L. Fan, J. Willamowski, and C. Bray. Visual categorization with bags of keypoints. In Workshop on Statistical Learning in Computer Vision, ECCV, pages 1--22, 2004.
 
6
X. He, D. Cai, Y. Shao, H. Bao, and J. Han. Laplacian regularized gaussian mixture model for data clustering. Preprint.
 
7
Q. Mei, D. Cai, D. Zhang, and C. Zhai. Topic modeling with network regularization. In Proc. ACM Int. Conf. on World Wide Web (WWW'08), pages 101--110, 2008.
 
8
F. Monay and D. Gatica-Perez. On image auto-annotation with latent space models. In Proc. ACM Int. Conf. on Multimedia (SIGMM'03), pages 275--278, 2003.
 
9
F. Monay and D. Gatica-Perez. Plsa-based image auto-annotation: constraining the latent space. In Proc. ACM Int. Conf. on Multimedia (SIGMM'04), pages 348--351, 2004.
 
10
R. M. Neal and G. E. Hinton. A view of the em algorithm that justifies incremental, sparse, and other variants. In Learning in graphical models, pages 355--368. 1999.
 
11
R. Zhang, Z. M. Zhang, M. Li, W.-Y. Ma, and H.-J. Zhang. A probabilistic semantic model for image annotation and multi-modal image retrieval. In Proc. IEEE Int. Conf. on Computer Vision (ICCV'05), pages 846--851, 2005.
 
12
X. Zhu, J. Lafferty, and Z. Ghahramani. Semi-supervised learning using gaussian fields and harmonic functions. In Proc. Int. Conf. Machine Learning(ICML'05), 2005.