|
ABSTRACT
Automatic keyword annotation is a promising solution to enable more effective image search by using keywords. In this paper, we propose a novel automatic image annotation method based on manifold ranking learning, in which the visual and textual information are well integrated. Due to complex and unbalanced data distribution and limited prior information in practice, we design two new schemes to make manifold ranking efficient for image annotation. Firstly, we design a new scheme named the Nearest Spanning Chain (NSC) to generate an adaptive similarity graph, which is robust across data distribution and easy to implement. Secondly, the word-to-word correlations obtained from WordNet and the pairwise co-occurrence are taken into consideration to expand the annotations and prune irrelevant annotations for each image. Experiments conducted on standard Corel dataset and web image dataset demonstrate the effectiveness and efficiency of the proposed method for image annotation.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Budanitsky, A. and Hirst, G. Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures. In Workshop on WordNet and Other Lexical Resources, 2 nd of the North American Chapter of the ACL, Pittsburgh, 2001.
|
| |
2
|
Claudio, C., Gianluigi, C., Raimondo, S. Image annotation using SVM. In Proceeding Of Internet imaging IV, Vol. SPIE, 2004.
|
| |
3
|
Cai, D., Yu, S., Wen, J.R. and Ma, W.Y. VIPS: a vision-based page segmentation algorithm. Microsoft Technical Report (MSR-TR-2003-79), 2003.
|
| |
4
|
Edward Chang, Kingshy Goh, Gerard Sychay, Gang Wu. CBSA: content-base soft annotation for multimodal image retrieval using bayes point machines. CirSysVideo, pp. 26--38, 13(1), 2003.
|
 |
5
|
|
 |
6
|
|
 |
7
|
Jingrui He , Mingjing Li , Hong-Jiang Zhang , Hanghang Tong , Changshui Zhang, Manifold-ranking based image retrieval, Proceedings of the 12th annual ACM international conference on Multimedia, October 10-16, 2004, New York, NY, USA
[doi> 10.1145/1027527.1027531]
|
| |
8
|
|
 |
9
|
|
 |
10
|
|
 |
11
|
|
 |
12
|
|
| |
13
|
Jiang, J. and Conrath, D. Semantic similarity based on corpus statistics and lexical taxonomy. In Proceedings on International Conference on Research in Computational Linguistics, 1997.
|
| |
14
|
|
| |
15
|
|
| |
16
|
Pucher, M. Performance Evaluation of WordNet-based Semantic Relatedness Measures for Word Prediction in Conversational Speech. In Sixth International Workshop on Computational Semantics, Tilburg, Netherlands, 2005.
|
| |
17
|
|
 |
18
|
Jia-Yu Pan , Hyung-Jeong Yang , Christos Faloutsos , Pinar Duygulu, Automatic multimedia cross-modal correlation discovery, Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, August 22-25, 2004, Seattle, WA, USA
[doi> 10.1145/1014052.1014135]
|
| |
19
|
R. Manmatha, V. Lavrenko, and J. Jeon, A Model for Learning the Semantics of Pictures. In Proc. of the 17th Annual Conf. on Neural Information Processing Systems, 2003.
|
| |
20
|
S. L. Feng, R. Manmatha and V. Lavrenko. Multiple Bernouli Relevance Models for Image and Video Annotation. In Proc. Of CVPR, Washington, DC, June, 2004.
|
| |
21
|
|
 |
22
|
Hanghang Tong , Jingrui He , Mingjing Li , Changshui Zhang , Wei-Ying Ma, Graph based multi-modality learning, Proceedings of the 13th annual ACM international conference on Multimedia, November 06-11, 2005, Hilton, Singapore
[doi> 10.1145/1101149.1101337]
|
| |
23
|
Wojciech, M., Hanspeter, P., Matt, B. A data-driven reflectance model. In Proc. of SIGGRAPH, 2003.
|
| |
24
|
Zhou, D., Bousquet, O., Lal, T.N., Weston, J., and Schölkopf, B. Ranking on Data Manifolds. 18th Annual Conf. on Neural Information Processing System, pp. 169--176, 2003.
|
| |
25
|
Zhou, D., J. Huang and B. Schölkopf. Learning with local and global consistency. 18 th Annual Conference on Neural Information Processing Systems, 2003.
|
 |
26
|
|
CITED BY 7
|
|
|
|
|
Xiangdong Zhou , Mei Wang , Qi Zhang , Junqi Zhang , Baile Shi, Automatic image annotation by an iterative approach: incorporating keyword correlations and region matching, Proceedings of the 6th ACM international conference on Image and video retrieval, p.25-32, July 09-11, 2007, Amsterdam, The Netherlands
|
|
|
|
|
|
|
|
|
|
|
|
Xianming Liu , Rongrong Ji , Hongxun Yao , Pengfei Xu , Xiaoshuai Sun , Tianqiang Liu, Cross-media manifold learning for image retrieval & annotation, Proceeding of the 1st ACM international conference on Multimedia information retrieval, October 30-31, 2008, Vancouver, British Columbia, Canada
|
|
|
Jing Liu , Mingjing Li , Qingshan Liu , Hanqing Lu , Songde Ma, Image annotation via graph learning, Pattern Recognition, v.42 n.2, p.218-228, February, 2009
|
|