| Enhancing image annotation by integrating concept ontology and text-based bayesian learning model |
| Full text |
Pdf
(413 KB)
|
Source
|
International Multimedia Conference
archive
Proceedings of the 15th international conference on Multimedia
table of contents
Augsburg, Germany
POSTER SESSION: Short papers poster session 1 - content analysis
table of contents
Pages: 341 - 344
Year of Publication: 2007
ISBN:978-1-59593-702-5
|
|
Authors
|
|
Rui Shi
|
National University of Singapore, Singapore
|
|
Chin-Hui Lee
|
Georgia Institute of Technology, Atlanta, GA
|
|
Tat-Seng Chua
|
National University of Singapore, Singapore
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 10, Downloads (12 Months): 89, Citation Count: 1
|
|
|
ABSTRACT
Automatic image annotation (AIA) has been a hot research topic in recent years since it can be used to support concept-based image retrieval. However, most existing AIA models depend heavily on the availability of a large number of labeled training samples, which require significant human labeling efforts. In this paper, we propose a novel learning framework which integrates text-based Bayesian model (TBM) and concept ontology to effectively expand the training set of each concept class without the need of additional human labeling efforts or collecting additional training images from other data sources. The basic idea lies in exploiting the text information from training set to provide additional effective annotations for training images so that training data for each concept class can be augmented. In this study we employ Bayesian Hierarchical Multinomial Mixture Models (BHMMMs) as our baseline AIA model. By combining additional annotations obtained from TBM into each concept class in the training phase, the performance of BHMMMs can be significantly improved on Corel image dataset with 263 testing concepts as compared to the state-of-the-art AIA models under the same experimental configurations.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
K. Barnard, P. Duygulu and D. Forsyth, "Clustering Art", In Proc. Of IEEE Computer Vision and Pattern Recognition, 2001.
|
| |
2
|
|
| |
3
|
|
 |
4
|
|
| |
5
|
S. L. Feng, R. Manmatha and V. Lavrenko, "Multiple Bernoulli Relevance Models for Image and Video Annotation", Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR'04.
|
| |
6
|
S. Gao, D.-H. Wang and C.-H. Lee, "Automatic Image Annotation through Multi-Topic Text Categorization", Proc. ICASSP, Toulouse, France, May 2006.
|
 |
7
|
|
| |
8
|
V. Lavrenko, R. Manmatha and J. Jeon, "A Model for Learning the Semantics of Pictures", NIPS, 2003.
|
| |
9
|
G. A. Miller, R. Beckwith, C. Fellbaum, D. Gross and K. J. Miller, "Introduction to WordNet: an on-line lexical database", Intl. Jour. Of Lexicography, pp. 235--244, 1990.
|
| |
10
|
J. Novovicova and A. Malik, "Application of Multinomial Mixture Model to Text Classification", Pattern Recognition and Image Analysis, pp. 646--653, 2003.
|
 |
11
|
|
| |
12
|
R. Shi, T. S. Chua, C. H. Lee and S. Gao, "Bayesian Learning of Hierarchical Multinomial Mixture Models of Concepts for Automatic Image Annotation", In Proc. of CIVR'06, pp. 102--112, Arizona, United States, 2006.
|
 |
13
|
|
| |
14
|
R. Yan, and A. G. Hauptmann, "Multi-class Active Learning for Video Semantic Feature Extraction", In Proc. of ICME'04, pp. 69--72, 2004.
|
 |
15
|
|
CITED BY
|
|
Dong Liu , Xian-Sheng Hua , Linjun Yang , Meng Wang , Hong-Jiang Zhang, Tag ranking, Proceedings of the 18th international conference on World wide web, April 20-24, 2009, Madrid, Spain
|
|