ACM Home Page
Please provide us with feedback. Feedback
Exploiting ontologies for automatic image annotation
Full text PdfPdf (147 KB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Salvador, Brazil
SESSION: Video and image table of contents
Pages: 552 - 558  
Year of Publication: 2005
ISBN:1-59593-034-5
Authors
Munirathnam Srikanth  Language Computer Corporation, Richardson, TX
Joshua Varner  Language Computer Corporation, Richardson, TX
Mitchell Bowden  Language Computer Corporation, Richardson, TX
Dan Moldovan  Language Computer Corporation, Richardson, TX
Sponsor
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 16,   Downloads (12 Months): 163,   Citation Count: 8
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1076034.1076128
What is a DOI?

ABSTRACT

Automatic image annotation is the task of automatically assigning words to an image that describe the content of the image. Machine learning approaches have been explored to model the association between words and images from an annotated set of images and generate annotations for a test image. The paper proposes methods to use a hierarchy defined on the annotation words derived from a text ontology to improve automatic image annotation and retrieval. Specifically, the hierarchy is used in the context of generating a visual vocabulary for representing images and as a framework for the proposed hierarchical classification approach for automatic image annotation. The effect of using the hierarchy in generating the visual vocabulary is demonstrated by improvements in the annotation performance of translation models. In addition to performance improvements, hierarchical classification approaches yield well to constructing multimedia ontologies.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
K. Barnard, P. Duygulu, and D. A. Forsyth. Modeling the Statistics of Image Features and Associated Text. In Document Recognition and Retrieval IX - Electronic Imaging, 2002.
 
2
K. Barnard and D. A. Forsyth. Learning the Semantics of Words and Pictures. In Proceedings of International Conference on Computer Vision, pages 408--415, 2001.
3
 
4
 
5
C. Cusano, G. Ciocca, and R. Schettini. Image Annotation using SVM. In Proceedings of Internet Imaging IV, 2004.
 
6
A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum Likelihood from Incomplete Data via the EM algorithm. Journal of the Royal Statistical Society, Series B, 39:1--38, 1977.
 
7
 
8
S. L. Feng, R. Manmatha, and V. Lavrenko. Multiple Bernoulli Relevance Models for Image and Video Annotation. In Proceedings of CVPR'04, 2004.
 
9
10
11
12
 
13
V. Lavrenko, R. Manmatha, and J. Jeon. A Model for Learning the Semantics of Pictures. In Proceedings of NIPS'03, 2004.
 
14
 
15
 
16
 
17
Y. Mori, H. Takahashi, and R. Oka. Image-to-Word Transformation based on Dividing and Vector Quantizing Images with Words. In First International Workshop on Multimedia Intelligent Storage and Retrieval Management, 1999.
 
18
19
 
20
Text retrieval conference. http://trec.nist.gov.
 
21
Trec video data set. http://www-nlpir.nist.gov/projects/trecvid.

CITED BY  8

Collaborative Colleagues:
Munirathnam Srikanth: colleagues
Joshua Varner: colleagues
Mitchell Bowden: colleagues
Dan Moldovan: colleagues