| Exploiting ontologies for automatic image annotation |
| Full text |
Pdf
(147 KB)
|
| Source
|
Annual ACM Conference on Research and Development in Information Retrieval
archive
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
table of contents
Salvador, Brazil
SESSION: Video and image
table of contents
Pages: 552 - 558
Year of Publication: 2005
ISBN:1-59593-034-5
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 16, Downloads (12 Months): 163, Citation Count: 8
|
|
|
ABSTRACT
Automatic image annotation is the task of automatically assigning words to an image that describe the content of the image. Machine learning approaches have been explored to model the association between words and images from an annotated set of images and generate annotations for a test image. The paper proposes methods to use a hierarchy defined on the annotation words derived from a text ontology to improve automatic image annotation and retrieval. Specifically, the hierarchy is used in the context of generating a visual vocabulary for representing images and as a framework for the proposed hierarchical classification approach for automatic image annotation. The effect of using the hierarchy in generating the visual vocabulary is demonstrated by improvements in the annotation performance of translation models. In addition to performance improvements, hierarchical classification approaches yield well to constructing multimedia ontologies.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
K. Barnard, P. Duygulu, and D. A. Forsyth. Modeling the Statistics of Image Features and Associated Text. In Document Recognition and Retrieval IX - Electronic Imaging, 2002.
|
| |
2
|
K. Barnard and D. A. Forsyth. Learning the Semantics of Words and Pictures. In Proceedings of International Conference on Computer Vision, pages 408--415, 2001.
|
 |
3
|
|
| |
4
|
|
| |
5
|
C. Cusano, G. Ciocca, and R. Schettini. Image Annotation using SVM. In Proceedings of Internet Imaging IV, 2004.
|
| |
6
|
A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum Likelihood from Incomplete Data via the EM algorithm. Journal of the Royal Statistical Society, Series B, 39:1--38, 1977.
|
| |
7
|
|
| |
8
|
S. L. Feng, R. Manmatha, and V. Lavrenko. Multiple Bernoulli Relevance Models for Image and Video Annotation. In Proceedings of CVPR'04, 2004.
|
| |
9
|
|
 |
10
|
|
 |
11
|
|
 |
12
|
|
| |
13
|
V. Lavrenko, R. Manmatha, and J. Jeon. A Model for Learning the Semantics of Pictures. In Proceedings of NIPS'03, 2004.
|
| |
14
|
|
| |
15
|
|
| |
16
|
|
| |
17
|
Y. Mori, H. Takahashi, and R. Oka. Image-to-Word Transformation based on Dividing and Vector Quantizing Images with Words. In First International Workshop on Multimedia Intelligent Storage and Retrieval Management, 1999.
|
| |
18
|
|
 |
19
|
|
| |
20
|
Text retrieval conference. http://trec.nist.gov.
|
| |
21
|
Trec video data set. http://www-nlpir.nist.gov/projects/trecvid.
|
CITED BY 8
|
|
|
|
|
Gang Chen , Xiaoyan Li , Lidan Shou , Jinxiang Dong , Chun Chen, HISA: a query system bridging the semantic gap for large image databases, Proceedings of the 32nd international conference on Very large data bases, September 12-15, 2006, Seoul, Korea
|
|
|
|
|
|
|
|
|
|
|
|
Xiangdong Zhou , Mei Wang , Qi Zhang , Junqi Zhang , Baile Shi, Automatic image annotation by an iterative approach: incorporating keyword correlations and region matching, Proceedings of the 6th ACM international conference on Image and video retrieval, p.25-32, July 09-11, 2007, Amsterdam, The Netherlands
|
|
|
|
|
|
|
|