| A semi-naïve Bayesian method incorporating clustering with pair-wise constraints for auto image annotation |
| Full text |
Pdf
(259 KB)
|
| Source
|
International Multimedia Conference
archive
Proceedings of the 12th annual ACM international conference on Multimedia
table of contents
New York, NY, USA
POSTER SESSION: Technical poster session 1: multimedia analysis, processing, and retrieval
table of contents
Pages: 336 - 339
Year of Publication: 2004
ISBN:1-58113-893-8
|
|
Authors
|
|
Wanjun Jin
|
National University of Singapore and Fudan University, China
|
|
Rui Shi
|
National University of Singapore
|
|
Tat-Seng Chua
|
National University of Singapore
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 2, Downloads (12 Months): 45, Citation Count: 2
|
|
|
ABSTRACT
We propose a novel approach for auto image annotation. In our approach, we first perform the segmentation of images into regions, followed by clustering of regions, before learning the relationship between concepts and region clusters using the set of training images with pre-assigned concepts. The main focus of this paper is two-fold. First, in the learning stage, we perform clustering of regions into region clusters by incorporating pair-wise constraints which are derived by considering the language model underlying the annotations assigned to training images. Second, in the annotation stage, we employ a semi-naïve Bayes model to compute the posterior probability of concepts given the region clusters. Experiment results show that our proposed system utilizing these two strategies outperforms the state-of-the-art techniques in annotating large image collection.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
Mikhail Bilenko , Sugato Basu , Raymond J. Mooney, Integrating constraints and metric learning in semi-supervised clustering, Proceedings of the twenty-first international conference on Machine learning, p.11, July 04-08, 2004, Banff, Alberta, Canada
[doi> 10.1145/1015330.1015360]
|
 |
3
|
|
| |
4
|
|
 |
5
|
|
| |
6
|
|
| |
7
|
Lavrenko, V., Manmatha, R. & Jeon, J.: A model for learning the semantics of pictures. Neural Information Processing System (NIPS), 2003.
|
| |
8
|
|
| |
9
|
|
| |
10
|
Mori, Y., Takahashi, H. & Oka, R.: Image-to-word transformation based on dividing and vector quantizing images with words. First Int'l Workshop on multimedia Intelligent Storage & Retrieval Management, 1999.
|
| |
11
|
Platt, J.C: "Probabilities for SV machines," In A.Smola, P. Bartlett, B. Scholkopf, and D. Schuurmans,editors, Advances in Large Margin Classifiers, pages 61--74, Cambridge, MA, 1999, MIT Press.
|
| |
12
|
Shi, R., Feng, H.M., Chua, T.-S. & Lee, C.-H., An adaptive image content representation and segmentation approach to automatic image annotation. Int'l Conf. on Image and Video Retrieval, July 21-23, 2004
|
| |
13
|
|
| |
14
|
|
|