|
ABSTRACT
In this paper, we have proposed a novel framework to enable hierarchical image classification via statistical learning. By integrating the concept hierarchy for semantic image concept organization, a hierarchical mixture model is proposed to enable multi-level modeling of semantic image concepts and hierarchical classifier combination. Thus, learning the classifiers for the semantic image concepts at the high level of the concept hierarchy can be effectively achieved by detecting the presences of the relevant base-level atomic image concepts. To effectively learn the base-level classifiers for the atomic image concepts at the first level of the concept hierarchy, we have proposed a novel adaptive EM algorithm to achieve more effective model selection and parameter estimation. In addition, a novel penalty term is proposed to effectively eliminate the misleading effects of the outlying unlabeled images on semi-supervised classifier training. Our experimental results in a specific image domain of outdoor photos are very attractive.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Osmar R. Zaïane , Jiawei Han , Ze-Nian Li , Sonny H. Chee , Jenny Y. Chiang, MultiMediaMiner: a system prototype for multimedia data mining, Proceedings of the 1998 ACM SIGMOD international conference on Management of data, p.581-583, June 01-04, 1998, Seattle, Washington, United States
|
 |
2
|
|
| |
3
|
C. Djeraba, "When image indexing meets knowledge discovery", ACM MDM/KDD, pp.73--81, 2000.
|
| |
4
|
|
 |
5
|
|
 |
6
|
|
| |
7
|
E. Chang, K. Goh, G. Sychay, G. Wu, "CBSA: Content-based annotation for multimodal image retrieval using Bayes point machines", IEEE Trans. CSVT, 2002.
|
| |
8
|
|
 |
9
|
|
| |
10
|
|
| |
11
|
Y. Wu, A. Zhang, "Adaptive pattern discovery for interactive multimedia retrieval", IEEE CVPR, 2003.
|
 |
12
|
|
| |
13
|
|
 |
14
|
Jia-Yu Pan , Hyung-Jeong Yang , Christos Faloutsos , Pinar Duygulu, Automatic multimedia cross-modal correlation discovery, Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, August 22-25, 2004, Seattle, WA, USA
[doi> 10.1145/1014052.1014135]
|
 |
15
|
|
 |
16
|
Y. Alp Aslandogan , Chuck Thier , Clement T. Yu , Jon Zou , Naphtali Rishe, Using semantic contents and WordNet in image retrieval, Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval, p.286-295, July 27-31, 1997, Philadelphia, Pennsylvania, United States
|
| |
17
|
G. McLachlan and T. Krishnan, The EM algorithm and extensions, New York, John Wiley & Sons, 2000.
|
| |
18
|
|
| |
19
|
|
| |
20
|
|
| |
21
|
|
| |
22
|
|
| |
23
|
|
| |
24
|
Soumen Chakrabarti , Byron Dom , Rakesh Agrawal , Prabhakar Raghavan, Using Taxonomy, Discriminants, and Signatures for Navigating in Text Databases, Proceedings of the 23rd International Conference on Very Large Data Bases, p.446-455, August 25-29, 1997
|
| |
25
|
C. Fellbaum, WordNet: An electronic lexical database, MIT Press, 1998.
|
 |
26
|
|
| |
27
|
C.A. Bouman, M. Shapiro, G. Cook, C. Atkins, H. Cheng, "Cluster: An unsupervised algorithm for modeling Gaussian mixtures", Technical Report, Purdue University.
|
| |
28
|
|
| |
29
|
F. Cozman, I. Cohen, "Unlabeled data can degrade classification performance of generative classifier", TR-HPL-2001-234, 2001.
|
| |
30
|
M.R. Naphade, X. Zhou, and T.S. Huang, "Image classification using a set of labeled and unlabeled images", Proc. SPIE, 2000.
|
| |
31
|
|
| |
32
|
M. Szummer and T. Jaakkola, "Information Regularization with Partially Labeled Data", Proc. NIPS, 2002.
|
 |
33
|
Kristin P. Bennett , Usama Fayyad , Dan Geiger, Density-based indexing for approximate nearest-neighbor queries, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, p.233-243, August 15-18, 1999, San Diego, California, United States
[doi> 10.1145/312129.312236]
|
|