|
ABSTRACT
Automatic linguistic indexing of pictures is an important but highly challenging problem for researchers in computer vision and content-based image retrieval. In this paper, we introduce a statistical modeling approach to this problem. Categorized images are used to train a dictionary of hundreds of concepts automatically based on statistical modeling. Images of any given concept category are regarded as instances of a stochastic process that characterizes the category. To measure the extent of association between an image and the textual description of a category of images, the likelihood of the occurrence of the image based on the stochastic process derived from the category is computed. A high likelihood indicates a strong association. In our experimental implementation, the ALIP (Automatic Linguistic Indexing of Pictures) system, we focus on a particular group of stochastic processes for describing images, that is, the two-dimensional multiresolution hidden Markov models (2-D MHMMs). We implemented and tested the system on a photographic image database of 600 different semantic cat- egories, each with about 40 training images. Tested using 3,000 images outside the training database, the system has demonstrated good accuracy and high potential in linguistic indexing of these test images.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
K. Barnard, D. Forsyth, "Learning the Semantics of Words and Pictures," Proc. ICCV, vol 2, pp. 408--415, 2001.
|
| |
2
|
|
| |
3
|
L. Breiman, J. Friedman, C.J. Stone, R.A. Olshen, Classification and Regression Trees, Wadsworth Int. Group, Belmont, Calif., 1984.
|
| |
4
|
|
| |
5
|
R. Chellappa and A. K. Jain, Markov Random Fields: Theory and Applications, Academic Press, 1993.
|
| |
6
|
|
| |
7
|
|
| |
8
|
|
| |
9
|
|
| |
10
|
R. L. Dobrushin, "The description of a random field by means of conditional probabilities and conditions of its regularity," Theory Prob. Appl., vol. 13, pp. 197--224, 1968.
|
| |
11
|
Myron Flickner , Harpreet Sawhney , Wayne Niblack , Jonathan Ashley , Qian Huang , Byron Dom , Monika Gorkani , Jim Hafner , Denis Lee , Dragutin Petkovic , David Steele , Peter Yanker, Query by Image and Video Content: The QBIC System, Computer, v.28 n.9, p.23-32, September 1995
[doi> 10.1109/2.410146]
|
| |
12
|
|
| |
13
|
S. Geman and D. Geman, "Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. PAMI-6, no. 6, pp. 721--741, Nov. 1984.
|
| |
14
|
R. Kindermann and L. Snell, Markov Random Fields and Their Applications, American Mathematical Society, 1980.
|
| |
15
|
J. Li, R. M. Gray, R. A. Olshen, "Multiresolution image classification by hierarchical modeling with two dimensional hidden Markov models," IEEE Trans. on Information Theory, vol. 46, no. 5, pp. 1826--41, August 2000.
|
| |
16
|
|
| |
17
|
J. Li, A. Najmi, R. M. Gray, "Image classification by a two dimensional hidden Markov model," IEEE Trans. on Signal Processing, vol. 48, no. 2, pp. 517--33, February 2000.
|
| |
18
|
|
 |
19
|
|
| |
20
|
A. Pentland, R. W. Picard, S. Sclaroff, "Photobook: Tools for content-based manipulation of image databases," Proc. of SPIE, vol. 2185, pp. 34--47, San Jose, February 7--8, 1994.
|
| |
21
|
|
| |
22
|
J. R. Smith, S.-F. Chang, "An image and video search engine for the World-Wide Web," Proc. of SPIE, vol. 3022, pp. 84--95, 1997.
|
| |
23
|
M. Unser, "Texture classification and segmentation using wavelet frames," IEEE Trans. Image Processing, vol. 4, no. 11, pp. 1549--1560, Nov. 1995.
|
| |
24
|
|
| |
25
|
|
| |
26
|
|
| |
27
|
|
CITED BY 11
|
|
|
|
|
|
|
|
Yuhang Wang , Fillia Makedon , James Ford , Li Shen , Dina Goldin, Generating fuzzy semantic metadata describing spatial relations from images using the R-histogram, Proceedings of the 4th ACM/IEEE-CS joint conference on Digital libraries, June 07-11, 2004, Tuscon, AZ, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|