ABSTRACT
Automated annotation of digital pictures has been a highly challenging problem for computer scientists since the invention of computers. The ability of computers to annotate pictures can lead to breakthroughs in a wide range of applications, including Web image search, online picture-sharing communities, and scientific experiments. In our work, by advancing statistical modeling and optimization techniques, we can train computers to recognize hundreds of semantic concepts using example pictures from each concept. We have constructed the ALIPR (Automatic Linguistic Indexing of Pictures - Real Time) system for fully automatic, high-speed annotation of online pictures. We tested it on thousands of pictures from an Internet photo-sharing site, unrelated to the source of the pictures used in training. The experimental results show that a single computer processor can suggest annotation terms in real time and with good accuracy.