ACM Home Page
Please provide us with feedback. Feedback
Real-time computerized annotation of pictures
Full text PdfPdf (1.59 MB)
Source International Multimedia Conference archive
Proceedings of the 14th annual ACM international conference on Multimedia table of contents
Santa Barbara, CA, USA
SESSION: Content session 5: image annotation table of contents
Pages: 911 - 920  
Year of Publication: 2006
ISBN:1-59593-447-2
Authors
Jia Li  Pennsylvania State University
James Z. Wang  Pennsylvania State University
Sponsors
ACM: Association for Computing Machinery
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 14,   Downloads (12 Months): 82,   Citation Count: 19
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1180639.1180841
What is a DOI?

ABSTRACT

Automated annotation of digital pictures has been a highly challenging problem for computer scientists since the invention of computers. The capability of annotating pictures by computers can lead to breakthroughs in a wide range of applications including Web image search, online picture-sharing communities, and scientific experiments. In our work, by advancing statistical modeling and optimization techniques, we can train computers about hundreds of semantic concepts using example pictures from each concept. The ALIPR (Automatic Linguistic Indexing of Pictures -Real Time)system of fully automatic and high speed annotation for online pictures has been constructed. Thousands of pictures from an Internet photo-sharing site, unrelated to the source of those pictures used in the training process, have been tested. The experimental results show that a single computer processor can suggest annotation terms in real-time and with good accuracy.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
D. Beymer and T. Poggio, "Image representations for visual learning," Science vol. 272, pp. 1905--1909, 1996.
 
3
P.J. Bickel and D.A. Freedman, "Some asymptotic theory for the bootstrap," Annals of Statistics vol. 9, pp. 1196--1217, 1981.
 
4
S.-F. Chang, W .Chen, and H. Sundaram, "Semantic visual templates: Linking visual features to semantics," In Proc. Int. Conf. on Image Processing, vol. 3, pp. 531--535, Chicago, IL, 1998.
5
 
6
 
7
M. Evans, N. Hastings, and B. Peacock, Statistical Distributions 3rd ed., John Wiley & Sons, Inc., 2000.
 
8
 
9
T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical Learning: Data Mining, Inferences, and Prediction Springer-Verlag, New York, 2001.
10
11
 
12
E. Levina and P. Bickel, "The earth mover's distance is the Mallows distance: Some insights from statistics," In Proc. Int. Conf. on Computer Vision pp. 251--256, Vancouver, Canada, 2001.
 
13
 
14
C.L. Mallows, "A not on asymptotic joint normality," Annals of Mathematical Statistics vol. 43, no. 2, pp. 508--515, 1972.
15
 
16
Y. Rui, T.S. Huang, M. Ortega, and S. Mehrotra, "Relevance feedback: A power tool in interactive content-based image retrieval," IEEE Transactions on Circuits and Systems for Video Technology, vol. 8, no. 5, pp. 644--655, 1998.
 
17
A. Singhal, J. Luo, and W. Zhu, "Probabilistic spatial context models for scene content understanding," In Proc. IEEE Int. Conf. on Computer Vision and Pattern Recognition 2003.
 
18
19
 
20
 
21
C. Tomasi, "Past performance and future results," Nature vol. 428, page 378, March 2004.
22
 
23
N. Vasconcelos and A. Lippman, "A multiresolution manifold distance for invariant image similarity," IEEE Transactions on Multimedia vol. 7, no. 1, pp. 127--142, 2005.
 
24
25
 
26
C. Zhang and T. Chen, "An active learning framework for content-based information retrieval," IEEE Transactions on Multimedia vol. 4, no. 2, pp. 260--268, 2002.

CITED BY  19