ACM Home Page
Please provide us with feedback. Feedback
Tagging over time: real-world image annotation by lightweight meta-learning
Full text PdfPdf (884 KB)
Source
International Multimedia Conference archive
Proceedings of the 15th international conference on Multimedia table of contents
Augsburg, Germany
SESSION: Content 3 - multimedia model learning table of contents
Pages: 393 - 402  
Year of Publication: 2007
ISBN:978-1-59593-702-5
Authors
Ritendra Datta  Pennsylvania State University, University Park, PA
Dhiraj Joshi  Pennsylvania State University, University Park, PA
Jia Li  Pennsylvania State University, University Park, PA
James Z. Wang  Pennsylvania State University, University Park, PA
Sponsors
ACM: Association for Computing Machinery
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 102,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1291233.1291328
What is a DOI?

ABSTRACT

Automatic image annotation has been a hot-pursuit among multimedia researchers of late. Modest performance guarantees and limited adaptability often restrict its applicability to real-world settings. We propose tagging over time (T/T) to push the technology toward real-world applicability. Of particular interest are online systems that receive user-provided images and feedback over time, with user focus possibly changing and evolving. The T/T framework consists of a principled probabilistic approach to meta-learning, which acts as a go-between for a 'black-box' annotation system and the users. Inspired by inductive transfer, the approach attempts to harness available information, including the black-box model's performance, the image representations, and the WordNet ontology. Being computationally 'lightweight', this meta-learner efficiently re-trains over time, to improve and/or adapt to changes. The black-box annotation model is not required to be re-trained, allowing computationally intensive algorithms to be used. We experiment with standard image datasets and real-world data streams, using two existing annotation systems as black-boxes. Both batch and online annotation settings are experimented with. It is observed that the addition of this meta-learning layer produces much improved results that outperform best-known results. For the online setting, the T/T approach produces progressively better annotation with time, significantly outperforming the black-box as well as the static form of the meta-learner, on real-world data.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Alipr. http://www.alipr.com, 2006.
 
2
 
3
 
4
 
5
G. Cauwenberghs and T. Poggio. Incremental and decremental support vector machine learning. In Proc. NIPS, 2001.
 
6
E. Chang, G. Kingshy, G. Sychay, and G. Wu. CBSA: Content-based soft annotation for multimodal image retrieval using bayes point machines. IEEE Trans. on Circuits and Systems for Video Tech, 13:26--38, 2003.
 
7
8
 
9
S. L. Feng, R. Manmatha, and V. Lavrenko. Multiple bernoulli relevance models for image and video annotation. In Proc. IEEE CVPR, 2004.
 
10
Flickr. http://www.flickr.com, Yahoo!, 2005.
 
11
Y. Freund and R. E. Schapire. Experiments with a new boosting algorithm. In Proc. ICML, 1996.
12
13
14
 
15
C. Leacock and M. Chodorow. Combining local context and wordnet similarity for word sense identification. C. Fellbaum, Ed., WordNet: An Electronic Lexical Database, pages 265--283, 1998.
16
17
18
19
20
 
21
D. Silver, G. Bakir, K. Bennett, R. Caruana, M. Pontil, S. Russell, and P. Tadepalli. Inductive transfer: 10 years later. In Int. Workshop at NIPS, 2005.
 
22
23
24
 
25
26


Collaborative Colleagues:
Ritendra Datta: colleagues
Dhiraj Joshi: colleagues
Jia Li: colleagues
James Z. Wang: colleagues