ACM Home Page
Please provide us with feedback. Feedback
Multimodal concept-dependent active learning for image retrieval
Full text PdfPdf (1.20 MB)
Source International Multimedia Conference archive
Proceedings of the 12th annual ACM international conference on Multimedia table of contents
New York, NY, USA
SESSION: Technical session 6: learning in multi-modal data table of contents
Pages: 564 - 571  
Year of Publication: 2004
ISBN:1-58113-893-8
Authors
King-Shy Goh  ECE at University of California and VIMA Technologies, Santa Barbara, CA
Edward Y. Chang  ECE at University of California and VIMA Technologies, Santa Barbara, CA
Wei-Cheng Lai  ECE at University of California and VIMA Technologies, Santa Barbara, CA
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 60,   Citation Count: 14
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1027527.1027664
What is a DOI?

ABSTRACT

It has been established that active learning is effective for learning complex, subjective query concepts for image retrieval. However, active learning has been applied in a concept independent way, (i.e., the kernel-parameters and the sampling strategy are identically chosen) for learning query concepts of differing <i>complexity</i>. In this work, we first characterize a concept's complexity using three measures: <i>hit-rate</i>, <i>isolation</i> and <i>diversity</i>. We then propose a multimodal learning approach that uses images' semantic labels to guide a <i>concept-dependent</i>, <i>active-learning</i> process. Based on the complexity of a concept, we make intelligent adjustments to the sampling strategy and the sampling pool from which images are to be selected and labeled, to improve concept learnability. Our empirical study on a $300$K-image dataset shows that concept-dependent learning is highly effective for image-retrieval accuracy.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
K. Barnard, P. Duygulu, and D. Forsyth. Exploiting text and image feature co-occurrence statistics in large datasets. Trends and Advances in Content-Based Image and Video Retrieval (To Appear), 2004.
 
3
K. Barnard and D. Forsyth. Learning the semantics of words and pictures. In International Conference on Computer Vision, volume~2, pages 408--415, 2000.
 
4
A. B. Benitez and S.-F. Chang. Image classification using multimedia knowledge networks. Proc. of the Int. Conf. on Image Processing, September 2003.
 
5
A. B. Benitez, J. R. Smith, and S.-F. Chang. Medianet: A multimedia information network for knowledge representation. Proceeding of the SPIE Conference on Internet Multimedia Management Systems, November 2000.
 
6
K. Brinker. Incorporating diversity in active learning with support vector machines. Proceedings of the Twentieth International Conference on Machine Learning, pages 59--66, August 2003.
7
8
 
9
 
10
11
 
12
 
13
14
 
15
Y. Mori, H. Takahashi, and R. Oka. Automatic words assignment to images based on image division and vector quantization. In Proc. of RIAO 2000: Content-Based Multimedia Information Access, Apr. 2000.
16
 
17
T. Westerveld. Image retrieval: Content versus context. Content-Based Multimedia Information Access, RIAO, pages 276--284, 2000.
 
18
 
19
20

CITED BY  15

Collaborative Colleagues:
King-Shy Goh: colleagues
Edward Y. Chang: colleagues
Wei-Cheng Lai: colleagues