ACM Home Page
Please provide us with feedback. Feedback
Semantic manifold learning for image retrieval
Full text PdfPdf (2.12 MB)
Source International Multimedia Conference archive
Proceedings of the 13th annual ACM international conference on Multimedia table of contents
Hilton, Singapore
SESSION: Best student papers table of contents
Pages: 249 - 258  
Year of Publication: 2005
ISBN:1-59593-044-2
Authors
Yen-Yu Lin  Academia Sinica, Taipei, Taiwan and National Taiwan University, Taipei, Taiwan
Tyng-Luh Liu  Academia Sinica, Taipei, Taiwan
Hwann-Tzong Chen  Academia Sinica, Taipei, Taiwan and National Taiwan University, Taipei, Taiwan
Sponsors
ACM: Association for Computing Machinery
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 17,   Downloads (12 Months): 110,   Citation Count: 9
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1101149.1101193
What is a DOI?

ABSTRACT

Learning the user's semantics for CBIR involves two different sources of information: the similarity relations entailed by the content-based features, and the relevance relations specified in the feedback. Given that, we propose an augmented relation embedding (ARE) to map the image space into a semantic manifold that faithfully grasps the user's preferences. Besides ARE, we also look into the issues of selecting a good feature set for improving the retrieval performance. With these two aspects of efforts we have established a system that yields far better results than those previously reported. Overall, our approach can be characterized by three key properties: 1) The framework uses one relational graph to describe the similarity relations, and the other two to encode the relevant/irrelevant relations indicated in the feedback. 2) With the relational graphs so defined, learning a semantic manifold can be transformed into solving a constrained optimization problem, and is reduced to the ARE algorithm accounting for both the representation and the classification points of views. 3) An image representation based on augmented features is introduced to couple with the ARE learning. The use of these features is significant in capturing the semantics concerning different scales of image regions. We conclude with experimental results and comparisons to demonstrate the effectiveness of our method.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
M. Belkin and P. Niyogi. Laplacian eigenmaps and spectral techniques for embedding and clustering. In Neural Information Processing Systems, 2001.
 
3
Y. Bengio, J.-F. Paiement, P. Vincent, O. Delalleau, N. Roux, and M. Ouimet. Out-of-sample extensions for lle, isomap, mds, eigenmaps, and spectral clustering. In Neural Information Processing Systems, 2003.
 
4
 
5
 
6
 
7
T. Cox and M. Cox. Multidimentional Scaling. Chapman & Hall, London, 1994.
 
8
9
10
 
11
X. He and P. Niyogi. Locality preserving projections. In Neural Information Processing Systems, 2003.
12
 
13
14
 
15
 
16
 
17
K. Mikolajczyk and C. Schmid. A performance evaluation of local descriptors. In Int'l Conference on Computer Vision and Pattern Recognition, pages 275--263, 2003.
 
18
A. Opelt, M. Fussenegger, A. Pinz, and P. Auer. Weak hypotheses and boosting for generic object detection and recognition. In Euro. Conference on Computer Vision, pages 71--84, 2004.
19
20
 
21
S. Roweis and L. Saul. Nonlinear dimensionality reduction by locally linear embedding. Science, 290:2323--2326, 2000.
 
22
 
23
Y. Rui and T. Huang. Optimizing learning in image retrieval. In Int'l Conference on Computer Vision and Pattern Recognition, pages 236--243, 2000.
 
24
Y. Rui, T. Huang, and S. Mehrotra. Content-based image retrieval with relevance feedback in mars. In Int'l Conference on Image Processing, pages 815--818, 1997.
 
25
 
26
 
27
J. Tenenbaum, V. de Silva, and J. Langford. A global geometric framework for nonlinear dimensionality reduction. Science, 290:2319--2323, 2000.
 
28
K. Tieu and P. Viola. Boosting image retrieval. In Int'l Conference on Computer Vision and Pattern Recognition, pages 1228--1235, 2000.
29
 
30
V. Vapnik. Statistical Learning Theory. Wiley, 1998.
 
31

CITED BY  9

Collaborative Colleagues:
Yen-Yu Lin: colleagues
Tyng-Luh Liu: colleagues
Hwann-Tzong Chen: colleagues