ABSTRACT
Learning a user's semantics for content-based image retrieval (CBIR) involves two different sources of information: the similarity relations entailed by the content-based features, and the relevance relations specified in the feedback. Accordingly, we propose an augmented relation embedding (ARE) that maps the image space into a semantic manifold that faithfully captures the user's preferences. Besides ARE, we also examine how to select a good feature set for improving retrieval performance. Combining these two efforts, we have established a system that yields far better results than those previously reported. Overall, our approach can be characterized by three key properties: 1) The framework uses one relational graph to describe the similarity relations, and two others to encode the relevant/irrelevant relations indicated in the feedback. 2) With the relational graphs so defined, learning a semantic manifold can be transformed into solving a constrained optimization problem, which reduces to the ARE algorithm and accounts for both the representation and the classification points of view. 3) An image representation based on augmented features is introduced to couple with the ARE learning. These features are significant in capturing the semantics at different scales of image regions. We conclude with experimental results and comparisons to demonstrate the effectiveness of our method.
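To make the first property concrete, the following is a minimal sketch of how three relational graphs might be built from image features and one round of feedback. It is an illustration under stated assumptions, not the construction used in the paper: the abstract does not specify the edge rules, so the k-nearest-neighbor similarity graph, the rule that relevant images are linked to each other, and the rule that each relevant image is linked to each irrelevant one are all assumed here. The function names `knn_similarity_graph` and `feedback_graphs`, and the toy feature vectors, are hypothetical.

```python
from math import dist  # Euclidean distance, Python 3.8+


def knn_similarity_graph(features, k):
    """Symmetric 0/1 adjacency matrix (ASSUMED rule): i and j are linked
    if either one is among the other's k nearest neighbors."""
    n = len(features)
    W = [[0] * n for _ in range(n)]
    for i in range(n):
        others = sorted(
            (j for j in range(n) if j != i),
            key=lambda j: dist(features[i], features[j]),
        )
        for j in others[:k]:
            W[i][j] = W[j][i] = 1
    return W


def feedback_graphs(n, relevant, irrelevant):
    """Two 0/1 adjacency matrices built from user feedback (ASSUMED rules).

    P links every pair of images marked relevant.
    N links every relevant image to every irrelevant image.
    """
    P = [[0] * n for _ in range(n)]
    N = [[0] * n for _ in range(n)]
    for i in relevant:
        for j in relevant:
            if i != j:
                P[i][j] = 1
        for j in irrelevant:
            N[i][j] = N[j][i] = 1
    return P, N


# Toy data: five images described by hypothetical 2-D feature vectors.
features = [(0, 0), (0, 1), (5, 5), (5, 6), (10, 0)]
W = knn_similarity_graph(features, k=1)

# Hypothetical feedback: images 0 and 1 are relevant, image 2 is irrelevant.
P, N = feedback_graphs(len(features), relevant={0, 1}, irrelevant={2})
```

In a sketch like this, `W` depends only on the content-based features, while `P` and `N` depend only on the feedback, which mirrors the two sources of information named in the abstract. An embedding step would then look for a mapping that keeps neighbors in `W` and `P` close while pushing apart the pairs linked in `N`; that optimization is not shown here because the abstract does not state its objective or constraints.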