|
ABSTRACT
Near-duplicate image retrieval plays an important role in many real-world multimedia applications. Most previous approaches have some limitations. For example, conventional appearance-based methods may suffer from the illumination variations and occlusion issue, and local feature correspondence-based methods often do not consider local deformations and the spatial coherence between two point sets. In this paper, we propose a novel and effective Nonrigid Image Matching (NIM) approach to tackle the task of near-duplicate keyframe retrieval from real-world video corpora. In contrast to previous approaches, the NIM technique can recover an explicit mapping between two near-duplicate images with a few deformation parameters and find out the correct correspondences from noisy data effectively. To make our technique applicable to large-scale applications, we suggest an effective multi-level ranking scheme that filters out the irrelevant results in a coarse-to-fine manner. In our ranking scheme, to overcome the extremely small training size challenge, we employ a semi-supervised learning method for improving the performance using unlabeled data. To evaluate the effectiveness of our solution, we have conducted extensive experiments on two benchmark testbeds extracted from the TRECVID2003 and TRECVID2004 corpora. The promising results show that our proposed method is more effective than other state-of-the-art approaches for near-duplicate keyframe retrieval.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
H. Bay, T. Tuytelaars, and L. J. V. Gool. Surf: Speeded up robust features. In Proc. European Conf. Computer Vision, pages 404--417, 2006.
|
| |
4
|
|
| |
5
|
|
| |
6
|
|
 |
7
|
|
| |
8
|
|
| |
9
|
|
| |
10
|
C.-H. Hoi, W. Wang, and M. R. Lyu. A novel scheme for video similarity detection. In CIVR, pages 373--382, 2003.
|
| |
11
|
S. C. Hoi and M. R. Lyu. A multi-modal and multi-level ranking framework for content-based video retrieval. To appear in IEEE Transactions on Multimedia, 2008.
|
| |
12
|
M. Kass, A. Witkin, and D. Terzopoulos. Snakes: Active contour models. Int'l J. Computer Vision, 1(4):321--331, Jan. 1988.
|
 |
13
|
|
| |
14
|
M. Lades , J. C. Vorbruggen , J. Buhmann , J. Lange , C. von der Malsburg , R. P. Wurtz , W. Konen, Distortion Invariant Object Recognition in the Dynamic Link Architecture, IEEE Transactions on Computers, v.42 n.3, p.300-311, March 1993
[doi> 10.1109/12.210173]
|
| |
15
|
|
| |
16
|
|
 |
17
|
|
| |
18
|
T. Ojala, M. Pietikainen, and D. Harwood. A comparative study of texture measures with classification based on feature distributions. 29(1):51--59, January 1996.
|
| |
19
|
|
| |
20
|
|
 |
21
|
|
| |
22
|
|
| |
23
|
TRECVID. TREC video retrieval evaluation. In http://www-nlpir.nist.gov/projects/trecvid/.
|
| |
24
|
V. N. Vapnik. Statistical Learning Theory. John Wiley & Sons, 1998.
|
 |
25
|
|
 |
26
|
|
 |
27
|
|
| |
28
|
Z. Xu, R. Jin, J. Zhu, I. King, and M. R. Lyu. Efficient convex relaxation for transductive support vector machine. In NIPS'2007, 2007.
|
 |
29
|
|
 |
30
|
|
| |
31
|
W. Zhao, Y. Jiang, and C. Ngo. Keyframe retrieval by keypoints: Can point-to-point matching help? In CIVR06, pages 72--81, 2006.
|
| |
32
|
W.-L. Zhao, C.-W. Ngo, H. K. Tan, and X. Wu. Near-duplicate keyframe identification with interest point matching and pattern learning. IEEE Trans. on Multimedia, 9(5):1037--1048, 2007.
|
| |
33
|
J. Zhu. Semi-supervised learning literature survey. Technical report, Carnegie Mellon University, 2005.
|
| |
34
|
J. Zhu, S. C. Hoi, and M. R. Lyu. Face annotation by transductive kernel fisher discriminant. IEEE Trans. on Multimedia, 10(1):86--96, 2008.
|
| |
35
|
J. Zhu and M. R. Lyu. Progressive finite newton approach to real-time nonrigid surface detection. In Proc. Conf. Computer Vision and Pattern Recognition, pages 1--8, 2007.
|
| |
36
|
J. Zhu, M. R. Lyu, and T. S. Huang. A fast 2d shape recovery approach by fusing features and appearance. To appear in IEEE Trans. Pattern Anal. Mach. Intell., 2008.
|
|