ACM Home Page
Please provide us with feedback. Feedback
Near-duplicate keyframe retrieval by nonrigid image matching
Full text PdfPdf (3.10 MB)
Source
International Multimedia Conference archive
Proceeding of the 16th ACM international conference on Multimedia table of contents
Vancouver, British Columbia, Canada
SESSION: Content track C1: duplicate detection table of contents
Pages 41-50  
Year of Publication: 2008
ISBN:978-1-60558-303-7
Authors
Jianke Zhu  The Chinese University of Hong Kong, Hong Kong, Hong Kong
Steven C.H. Hoi  Nanyang Technological University, Singapore, Singapore
Michael R. Lyu  The Chinese University of Hong Kong, Hong Kong, Hong Kong
Shuicheng Yan  National University of Singapore, Singapore, Singapore
Sponsors
ACM: Association for Computing Machinery
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 194,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1459359.1459366
What is a DOI?

ABSTRACT

Near-duplicate image retrieval plays an important role in many real-world multimedia applications. Most previous approaches have some limitations. For example, conventional appearance-based methods may suffer from the illumination variations and occlusion issue, and local feature correspondence-based methods often do not consider local deformations and the spatial coherence between two point sets. In this paper, we propose a novel and effective Nonrigid Image Matching (NIM) approach to tackle the task of near-duplicate keyframe retrieval from real-world video corpora. In contrast to previous approaches, the NIM technique can recover an explicit mapping between two near-duplicate images with a few deformation parameters and find out the correct correspondences from noisy data effectively. To make our technique applicable to large-scale applications, we suggest an effective multi-level ranking scheme that filters out the irrelevant results in a coarse-to-fine manner. In our ranking scheme, to overcome the extremely small training size challenge, we employ a semi-supervised learning method for improving the performance using unlabeled data. To evaluate the effectiveness of our solution, we have conducted extensive experiments on two benchmark testbeds extracted from the TRECVID2003 and TRECVID2004 corpora. The promising results show that our proposed method is more effective than other state-of-the-art approaches for near-duplicate keyframe retrieval.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
H. Bay, T. Tuytelaars, and L. J. V. Gool. Surf: Speeded up robust features. In Proc. European Conf. Computer Vision, pages 404--417, 2006.
 
4
 
5
 
6
7
 
8
 
9
 
10
C.-H. Hoi, W. Wang, and M. R. Lyu. A novel scheme for video similarity detection. In CIVR, pages 373--382, 2003.
 
11
S. C. Hoi and M. R. Lyu. A multi-modal and multi-level ranking framework for content-based video retrieval. To appear in IEEE Transactions on Multimedia, 2008.
 
12
M. Kass, A. Witkin, and D. Terzopoulos. Snakes: Active contour models. Int'l J. Computer Vision, 1(4):321--331, Jan. 1988.
13
 
14
 
15
 
16
17
 
18
T. Ojala, M. Pietikainen, and D. Harwood. A comparative study of texture measures with classification based on feature distributions. 29(1):51--59, January 1996.
 
19
 
20
21
 
22
 
23
TRECVID. TREC video retrieval evaluation. In http://www-nlpir.nist.gov/projects/trecvid/.
 
24
V. N. Vapnik. Statistical Learning Theory. John Wiley & Sons, 1998.
25
26
27
 
28
Z. Xu, R. Jin, J. Zhu, I. King, and M. R. Lyu. Efficient convex relaxation for transductive support vector machine. In NIPS'2007, 2007.
29
30
 
31
W. Zhao, Y. Jiang, and C. Ngo. Keyframe retrieval by keypoints: Can point-to-point matching help? In CIVR06, pages 72--81, 2006.
 
32
W.-L. Zhao, C.-W. Ngo, H. K. Tan, and X. Wu. Near-duplicate keyframe identification with interest point matching and pattern learning. IEEE Trans. on Multimedia, 9(5):1037--1048, 2007.
 
33
J. Zhu. Semi-supervised learning literature survey. Technical report, Carnegie Mellon University, 2005.
 
34
J. Zhu, S. C. Hoi, and M. R. Lyu. Face annotation by transductive kernel fisher discriminant. IEEE Trans. on Multimedia, 10(1):86--96, 2008.
 
35
J. Zhu and M. R. Lyu. Progressive finite newton approach to real-time nonrigid surface detection. In Proc. Conf. Computer Vision and Pattern Recognition, pages 1--8, 2007.
 
36
J. Zhu, M. R. Lyu, and T. S. Huang. A fast 2d shape recovery approach by fusing features and appearance. To appear in IEEE Trans. Pattern Anal. Mach. Intell., 2008.

Collaborative Colleagues:
Jianke Zhu: colleagues
Steven C.H. Hoi: colleagues
Michael R. Lyu: colleagues
Shuicheng Yan: colleagues