|
ABSTRACT
With rapid advances in video processing technologies and ever fast increments in network bandwidth, the popularity of video content publishing and sharing has made similarity search an indispensable operation to retrieve videos of user interests. The video similarity is usually measured by the percentage of similar frames shared by two video sequences, and each frame is typically represented as a high-dimensional feature vector. Unfortunately, high complexity of video content has posed the following major challenges for fast retrieval: (a) effective and compact video representations, (b) efficient similarity measurements, and (c) efficient indexing on the compact representations. In this paper, we propose a number of methods to achieve fast similarity search for very large video database. First, each video sequence is summarized into a small number of clusters, each of which contains similar frames and is represented by a novel compact model called Video Triplet (ViTri). ViTri models a cluster as a tightly bounded hypersphere described by its position, radius, and density. The ViTri similarity is measured by the volume of intersection between two hyperspheres multiplying the minimal density, i.e., the estimated number of similar frames shared by two clusters. The total number of similar frames is then estimated to derive the overall similarity between two video sequences. Hence the time complexity of video similarity measure can be reduced greatly. To further reduce the number of similarity computations on ViTris, we introduce a new one dimensional transformation technique which rotates and shifts the original axis system using PCA in such a way that the original inter-distance between two high-dimensional vectors can be maximally retained after mapping. An efficient B+-tree is then built on the transformed one dimensional values of ViTris' positions. Such a transformation enables B+-tree to achieve its optimal performance by quickly filtering a large portion of non-similar ViTris. Our extensive experiments on real large video datasets prove the effectiveness of our proposals that outperform existing methods significantly.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
Stefan Berchtold , Christian Böhm , Hans-Peter Kriegal, The pyramid-technique: towards breaking the curse of dimensionality, Proceedings of the 1998 ACM SIGMOD international conference on Management of data, p.142-153, June 01-04, 1998, Seattle, Washington, United States
|
 |
3
|
|
| |
4
|
|
| |
5
|
H. Chang, S. Sull, and S. Lee. Efficient video indexing scheme for content-based retrieval. In IEEE Transactions on Circuits and Systems for Video Technology, 1999.
|
| |
6
|
S. Cheung and A. Zakhor. Efficient video similarity measurement with video signature. In IEEE Transactions on Circuits and Systems for Video Technology, 2003.
|
| |
7
|
P. Indyk, G. Iyengar, and N. Shivakumar. Finding pirated video sequences on the internet. In Tech. Rep., Stanford Infolab, 1999.
|
| |
8
|
G. Iyengar and A. Lippman. Distributional clustering for efficient content-based retrieval of images and video. In ICIP, pages 81--84, 2000.
|
| |
9
|
J. Jacquelin. Problem of hyperspace. In http: //maths-express.ifrance.com/mathsexpress/articles/voir/hyperchevre.htm, 2004.
|
| |
10
|
C. Jensen, D. Lin, and B. C.Ooi. Query and update efficient b+-tree based indexing of moving objects. In VLDB, 2004.
|
| |
11
|
H. Jin, B. Ooi, H. Shen, C. Yu, and A. Zhou. An Adaptive and Efficient Dimensionality Reduction Algorithm for High-Dimensional Indexing. In ICDE, pages 87--98, 2003.
|
 |
12
|
|
| |
13
|
M. R. Naphade, R. Wang, and T. S. Huang. Multimodal pattern matching for audio-visual query and retrieval. In SPIE, pages 188--195, 2001.
|
| |
14
|
N. Vasconcelos. On the complexity of probabilistic image retrieval. In ICCV, 2001.
|
| |
15
|
|
CITED BY 13
|
|
Junqi Zhang , Xiangdong Zhou , Wei Wang , Baile Shi , Jian Pei, Using high dimensional indexes to support relevance feedback based interactive images retrieval, Proceedings of the 32nd international conference on Very large data bases, September 12-15, 2006, Seoul, Korea
|
|
|
Xiangmin Zhou , Xiaofang Zhou , Heng Tao Shen, Efficient similarity search by summarization in large video database, Proceedings of the eighteenth conference on Australasian database, p.161-167, January 30-February 02, 2007, Ballarat, Victoria, Australia
|
|
|
|
|
|
|
|
|
Heng Tao Shen , Xiaofang Zhou , Zi Huang , Jie Shao , Xiangmin Zhou, UQLIPS: a real-time near-duplicate video clip detection system, Proceedings of the 33rd international conference on Very large data bases, September 23-27, 2007, Vienna, Austria
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Xiangmin Zhou , Xiaofang Zhou , Lei Chen , Athman Bouguettaya , Nong Xiao , John A. Taylor, An efficient near-duplicate video shot detection method using shot-based interest points, IEEE Transactions on Multimedia, v.11 n.5, p.879-891, August 2009
|
|