| Video keyframe production by efficient clustering of compressed chromaticity signatures (poster session) |
| Full text |
Pdf
(330 KB)
|
| Source
|
International Multimedia Conference
archive
Proceedings of the eighth ACM international conference on Multimedia
table of contents
Marina del Rey, California, United States
Pages: 365 - 367
Year of Publication: 2000
ISBN:1-58113-198-4
|
|
Authors
|
|
Mark S. Drew
|
School of Computing Science, Simon Fraser University, Vancouver, B.C. Canada V5A 1S6
|
|
James Au
|
School of Computing Science, Simon Fraser University, Vancouver, B.C. Canada V5A 1S6
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 8, Downloads (12 Months): 26, Citation Count: 6
|
|
|
ABSTRACT
We develop a new low-dimensional video frame feature that is more insensitive to lighting change, motivated by color constancy work in physics-based vision, and apply the feature to keyframe production using hierarchical clustering. The new feature has the further advantage of more expressively capturing image information and as a result produces a very succinct set of keyframes for any video. Because we effectively reduce any video to the same lighting conditions, we can produce a universal basis on which to project video frame features. We carry out clustering efficiently by adapting a hierarchical clustering data structure to temporally-ordered clusters. Using a new multi-stage hierarchical clustering method, we merge clusters based on the ratio of cluster variance to variance of the parent node, merging only adjacent clusters, and then follow with a second round of clustering. The second stage merges clusters incorrectly split in the first round by the greedy hierarchical algorithm, and as well merges non-adjacent clusters to fuse near-repeat shots. The new summarization method produces a very succinct set of keyframes for videos, and results are excellent.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
B.-L. Yeo and M.M. Yeung. Classification, simplification and dynamic visualization of scene transition graphs for video browsing. In SPIE Storage and Retrieval for Image and Video Databases VI, 1998.
|
| |
2
|
|
| |
3
|
D. Zhong, H. Zhang, and S.-F. Chang. Clustering methods for video browsing and annotation. In SPIE Storage and Retrieval for Image and Video Databases IV, pages 239-246,1996.
|
| |
4
|
|
| |
5
|
A.M. Ferman and A.M. Tekalp. Efficient filtering and clustering methods for temporal video segmentation and visual summarization. J. Vis. Commun. & lmage Rep., 9:336-351, 1998.
|
| |
6
|
A.M. Ferman and A.M. Tekalp. Multiscale content extraction and representation for video indexing. In SPIE Multimedia Storage and Archiving Systems 11, 1997.
|
 |
7
|
H. J. Zhang , C. Y. Low , S. W. Smoliar , J. H. Wu, Video parsing, retrieval and browsing: an integrated and content-based solution, Proceedings of the third ACM international conference on Multimedia, p.15-24, November 05-09, 1995, San Francisco, California, United States
[doi> 10.1145/217279.215068]
|
| |
8
|
A. Hanjalic, M. Ceccarelli, R.L. Lagendijk, and J. Biemond. Automation of systems enabling search on stored video data. In SPIE Storage and Retrieval for Image and Video Databases V, pages 427--438,1997.
|
 |
9
|
|
| |
10
|
|
| |
11
|
J. Wei, M.S. Drew, and Z.N. Li. Illumination invariant video segmentation by hierarchical robust thresholding. In Electronic Imaging 198: Storage and Retrieval for Image and Video Databases I/1, pages 188-201. SPIE Vol. 3312, 1998.
|
| |
12
|
G.D. Finlayson, P.M. Hubel, and S. Hordley. Colour by correlation. In Fifth Color Imaging Conf., pages 6-11, 1997.
|
| |
13
|
E. Sahouria and A. Zakhor. Content analysis of video using principal components. 1EEE Trans. Circ. Sys. Vid. Tech., 9:1290-1298, 1999.
|
| |
14
|
A. Girgensohnand J. Boreczky.Time-constrained key frame selection technique. In IEEE MM Sys., pages 756-761,1999.
|
| |
15
|
M. S. Drew, J. Wei, and Z.N. Li. Illumination-invariant image retrieval and video segmentation. Pattern Recognition, 32:1369-1388, 1999.
|
| |
16
|
C.E Borges. Trichromatic approximation method for surface illumination. J. Opt. Soc. Am. A, 8:1319-1323,1991.
|
| |
17
|
Mark S. Drew, Ze-Nian Li., and Xiang Zhong. Video dissolve and wipe detection via spatio-temporal images of chromatic histogram differences. In 1CIP'O0, 2000. To appear.
|
|