ACM Home Page
Please provide us with feedback. Feedback
Video summarization preserving dynamic content
Full text PdfPdf (1.80 MB)
Source
International Multimedia Conference archive
Proceedings of the international workshop on TRECVID video summarization table of contents
Augsburg, Bavaria, Germany
Pages: 40 - 44  
Year of Publication: 2007
ISBN:978-1-59593-780-3
Authors
Francine Chen  FX Palo Alto Laboratory Inc., Palo Alto, CA
Matthew Cooper  FX Palo Alto Laboratory Inc., Palo Alto, CA
John Adcock  FX Palo Alto Laboratory Inc., Palo Alto, CA
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 43,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1290031.1290038
What is a DOI?

ABSTRACT

This paper describes a system for selecting excerpts from unedited video and presenting the excerpts in a short summary video for efficiently understanding the video contents. Color and motion features are used to divide the video into segments where the color distribution and camera motion are similar. Segments with and without camera motion are clustered separately to identify redundant video. Audio features are used to identify clapboard appearances for exclusion. Representative segments from each cluster are selected for presentation. To increase the original material contained within the summary and reduce the time required to view the summary, selected segments are played back at a higher rate based on the amount of detected camera motion in the segment. Pitch-preserving audio processing is used to better capture the sense of the original audio. Metadata about each segment is overlayed on the summary to help the viewer understand the context of the summary segments in the original video.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
J.-Y. Bouget. Pyramidal implementation of the lucas kanade feature tracker. description of the algorithm. Technical report, Intel Corporation Microprocessor Research Lab, 2000.
 
2
M. Cooper and J. Foote. Scene boundary detection via video self-similarity analysis. In Proc. IEEE International Conference on Image Processing (ICIP), pages 378--81, 2001.
 
3
B. Gunsel, M. Ferman, and A. M. Tekalp. Temporal video segmentation using unsupervised clustering and semantic object tracking. Journal of Electronic Imaging, 7:592--604, July 1998.
 
4
 
5
B. Lucas and T. Kanade. An iterative image registration technique with an application to stereo vision. In IJCAI81, pages 674--679, 1981.
 
6
A. Ng, M. Jordan, and Y. Weiss. On spectral clustering: Analysis and an algorithm. In Advances in Neural Information Processing Systems 14: Proceedings of the 2001., 2001.
 
7
J.-M. Odobez, D. Gatica-Perez, and M. Guillemot. Video shot clustering using spectral methods. In Int. Workshop on Content-based Multimedia Indexing (CBMI), 2003.
 
8
Open source computer vision library. http://www.intel.com/technology/computing/opencv/.
9


Collaborative Colleagues:
Francine Chen: colleagues
Matthew Cooper: colleagues
John Adcock: colleagues