ACM Home Page
Please provide us with feedback. Feedback
Video Manga: generating semantically meaningful video summaries
Full text PdfPdf (3.41 MB)
Source International Multimedia Conference archive
Proceedings of the seventh ACM international conference on Multimedia (Part 1) table of contents
Orlando, Florida, United States
Pages: 383 - 392  
Year of Publication: 1999
ISBN:1-58113-151-8
Authors
Shingo Uchihashi  FX Palo Alto Laboratory, 3400 Hillview Avenue, Palo Alto, CA
Jonathan Foote  FX Palo Alto Laboratory, 3400 Hillview Avenue, Palo Alto, CA
Andreas Girgensohn  FX Palo Alto Laboratory, 3400 Hillview Avenue, Palo Alto, CA
John Boreczky  FX Palo Alto Laboratory, 3400 Hillview Avenue, Palo Alto, CA
Sponsors
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
SIGCOMM: ACM Special Interest Group on Data Communication
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 18,   Downloads (12 Months): 109,   Citation Count: 51
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/319463.319654
What is a DOI?

ABSTRACT

This paper presents methods for automatically creating pictorial video summaries that resemble comic books. The relative importance of video segments is computed from their length and novelty. Image and audio analysis is used to automatically detect and emphasize meaningful events. Based on this importance measure, we choose relevant keyframes. Selected keyframes are sized by importance, and then efficiently packed into a pictorial summary. We present a quantitative measure of how well a summary captures the salient events in a video, and show how it can be used to improve our summaries. The result is a compact and visually pleasing summary that captures semantically important events, and is suitable for printing or Web access. Such a summary can be further enhanced by including text captions derived from OCR or other methods. We describe how the automatically generated summaries are used to simplify access to a large collection of videos.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
 
3
Boreczky, J. and Rowe, L., "Comparison of Video Shot Boundary Detection Techniques," in Proc. SPIE Conference on Storage and Retrieval for Still Image and 'Video Databases IE, San Jose, CA, February, 1996, pp. t70-179.
 
4
5
 
6
Foote, J., Boreczky, J., and Wilcox, L., "Finding Presentations in Recorded Meetings Using Audio and Video Features," in Proc. 1CASSP '99, Vol. 6, pp. 3045-3048, 1999.
 
7
Girgensohn, A. and Boreczky, J., "Time-Constrained Keyframe Selection Technique," in IEEE Multimedia Systems '99, IEEE Computer Society, Vol. 1, pp. 756- 761, 1999.
 
8
Girgensohn, A. and Foote, J., "Video Frame Classification Using Transform Coefficients," in Proc. ICASSP '99, Vol. 6, pp. 3045-3048, 1999.
 
9
Huang, Q., Liu, Z. and Rosenberg, A., "Automated Semantic Structure Reconstruction and Representation Generation for Broadcast News," in Proc. iS& T/SPIE Conference on Storage and Retrieval for Image and Video Databases VII, Vol. 3656, pp. 50-62, 1999.
 
10
ISO MPEG 7 Content Set, Item V20, "Korea's Pop Singers' Live Music Show", Korean Broadcasting System, 1998.
 
11
Pfeiffer, S., Lienhart, R., Fischer, S. and Effelsberg, W., "Abstracting digital movies automatically," in Journal of Visual Communication and Image Representation, 7(4), pp. 345-353, December 1996.
 
12
13
 
14
15
 
16
Uchihashi, S. and Foote, J., "Summarizing Video Using a Shot Importance Measure and a Frame-Packing Algorithm," in Proc. ICASSP '99, Vol. 6, pp. 3041- 3044, 1999.
 
17
Yeo, B-L. and Yeung, M., "Classification, Simplification and Dynamic Visualization of Scene Transition Graphs for Video Browsing," in Proc. IS&T/SPIE Electronic Imaging 98: Storage and Retrieval for Image and Wdeo Databases VI.
 
18
Yeung, M. M., Yeo, B. L., Wolf, W. and Liu, B., "Video Browsing using Clustering and Scene Transitions on Compressed Sequences," in SPIE Vol. 2417 Multimedia Computing and Networking 1995, pp. 399-413, Feb. 1995.
 
19
Yeung, M. and Yeo, B-L., "Video Visualization for Compact Presentation and Fast Browsing of Pictorial Content," in IEEE Trans. Circuits and Svs. for Video Technology, Vol. 7, No. 5, pp. 771-785, Oct. 1997.
 
20
Yu, H., Clark, C., Malkin, R. and Waibel, A., "Experiments In Automatic Meeting Transcription Using JRTK," in Proc. ICASSP 98, pp. 921-924, 1998.
21
 
22
Zhuang, Y., Rui, Y., Huang, T.S. and Mehrotra, S., "Adaptive Key Frame Extraction Using Unsupervised Clustering," in Proc. ICIP '98, Vol. I, pp. 866-870, 1998.

CITED BY  51

Collaborative Colleagues:
Shingo Uchihashi: colleagues
Jonathan Foote: colleagues
Andreas Girgensohn: colleagues
John Boreczky: colleagues