ABSTRACT
This paper presents methods for automatically creating pictorial video summaries that resemble comic books. The relative importance of video segments is computed from their length and novelty. Image and audio analysis is used to automatically detect and emphasize meaningful events. Based on this importance measure, we choose relevant keyframes. Selected keyframes are sized by importance, and then efficiently packed into a pictorial summary. We present a quantitative measure of how well a summary captures the salient events in a video, and show how it can be used to improve our summaries. The result is a compact and visually pleasing summary that captures semantically important events, and is suitable for printing or Web access. Such a summary can be further enhanced by including text captions derived from OCR or other methods. We describe how the automatically generated summaries are used to simplify access to a large collection of videos.
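As a rough illustration of the first two steps, the sketch below scores segments and maps the scores to keyframe sizes. The abstract states only that importance depends on length and novelty, so the specific formula here (segment length multiplied by the log of the inverse share of the video taken up by the segment's cluster) is an assumption, as are the names `Segment`, `importance_scores`, and `keyframe_sizes` and the quartile-style size thresholds. Packing the sized frames into a page layout is not shown.

```python
import math
from dataclasses import dataclass


@dataclass
class Segment:
    length: float  # duration in seconds
    cluster: int   # hypothetical label grouping visually similar segments


def importance_scores(segments):
    """Score segments so that long segments from rare clusters rank highest."""
    total = sum(s.length for s in segments)
    cluster_length = {}
    for s in segments:
        cluster_length[s.cluster] = cluster_length.get(s.cluster, 0.0) + s.length
    # Assumed novelty term: log(1 / weight), where weight is the share of the
    # video occupied by the segment's cluster. Rare clusters get a larger term.
    return [
        s.length * math.log(total / cluster_length[s.cluster])
        for s in segments
    ]


def keyframe_sizes(scores, max_size=3):
    """Map scores to integer frame sizes; size 0 means the segment is dropped."""
    top = max(scores)
    if top <= 0:
        # All segments equally unremarkable (e.g. a single cluster).
        return [1 for _ in scores]
    # Assumed thresholds: equal-width bands of the score relative to the maximum.
    return [min(max_size, int(score / top * (max_size + 1))) for score in scores]


if __name__ == "__main__":
    video = [Segment(10, 0), Segment(10, 0), Segment(20, 1), Segment(5, 2)]
    scores = importance_scores(video)
    for seg, score, size in zip(video, scores, keyframe_sizes(scores)):
        print(f"{seg.length:5.1f}s cluster {seg.cluster}: "
              f"score {score:6.2f}, size {size}")
```

Under these assumptions a short segment from a rare cluster can outrank a longer segment from a common one, which is the behavior the length-and-novelty description suggests.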