ACM Home Page
Please provide us with feedback. Feedback
A user attention model for video summarization
Full text PdfPdf (644 KB)
Source International Multimedia Conference archive
Proceedings of the tenth ACM international conference on Multimedia table of contents
Juan-les-Pins, France
SESSION: Session 11: multimedia analysis and retrieval table of contents
Pages: 533 - 542  
Year of Publication: 2002
ISBN:1-58113-620-X
Authors
Yu-Fei Ma  Microsoft Research Asia, Beijing, China
Lie Lu  Microsoft Research Asia, Beijing, China
Hong-Jiang Zhang  Microsoft Research Asia, Beijing, China
Mingjing Li  Microsoft Research Asia, Beijing, China
Sponsors
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
SIGCOMM: ACM Special Interest Group on Data Communication
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 28,   Downloads (12 Months): 240,   Citation Count: 45
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/641007.641116
What is a DOI?

ABSTRACT

Automatic generation of video summarization is one of the key techniques in video management and browsing. In this paper, we present a generic framework of video summarization based on the modeling of viewer's attention. Without fully semantic understanding of video content, this framework takes advantage of understanding of video content, this framework takes advantage of computational attention models and eliminates the needs of complex heuristic rules in video summarization. A set of methods of audio-visual attention model features are proposed and presented. The experimental evaluations indicate that the computational attention based approach is an effective alternative to video semantic analysis for video summarization.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
H.J Zhang, et al, "An integrated system for content-based video retrieval and browsing," Pattern Recognition, vol. 30, no.4, pp. 643--658, 1997.
 
2
W. Wolf, "Key frame selection by motion analysis," Proc. of ICASSP'96, vol.2, pp. 1228--1231, 1996.
 
3
Y. Zhuang, et al, "Adaptive key frame extraction using unsupervised clustering," Proc. of ICIP'98, 1998.
 
4
A. Girgensohn, and J. Boreczky, "Time-constrained key frame selection technique," Proc. of ICMCS, pp. 756--761, 1999.
 
5
F. Dufaux, "Key frame selection to represent a video," Proc. of ICME 2000.
 
6
P. Campisi, A. Longari and A. Neri, "Automatic key frame selection using a wavelet based approach," Proc. of SPIE, vol. 3813, pp. 861--872, July 1999.
7
8
 
9
10
 
11
 
12
X. Orriols, X. Binefa, "An EM algorithm for video summarization, generative model approach," Proc. of ICCV, 2001.
 
13
Y.H. Gong, X. Liu, "Video Summarization Using Singular Value Decomposition," Proc. of CVPR, June, 2000.
 
14
L. Itti, C. Koch, "Computational Modeling of Visual Attention," Nature Reviews Neuroscience, Vol. 2, No. 3, pp. 194--203, Mar. 2001.
 
15
 
16
L. Itti, C. Koch, "A Comparison of Feature Combination Strategies for Saliency-Based Visual Attention Systems," Proc. of SPIE Human Vision and Electronic Imaging IV (HVEI'99), San Jose, CA, Vol. 3644, pp. 473--82, Jan 1999.
 
17
 
18
 
19
Y.F Ma, H.J. Zhang, "A Model of Motion Attention for Video Skimming," Proc. of ICIP, 2002.
 
20
Y.F. Ma, H.J. Zhang "A New Perceived Motion based Shot Content Representation," Proc. of ICIP, 2001.
 
21
22
 
23
L. Lu, Stan Li, H. J. Zhang, "Content-based Audio Segmentation Using Support Vector Machines". Proc. of ICME 2001, pp956--959, Tokyo, Japan, 2001.
 
24

CITED BY  45

Collaborative Colleagues:
Yu-Fei Ma: colleagues
Lie Lu: colleagues
Hong-Jiang Zhang: colleagues
Mingjing Li: colleagues