ACM Home Page
Please provide us with feedback. Feedback
Automatically estimating number of scenes for rushes summarization
Full text PdfPdf (214 KB)
Source
International Multimedia Conference archive
Proceedings of the 2nd ACM TRECVid Video Summarization Workshop table of contents
Vancouver, British Columbia, Canada
Pages 129-133  
Year of Publication: 2008
ISBN:978-1-60558-309-9
Authors
Koji Yamasaki  Tokyo Institute of Technology, Tokyo, Japan
Koichi Shinoda  Tokyo Institute of Technology, Tokyo, Japan
Sadaoki Furui  Tokyo Institute of Technology, Tokyo, Japan
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 40,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1463563.1463587
What is a DOI?

ABSTRACT

This paper describes our video summarization system using a model selection technique to estimate the optimal number of scenes for a summary. It uses a minimum description length as a model selection criterion and carries out two-stage estimation. First, we estimate the number of scenes in each shot, and then we estimate the number of scenes in a whole video clip. We model a set of scenes with a Gaussian mixture model, where the mixture component is assumed to represent one scene. Our system was evaluated in the TRECVID 2008 rushes summarization task, where the test video set was unedited materials provided by the BBC. Our scores were about the same as the average of all the participants for the eight evaluation measures.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
H. Akaike. A new look at the statistical model identification. IEEE Transactions on Automatic Control, AC-19(6):716--723, 1974.
 
2
J.-Y. Bouget. Pyramidal implementation of the lucas kanade feature tracker description of the algorithm. Technical report, Intel Corporation Microprocessor Research Labs, 2000.
 
3
J. Lin. Divergence measures based on the shannon entropy. IEEE Transactions on Information Theory, 37(1):145--151, 1991.
 
4
Z. Liu, E. Zavesky, D. Gibbon, B. Shahraray, and P. Haffner. AT&T research at TRECVID 2007. In TREC Video Retrieval Evaluation Online Proceedings, 2007.
 
5
B. Lucas and T. Kanade. An iterative image registration technique with an application to stereo vision. Proc. 7th International Joint Conference on Artificial Intelligence, pages 674--679, 1981.
 
6
T. Nakamura, Y. Miyamura, K. Shinoda, and S. Furui. TokyoTech's TRECVID2006 notebook. In TREC Video Retrieval Evaluation Online Proceedings, 2006.
 
7
T. Nakamura, K. Shinoda, and S. Furui. TokyoTech's TRECVID2007 notebook. In TREC Video Retrieval Evaluation Online Proceedings, 2007.
 
8
Intel Open Source Computer Vision Library. http://www.intel.com/research/mrl/research/opencv/.
9
10
 
11
Z. Pan and C.-W. Ngo. Moving-object detection, association, and selection in home videos. IEEE Transactions on Multimedia, 9(2):268--279, February 2007.
 
12
J. Rissanen. A universal prior for integers and estimation by minimum description length. Ann. Statist., 11(2):416--431, 1983.
 
13
K. Shinoda, K. Ishihara, S. Furui, and T. Mochizuki. Automatic score scene detection for baseball video. International Symposium on Large-Scale Knowledge Resources (LKR2008), pages 226--240, March 2008.
14
 
15
E. Spyrou, P. Kapsalas, G. Tolias, P. Mylonas, and Y. A. et al. The COST292 experimental framework for TRECVID 2007. In TREC Video Retrieval Evaluation Online Proceedings, 2007.
16


Collaborative Colleagues:
Koji Yamasaki: colleagues
Koichi Shinoda: colleagues
Sadaoki Furui: colleagues