ACM Home Page
Please provide us with feedback. Feedback
Comparison of content selection methods for skimming rushes video
Full text PdfPdf (193 KB)
Source
International Multimedia Conference archive
Proceedings of the 2nd ACM TRECVid Video Summarization Workshop table of contents
Vancouver, British Columbia, Canada
Pages 85-89  
Year of Publication: 2008
ISBN:978-1-60558-309-9
Authors
Werner Bailer  JOANNEUM RESEARCH, Graz, Austria
Georg Thallinger  JOANNEUM RESEARCH, Graz, Austria
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 50,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1463563.1463578
What is a DOI?

ABSTRACT

We compare two methods for selecting segments to be included in a video skim, using lists of relevant as well as redundant segments created from different visual features as input. One approach is rule-based, and creates a weighted sum of the input relevances. The other is HMM based, using a model trained on the TRECVID 2007 rushes data. The redundant segments are created from detection of repeated takes and junk content, the selected segments from visual activity and face detection. The results show that the approaches create very short summaries which only contain a part of the relevant information in the video, but reach very high scores in terms of the usability measures non-duplicates, non-junk and pleasant tempo. The HMM based approach contains more information despite shorter duration of the summaries.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
W. Bailer. A comparison of distance measures for clustering video sequences. In Proceedings of 1st Workshop on Automated Information Extraction in Media Production, Turin, IT, Sept. 2008.
 
2
W. Bailer, E. Dumont, S. Essid, and B. Mérialdo. A collaborative approach to automatic rushes video summarization. In Proceedings of IEEE International Conference on Image Processing, San Diego, CA, USA, Sept. 2008.
3
 
4
W. Bailer, F. Lee, and G. Thallinger. Detecting and clustering multiple takes of one scene. In MMM, pages 80--89, Kyoto, JP, Jan. 2008.
 
5
C.-C. Chang and C.-J. Lin. LIBSVM: a library for support vector machines, 2001.
 
6
MPEG-7. Multimedia Content Description Interface-Part 3: Visual. ISO/IEC 15938--3, 2001.
 
7
NHK STRL. Test modules for TRECVID activity. Use case scenario. Ver.1.2.0E, Apr. 2008.
8
 
9
A. F. Smeaton and P. Over. TRECVID 2006: Shot boundary detection task overview. In Proceedings of the TRECVID Workshop, November 2006.
 
10
P. Viola and M. Jones. Fast and robust classification using asymmetric adaboost and a detector cascade. In T. G. Dietterich, S. Becker, and Z. Ghahramani, editors, Advances in Neural Information Processing Systems 14, Cambridge, MA, 2002. MIT Press.
 
11


Collaborative Colleagues:
Werner Bailer: colleagues
Georg Thallinger: colleagues