ACM Home Page
Please provide us with feedback. Feedback
Dynamic key frame presentation techniques for augmenting video browsing
Full text PdfPdf (1.50 MB)
Source AVI archive
Proceedings of the working conference on Advanced visual interfaces table of contents
L'Aquila, Italy
SESSION: Designing multimedia table of contents
Pages: 185 - 194  
Year of Publication: 1998
Authors
Tony Tse  University of Maryland, College Park, MD
Gary Marchionini  University of Maryland, College Park, MD
Wei Ding  University of Maryland, College Park, MD
Laura Slaughter  University of Maryland, College Park, MD
Anita Komlodi  University of Maryland, College Park, MD
Sponsor
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 31,   Citation Count: 14
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/948496.948522
What is a DOI?

ABSTRACT

Because of unique temporal and spatial properties of video data, different techniques for summarizing videos have been proposed. Key frames extracted directly from video inform users about content without requiring them to view the entire video. As part of ongoing work to develop video browsing interfaces, several interface displays based on key frames were investigated. Variations on dynamic key frame "slide shows" were examined and compared to a static key frame "filmstrip" display. The slide show mechanism displays key frames in rapid succession and is designed to facilitate visual browsing by exploiting human perceptual capabilities. User studies were conducted in a series of three experiments. Key frame display rate, number of simultaneous displays, and user perception were investigated as a function of user performance in object recognition and gist determination tasks. No significant performance degradation was detected at display rates up to 8 key frames per second, but performance degraded significantly at higher rates. Performance on gist determination tasks degraded less severely than performance on object recognition tasks as display rates increased. Furthermore, gist determination performance dropped significantly between three and four simultaneous slide shows in a single display. Users generally preferred key frame filmstrips to dynamic displays, although objective measures of performance were mixed. Implications for visual interface design and further questions for future research are provided.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
Boyce, S. J., A. Pollatsek, and K. Rayner. (1989). Effect of background information on object identification. Journal of Experimental Psychology: Human Perception and Performance, 15(3): 556--566.
 
3
 
4
Ding, W., G. Marchionini, and T. Tse. (1997). Previewing video data: Browsing key frames at high rates using a video slide show interface. Proceedings of the International Symposium on Research, Development, and Practice in Digital Libraries (ISDL '97), Tsukuba, Japan.
 
5
Ellis, H. C. and R. R. Hunt. (1989). Fundamentals of human memory and cognition. Wm. C. Brown Publishers: Dubuque, IA.
 
6
Elliot, E. (1993). Watch, grab, arrange, see: Thinking with motion images via streams and collages. MSVS Thesis Document. Cambridge, MA: MIT Media Lab.
7
 
8
Komlodi, A. (1997). Visual surrogates for motion picture documents: Presentation techniques for key frames. CLIS-TR-97-15, College Park, MD: Digital Library Research Group (http://www.glue.umd.edu/~dlrg/).
 
9
 
10
O'Connor, B. C. (1991). Selecting key frames of moving image documents: A digital environment for analysis and navigation. Microcomputers for Information Management, 8(2), 119--133.
 
11
Potter, M. C. (1976). Short-term conceptual memory for pictures. Journal of Experimental Psychology: Human Learning and Memory, 2(5), 509--522.
 
12
13
 
14
 
15
Wickens, C. D. (1992). Engineering psychology and human performance. Second Edition. New York: HarperCollins.
 
16
Yeung, M. M, B. L. Yeo, W. Wolf, and B. Liu. (1995). Video browsing using clustering and scene transition on compressed sequences. Proceedings of Multimedia Computing and Networking, San Jose.
 
17
Yow, D, B. L. Yeo, M. M. Yeung, and B. Liu. (1995). Analysis and presentation of soccer highlights from digital video. Proceedings of the Second Asian Conference on Computer Vision (ACCV '95).
 
18

CITED BY  14

Collaborative Colleagues:
Tony Tse: colleagues
Gary Marchionini: colleagues
Wei Ding: colleagues
Laura Slaughter: colleagues
Anita Komlodi: colleagues