ACM Home Page
Please provide us with feedback. Feedback
Deciphering visual gist and its implications for video retrieval and interface design
Full text PdfPdf (384 KB)
Source Conference on Human Factors in Computing Systems archive
CHI '05 extended abstracts on Human factors in computing systems table of contents
Portland, OR, USA
SESSION: Late breaking results: short papers table of contents
Pages: 1877 - 1880  
Year of Publication: 2005
ISBN:1-59593-002-7
Authors
Meng Yang  University of North Carolina at Chapel Hill, Chapel Hill, NC
Gary Marchionini  University of North Carolina at Chapel Hill, Chapel Hill, NC
Sponsors
ACM: Association for Computing Machinery
SIGCHI: ACM Special Interest Group on Computer-Human Interaction
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 39,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1056808.1057045
What is a DOI?

ABSTRACT

How do people make sense of a video based on viewing a few frames of that video? What elements constitute the "visual gist" in their minds? Answers to these questions will give implications to both content-based video retrieval and the interface design (e.g., key-frame selection) of digital video libraries. A preliminary study was conducted to unravel the issues and 45 subjects participated in the study. After viewing a fast forward surrogate, the subjects were asked to choose pictures which they thought would "belong to" the video. And they were also asked to think aloud during their selection processes. Nine visual gist attributes (e.g., people, objects and actions) were generated using the grounded theory method and their frequencies were also compared and analyzed.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
Dufaux, F. Key frame selection to represent a video. ICIP 2000. Vol, II. p. 275--278
 
3
Grodal, T. Emotions, Cognitions, and Narrative Patterns in Film. In Passionate views : film, cognition, and emotion, edited by Plantinga, C. & Smith, G. M. 1999, 127--145
 
4
 
5
Levin, D. T. & Simons, D. J. Failure to detect changes to attended objects in motion pictures. Psychological Bulletin, 4, 1997, 501--506
 
6
Lieberman, L. R., & Culpepper, J. T. Words versus objects: comparison of free verbal recall. Pscychol. Rep. 17,1965, 983--988
 
7
Mandler, J & Ritchey G. H. Long term memory for pictures. Journal of Experimental Psychology {Human learning and memory}, 3, 1977. 386--396
 
8
Markkula, M. and Sormumen, E. Searching for photos: journalists' practices in pictorial IR, The Challenge of Image Retrieval Research Workshop, 1998.
 
9
 
10
Paivio, A. & Csapo, K. Picture superiority in free recall: Imagery or dual coding? Cognitive Psychology, 5, 1973, 176--206
11
 
12
Shepard, R. N. Recognition memory for words, sentences, and pictures. Journal of Verbal Learning and Verbal Behavior, 6, 1967, 156--163
 
13
Simons, D. J. & Levin, D. T., Change blindness. Trends Cognitive Science, 1, 1997, 261--267
 
14
 
15
 
16
Wolfe, J.M. Visual memory: what do you know about what you saw? Current Biology, 8(9), 1998, 303--304.


Collaborative Colleagues:
Meng Yang: colleagues
Gary Marchionini: colleagues