|
ABSTRACT
This paper introduces the video collage, a novel effective interface for browsing and interpreting video collections. The paper discusses how collages are automatically produced, illustrates their use, and evaluates their effectiveness as summaries across news stories. Collages are presentations of text and images derived from multiple video sources, which provide an interactive visualization for a set of video documents, summarizing their contents and providing a navigation aid for further exploration. The dynamic creation of collages is based on user context, e.g., an originating query, coupled with automatic processing to refine the candidate imagery. Named entity identification and common phrase extraction provides descriptive text. The dynamic manipulation of collages allows user-directed browsing and reveals additional detail. The utility of collages as summaries is examined with respect to other published news summaries.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
Bates, M.J. The design of browsing and berrypicking techniques for the on-line search interface. Online Review 13(5) (1989), 407--431.
|
| |
3
|
Daniel M. Bikel , Scott Miller , Richard Schwartz , Ralph Weischedel, Nymble: a high-performance learning name-finder, Proceedings of the fifth conference on Applied natural language processing, p.194-201, March 31-April 03, 1997, Washington, DC
[doi> 10.3115/974557.974586]
|
 |
4
|
John Boreczky , Andreas Girgensohn , Gene Golovchinsky , Shingo Uchihashi, An interactive comic book presentation for exploring video, Proceedings of the SIGCHI conference on Human factors in computing systems, p.185-192, April 01-06, 2000, The Hague, The Netherlands
[doi> 10.1145/332040.332428]
|
 |
5
|
Shih-Fu Chang , Gwendal Auffret , Jonathan Foote , Chung-Shen Li , Behzad Shahraray , Tanveer Syeda-Mahmood , HongJiang Zhang, Multimedia access and retrieval (panel session): the state of the art and future directions, Proceedings of the seventh ACM international conference on Multimedia (Part 1), p.443-445, October 30-November 05, 1999, Orlando, Florida, United States
[doi> 10.1145/319463.319684]
|
 |
6
|
|
| |
7
|
Christel, M. and Warmack, A. The Effect of Text in Storyboards for Video Navigation, in Proc. IEEE ICASSP (Salt Lake City UT, May 2001), vol. III, 1409--1412.
|
| |
8
|
Clarkson, P. and Rosenfeld, R. Statistical language modeling using the CMU-Cambridge toolkit, in Proc. Eurospeech '97 (Rhodes, Greece, Sept. 1997), Assoc., 2707--2710.
|
 |
9
|
Wei Ding , Gary Marchionini , Dagobert Soergel, Multimodal surrogates for video browsing, Proceedings of the fourth ACM conference on Digital libraries, p.85-93, August 11-14, 1999, Berkeley, California, United States
[doi> 10.1145/313238.313266]
|
| |
10
|
Google search engine (as of April, 2002), © 2002 Google, <u>http://www.google.com</u>.
|
| |
11
|
Hearst, M.A. User Interfaces and Visualization. In Modern Information Retrieval, edited by R. Baeza-Yates and B. Ribeiro-Neto, Addison Wesley/ACM Press, New York, 1999.
|
| |
12
|
Infoplease.com. 2001 People in the News, © 2002 Learning Network, <u>http://www.infoplease.com/ipa/A0878485.html</u>.
|
| |
13
|
Andrew Large , Jamshid Beheshti , Alin Breuleux , Andre Renaud, Multimedia and comprehension: the relationship among text, animation, and captions, Journal of the American Society for Information Science, v.46 n.5, p.340-347, June 1995
[doi> 10.1002/(SICI)1097-4571(199506)46:5<340::AID-ASI5>3.0.CO;2-S]
|
 |
14
|
Francis C. Li , Anoop Gupta , Elizabeth Sanocki , Li-wei He , Yong Rui, Browsing digital video, Proceedings of the SIGCHI conference on Human factors in computing systems, p.169-176, April 01-06, 2000, The Hague, The Netherlands
[doi> 10.1145/332040.332425]
|
 |
15
|
Andrew Merlino , Daryl Morey , Mark Maybury, Broadcast news navigation using story segmentation, Proceedings of the fifth ACM international conference on Multimedia, p.381-391, November 09-13, 1997, Seattle, Washington, United States
[doi> 10.1145/266180.266390]
|
| |
16
|
Miller, D., Schwartz, R., Weischedel, R., and Stone, R. Named Entity Extraction for Broadcast News, in Proc. DARPA Broadcast News Workshop (Washington DC, March 1999), <u>http://www.nist.gov/speech/publications/darpa99/html/ie20/ie20.htm</u>.
|
| |
17
|
Shneiderman, B. The Eyes Have It: A Task by Data Type Taxonomy for Information Visualizations. HCI Lab, Inst. Systems Research, Inst. Advanced Computer Studies, Dept. of Computer Science Tech. Report CS-TR-3665, Univ. of Maryland, (July 1996).
|
| |
18
|
|
| |
19
|
|
| |
20
|
Who2.com. Find Famous People Fast!, © 2002 by Who2?, <u>http://www.who2.com</u>.
|
| |
21
|
Zhang, H.J., Gong, Y.H., Smoliar, S.W., and Yan, S.Y. Automatic Parsing of news video, in Proc. IEEE Conf. on Multimedia Computing and Systems (Boston MA, May 1994), 45--54.
|
CITED BY 18
|
|
Hui Yang , Lekha Chaisorn , Yunlong Zhao , Shi-Yong Neo , Tat-Seng Chua, VideoQA: question answering on news video, Proceedings of the eleventh ACM international conference on Multimedia, November 02-08, 2003, Berkeley, CA, USA
|
|
|
Kent Wittenburg , Clifton Forlines , Tom Lanning , Alan Esenther , Shigeo Harada , Taizo Miyachi, Rapid serial visual presentation techniques for consumer digital video devices, Proceedings of the 16th annual ACM symposium on User interface software and technology, p.115-124, November 02-05, 2003, Vancouver, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Michelle Chang , John J. Leggett , Richard Furuta , Andruid Kerne , J. Patrick Williams , Samuel A. Burns , Randolph G. Bias, Collection understanding, Proceedings of the 4th ACM/IEEE-CS joint conference on Digital libraries, June 07-11, 2004, Tuscon, AZ, USA
|
|
|
|
|
|
|
|
|
Jun Yang , Alexander G. Hauptmann, 3WNews: who, where, and when in news video, Proceedings of the 14th annual ACM international conference on Multimedia, October 23-27, 2006, Santa Barbara, CA, USA
|
|
|
|
|
|
|
|
|
|
|
|
Shi-Yong Neo , Yuanyuan Ran , Hai-Kiat Goh , Yantao Zheng , Tat-Seng Chua , Jintao Li, The use of topic evolution to help users browse and find answers in news video corpus, Proceedings of the 15th international conference on Multimedia, September 25-29, 2007, Augsburg, Germany
|
|
|
|
|
|
|
|
|
|
|