ABSTRACT
Many events that occur regularly would be worth preserving on video, yet capturing them with traditional video production methods is impractical. In this paper, we describe one way to capture and produce video inexpensively and unobtrusively in a classroom lecture environment. We discuss the importance of cinematic principles in the lecture video domain and describe guidelines that should be followed when capturing a lecture. We then survey the tools from computer vision and computer graphics that allow us to determine syntactic information about images. Finally, we describe a way to combine these tools into a framework for a Virtual Videography system, one that can automatically generate production-quality video. This framework is based on the creation of region objects, each a semantically related region of video, despite the fact that we can reliably gather only syntactic information.