| Detecting topical events in digital video |
| Full text |
Pdf
(1.04 MB)
|
| Source
|
International Multimedia Conference
archive
Proceedings of the eighth ACM international conference on Multimedia
table of contents
Marina del Rey, California, United States
Pages: 85 - 94
Year of Publication: 2000
ISBN:1-58113-198-4
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 2, Downloads (12 Months): 23, Citation Count: 10
|
|
|
ABSTRACT
The detection of events is essential to high-level semantic querying of video databases. It is also a very challenging problem requiring the detection and integration of evidence for an event available in multiple information modalities, such as audio, video and language. This paper focuses on the detection of specific types of events, namely, topic of discussion events that occur in classroom/lecture environments. Specifically, we present a query-driven approach to the detection of topic of discussion events with foils used in a lecture as a way to convey a topic. In particular, we use the image content of foils to detect visual events in which the foil is displayed and captured in the video stream. The recognition of a foil in video frames exploits the color and spatial layout of regions on foils using a technique called region hashing. Next, we use the textual phrases listed on a foil as an indication of a topic, and detect topical audio events as places in the audio track where the best evidence for the topical phrases was heard. Finally, we use a probabilistic model of event likelihood to combine the results of visual and audio avent detection that exploits their time cooccurrence. The resulting identification of topical events is evaluated in the domain of classroom lectures and talks.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Gregory D. Abowd , Christopher G. Atkeson , Ami Feinstein , Cindy Hmelo , Rob Kooper , Sue Long , Nitin Sawhney , Mikiya Tani, Teaching and learning as multimedia authoring: the classroom 2000 project, Proceedings of the fourth ACM international conference on Multimedia, p.187-198, November 18-22, 1996, Boston, Massachusetts, United States
[doi> 10.1145/244130.244191]
|
 |
2
|
|
| |
3
|
K. Bharat and M. Henzinger. Improved algorithms for topic distillation in a hypexlinked environment. In Proc. 22nd Annual SIGIR Conference, pages 326--327, 1999.
|
| |
4
|
J.C. Clark and N. Ferrier. Modal control of an attentive vision system. In Proceedings of the International Conference on Computer Vision, pages 514-523. 1988.
|
| |
5
|
G. Hauptmann, D. Lee, and P.E. Kennedy. Topic labeling of multilingual broadcast news in the informedia digital video library. In Proc. A CM Digital Libraries/SIGIR MIDAS Workshop, 1999.
|
 |
6
|
|
| |
7
|
|
| |
8
|
Y. Lamdan and H.J. Wolfish. Geometric hashing: A general and efficient model-based recognition scheme. In Proceedings of the International Conference on Computer Vision, pages 218-249, 1988.
|
 |
9
|
|
| |
10
|
|
| |
11
|
R. Schwartz et al. A maximum likelihood model for topic classification in broadcast news. In Proc. European Conf. on Speech Communication and Technology, 1997.
|
| |
12
|
|
| |
13
|
|
 |
14
|
|
 |
15
|
Savitha Srinivasan , Dragutin Petkovic , Dulce Ponceleon, Towards robust features for classifying audio in the CueVideo system, Proceedings of the seventh ACM international conference on Multimedia (Part 1), p.393-400, October 30-November 05, 1999, Orlando, Florida, United States
[doi> 10.1145/319463.319658]
|
| |
16
|
T. Syeda-Mahmood. Indexing of topics using foils. In IEEE Conf. on Computer Vision and Pattern Recognition, 2000.
|
| |
17
|
|
| |
18
|
|
CITED BY 10
|
|
Nevenka Dimitrova , Radu Jasinschi , Lalitha Agnihotri , John Zimmerman , Thomas McGee , Dongge Li, Personalizing video recorders using multimedia processing and integration, Proceedings of the ninth ACM international conference on Multimedia, September 30-October 05, 2001, Ottawa, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
INDEX TERMS
Primary Classification:
H.
Information Systems
H.5
INFORMATION INTERFACES AND PRESENTATION (I.7)
H.5.1
Multimedia Information Systems
Subjects:
Video (e.g., tape, disk, DVI)
Additional Classification:
H.
Information Systems
H.2
DATABASE MANAGEMENT
H.2.8
Database applications
Subjects:
Image databases
I.
Computing Methodologies
I.4
IMAGE PROCESSING AND COMPUTER VISION
General Terms:
Design,
Management,
Measurement,
Performance,
Theory
Keywords:
multi-modal fusion,
query-driven topic detection,
slide detection,
topic of discussion events,
topical audio events
|