| Real-time new event detection for video streams |
| Full text |
Pdf
(360 KB)
|
Source
|
Conference on Information and Knowledge Management
archive
Proceeding of the 17th ACM conference on Information and knowledge management
table of contents
Napa Valley, California, USA
SESSION: DB: stream processing
table of contents
Pages 379-388
Year of Publication: 2008
ISBN:978-1-59593-991-3
|
|
Authors
|
|
Gang Luo
|
IBM T.J. Watson Research Center, Hawthorne, NY, USA
|
|
Rong Yan
|
IBM T.J. Watson Research Center, Hawthorne, NY, USA
|
|
Philip S. Yu
|
IBM T.J. Watson Research Center, Hawthorne, NY, USA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 16, Downloads (12 Months): 151, Citation Count: 0
|
|
|
ABSTRACT
Online detection of video clips that present previously unseen events in a video stream is still an open challenge to date. For this online new event detection (ONED) task, existing studies mainly focus on optimizing the detection accuracy instead of the detection efficiency. As a result, it is difficult for existing systems to detect new events in real time, especially for large-scale video collections such as the video content available on the Web. In this paper, we propose several scalable techniques to improve the video processing speed of a baseline ONED system by orders of magnitude without sacrificing much detection accuracy. First, we use text features alone to filter out most of the non-new-event clips and to skip those expensive but unnecessary steps including image feature extraction and image similarity computation. Second, we use a combination of indexing and compression methods to speed up text processing. We implemented a prototype of our optimized ONED system on top of IBM's System S. The effectiveness of our techniques is evaluated on the standard TRECVID 2005 benchmark, which demonstrates that our techniques can achieve a 480-fold speedup with detection accuracy degraded less than 5%.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
L. Agnihotri, N. Dimitrova, and T. McGee et al. Envolvable Visual Commercial Detector. CVPR (2) 2003: 79--84.
|
 |
2
|
James Allan , Victor Lavrenko , Hubert Jin, First story detection in TDT is hard, Proceedings of the ninth international conference on Information and knowledge management, p.374-381, November 06-11, 2000, McLean, Virginia, United States
[doi> 10.1145/354756.354843]
|
 |
3
|
|
 |
4
|
|
| |
5
|
R. Braun, R. Kaneshiro. Exploiting Topic Pragmatics for New Event Detection in TDT-2004. TDT-2004 Workshop.
|
| |
6
|
M. Campbell, S. Ebadollahi, and D. Joshi et al. IBM Research TRECVID-2006 Video Retrieval System. NIST TRECVID workshop, 2006.
|
| |
7
|
|
| |
8
|
M. Clayton. US Plans Massive Data Sweep. The Christian Science Monitor, February 09, 2006. http://www.csmonitor.com/2006/0209/p01s02-uspo.html, 2006.
|
| |
9
|
P. Duygulu, M. Chen, and A.G. Hauptmann. Comparison and Combination of Two Novel Commercial Detection Methods. ICME 2004: 1267--1270.
|
 |
10
|
|
| |
11
|
W. Hsu, S. Chang. Topic Tracking across Broadcast News Videos with Visual Duplicates and Semantic Concepts. ICIP 2006: 141--144.
|
| |
12
|
IBM Technology Translates Arabic Media Broadcasts to English. http://www.sda-asia.com/sda/news/psecom,id,11163,srn,4,channel,developer,nodeid,4,_language,Singapore.html#, 2006.
|
| |
13
|
Kun-Lung Wu , Kirsten W. Hildrum , Wei Fan , Philip S. Yu , Charu C. Aggarwal , David A. George , Buǧra Gedik , Eric Bouillet , Xiaohui Gu , Gang Luo , Haixun Wang, Challenges and experience in prototyping a multi-modal stream analytic and monitoring application on System S, Proceedings of the 33rd international conference on Very large data bases, September 23-27, 2007, Vienna, Austria
|
 |
14
|
|
| |
15
|
|
 |
16
|
|
| |
17
|
E. Lipton. Software to Monitor Overseas Opinions of U.S. The New York Times, October 4, 2006. http://news.zdnet.com/2100-9588_22-6122641.html, 2006.
|
 |
18
|
|
 |
19
|
|
| |
20
|
R. Peterson. IBM Strives for Super Human Speech. http://www.accessible-devices.com/superspeech.html, 2006.
|
| |
21
|
M. F. Porter. An Algorithm for Suffix Stripping. Program 14(3): 130--137, 1980.
|
 |
22
|
|
| |
23
|
A. Singhal. Modern Information Retrieval: A Brief Overview. IEEE Data Eng. Bull. 24(4): 35--43, 2001.
|
| |
24
|
SMART Stopword List. http://www.lextek.com/manuals/onix/stopwords2.html, 2005.
|
| |
25
|
TDT Homepage. http://www.nist.gov/speech/tests/tdt.
|
| |
26
|
TREC Video Retrieval Evaluation. http://www-nlpir.nist.gov/projects/trecvid.
|
| |
27
|
E. M. Voorhees. Overview of TREC 2005. TREC 2005: 1--15.
|
| |
28
|
X. Wu, C. Ngo, and Q. Li. Threading and Autodocumenting News Videos: a Promising Solution to Rapidly Browse News Topics. IEEE Signal Processing Magazine 23(2): 59--68, 2006.
|
 |
29
|
|
 |
30
|
Yiming Yang , Jian Zhang , Jaime Carbonell , Chun Jin, Topic-conditioned novelty detection, Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, July 23-26, 2002, Edmonton, Alberta, Canada
[doi> 10.1145/775047.775150]
|
 |
31
|
|
| |
32
|
YouTube Homepage. http://www.youtube.com.
|
|