ACM Home Page
Please provide us with feedback. Feedback
Multi-camera spatio-temporal fusion and biased sequence-data learning for security surveillance
Full text PdfPdf (282 KB)
Source International Multimedia Conference archive
Proceedings of the eleventh ACM international conference on Multimedia table of contents
Berkeley, CA, USA
SESSION: Surveillance table of contents
Pages: 528 - 538  
Year of Publication: 2003
ISBN:1-58113-722-2
Authors
Gang Wu  University of California, Santa Barbara, CA
Yi Wu  University of California, Santa Barbara, CA
Long Jiao  University of California, Santa Barbara, CA
Yuan-Fang Wang  University of California, Santa Barbara, CA
Edward Y. Chang  University of California, Santa Barbara, CA
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
SIGCOMM: ACM Special Interest Group on Data Communication
ACM: Association for Computing Machinery
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 98,   Citation Count: 12
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/957013.957126
What is a DOI?

ABSTRACT

We present a framework for multi-camera video surveillance. The framework consists of three phases: detection, representation, and recognition. The detection phase handles multi-source spatio-temporal data fusion for efficiently and reliably extracting motion trajectories from video. The representation phase summarizes raw trajectory data to construct hierarchical, invariant, and content-rich descriptions of the motion events. Finally, the recognition phase deals with event classification and identification on the data descriptors. Because of space limits, we describe only briefly how we detect and represent events, but we provide in-depth treatment on the third phase: event recognition. For effective recognition, we devise a sequence-alignment kernel function to perform sequence data learning for identifying suspicious events. We show that when the positive training instances (i.e., suspicious events) are significantly outnumbered by the negative training instances (benign events), then SVMs (or any other learning methods) can suffer a high incidence of errors. To remedy this problem, we propose the kernel boundary alignment (KBA) algorithm to work with the sequence-alignment kernel. Through empirical study in a parking-lot surveillance setting, we show that our spatio-temporal fusion scheme and biased sequence-data learning method are highly effective in identifying suspicious events.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
Y. Bengio. Markovian models for sequential data. Neural computing serveys, 1998.
 
4
5
 
6
R. G. Brown. Introduction to Random Signal Analysis and Kalman filtering. Wiley, New York, NY, 1983.
 
7
8
 
9
E. Church. Revised Geometry of the Aerial Photograph. Bulletin of Aerial Photogrammetry, 15, 1945.
 
10
H. Cohn. Conformal Mapping on Riemann Surfaces. Dover Pubns, 1980.
 
11
R. T. Collins, A. J. Lipton, T. Kanade, H. Fujiyoshi, D. Duggins, Y. Tsin, D. Tolliver, N. Enomoto, O. Hasegawa, P. Burt, and L. Wixson. A system for video surveillance and monitoring (VSAM project final report). CMU Technical Report CMU-RI-TR-00-12, 2000.
 
12
 
13
 
14
 
15
 
16
 
17
 
18
R. Haralick, H. Joo, C. Lee, X. Zhuang, V. Vaidya, and M. Kim. Pose estimation from corresponding point data. IEEE Trans. Syst., Man, Cybern., 19:1426--46, 1989.
 
19
 
20
 
21
 
22
T. S. Jaakola and D. Haussler. Probabilistic kernel regression models. In Conference of AI and Statistics, 1999.
 
23
S. J. Julier, J. K. Uhlmann, and H. F. Durrant-Whyte. A New Approach for Filtering Nonlinear Systems. In Proc. Amiercan Control Conference, Seattle, WA, 1995.
 
24
K. Kanatani. Optimal homography computation with a reliability measure. Proc. IAPR Workshop Machine Vision Applications, Nov. 1998.
 
25
V. Kettnaker and R. Zabih. Bayesian multi-camera surveillance. CVPR, 1999.
 
26
 
27
Leslie, Christin, E. Eskin, A. Cohen, J. Weston, and W. S. Noble. Mismatch string kernels for svm protein classification. Neural Information Processing Systems, 2002.
 
28
C. Leslie, E. Eskin, and W. S. Noble. The spectrum kernel: a string kernel for svm protein classification. Proceedings of the Pacific Symposium on Biocomputing. World Scientific, 2002.
 
29
J. Lou, H. Yang, W. Hu, and T. Tan. Visual vehicle tracking using an improved ekf. ACCV, 2002.
 
30
S. Maybank, A. Worrall, and G. Sullivan. Filter for car tracking based on acceleration and steering angle. Proc. of British Machine Vision Conference, 1996.
 
31
P. S. Maybeck. Stochastic Models, Estimation, and Control, vol. 1. Academic Press, New York, NY, 1979.
 
32
I. Pavlidis and V. Morellas. Two examples of indoor and outdoor surveillance systems: Motivation, design, and testing. Proceedings 2nd European Workshop on Advanced Video-Based Surveillance, 2001.
 
33
I. Pavlidis, V. Morellas, P. Tsiamyrtzis, and S. Harp. Urban surveillance systems: From the laboratory to the commercial world. Proc. of the IEEE, 89(10), 2001.
 
34
C. Regazzoni and P. K. Varshney. Multisensor surveillance systems based on image and video data. Proc. of the IEEE Conf. on Image Proc., 2002.
 
35
F. W. Sears. Optics, 3rd Ed. Addison-Wesley, Reading, MA, 1958.
 
36
T. Starner and A. Pentland. Visual recognition of american sign language using hidden markov models. Technical Report Master's Thesis, MIT, Program in Media Arts & Sciences, MIT Media Laboratory, 1994.
 
37
 
38
C. Watkins. Dynamic alignment kernels. Technical Report, Department of Computer Science, University of London, 1999.
 
39
 
40
G. Wu and E. Chang. Adaptive feature-space conformal transformation for learning imbalanced data. International Conference of Machine Learning (ICML), August 2003.
 
41
 
42

CITED BY  12

Collaborative Colleagues:
Gang Wu: colleagues
Yi Wu: colleagues
Long Jiao: colleagues
Yuan-Fang Wang: colleagues
Edward Y. Chang: colleagues