ACM Home Page
Please provide us with feedback. Feedback
News video classification using SVM-based multimodal classifiers and combination strategies
Full text PdfPdf (103 KB)
Source International Multimedia Conference archive
Proceedings of the tenth ACM international conference on Multimedia table of contents
Juan-les-Pins, France
POSTER SESSION: Poster session and reception table of contents
Pages: 323 - 326  
Year of Publication: 2002
ISBN:1-58113-620-X
Authors
Wei-Hao Lin  Carnegie Mellon University, Pittsburgh, PA
Alexander Hauptmann  Carnegie Mellon University, Pittsburgh, PA
Sponsors
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
SIGCOMM: ACM Special Interest Group on Data Communication
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 112,   Citation Count: 7
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/641007.641075
What is a DOI?

ABSTRACT

Video classification is the first step toward multimedia content understanding. When video is classified into conceptual categories, it is usually desirable to combine evidence from multiple modalities. However, combination strategies in previous studies were usually ad hoc. We investigate a meta-classification combination strategy using Support Vector Machine, and compare it with probability-based strategies. Text features from closed-captions and visual features from images are combined to classify broadcast news video. The experimental results show that combining multimodal classifiers can significantly improve recall and precision, and our meta-classification strategy gives better precision than the approach of taking the product of the posterior probabilities.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Chang, C.-C. and Lin, C.-J. LIBSVM: a library for support vector machines. 2001. Software available at <u>http://www.csie.ntu.edu.tw/~cjlin/libsvm</u>.
 
3
 
4
Dimitrova N., Agnihotri, L. and Wei G. Video Classification Based on HMM Using Text and Faces. European Conference on Signal Processing, Finland, 2000.
 
5
Huang, J., Liu, Z., Wang, Y., Chen, Y. and Wong, E.K. Integration of Multimodal Features for Video Scene Classification Based on HMM. IEEE Third Workshop on Multimedia Signal Processing, Copenhagen, Denmark, 1999.
 
6
 
7
 
8
Lin, W.-H., Jin. R. and Hauptmann, A. Meta-classification of Multimedia Classifiers. International Workshop on Knowledge Discovery in Multimedia and Complex Data, Taipei, Taiwan, 2002.
 
9
Liu, Z., Huang, J. and Wang, Y. Classification of TV Programs Based on Audio Information Using Hidden Markov Model. IEEE Workshop on Multimedia Signal Processing, 1998.
10
 
11
 
12
Schölkopf, B., Sung, K.-K., Burges, C., Giroso, F., Niyogi, P., Poggio, T. and Vapnik, V. Comparing Support Vector Machine with Gaussian Kernels to Radial Basis Function Classifiers. IEEE Transactions on Signal Processing, 45(11), 1997.
 
13
 
14
 
15
Wang, Y., Liu, Z. and Huang, J.-C. Multimedia Content Analysis Using Both Audio and Visual Clues. IEEE Signal Processing Magazine, 2000.
 
16
M. F. Porter. An algorithm for suffix stripping. Program, 14(3):130--137, July 1980.

CITED BY  7
Collaborative Colleagues:
Wei-Hao Lin: colleagues
Alexander Hauptmann: colleagues