|
ABSTRACT
Video classification is the first step toward multimedia content understanding. When video is classified into conceptual categories, it is usually desirable to combine evidence from multiple modalities. However, combination strategies in previous studies were usually ad hoc. We investigate a meta-classification combination strategy using Support Vector Machine, and compare it with probability-based strategies. Text features from closed-captions and visual features from images are combined to classify broadcast news video. The experimental results show that combining multimodal classifiers can significantly improve recall and precision, and our meta-classification strategy gives better precision than the approach of taking the product of the posterior probabilities.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Chang, C.-C. and Lin, C.-J. LIBSVM: a library for support vector machines. 2001. Software available at <u>http://www.csie.ntu.edu.tw/~cjlin/libsvm</u>.
|
| |
3
|
|
| |
4
|
Dimitrova N., Agnihotri, L. and Wei G. Video Classification Based on HMM Using Text and Faces. European Conference on Signal Processing, Finland, 2000.
|
| |
5
|
Huang, J., Liu, Z., Wang, Y., Chen, Y. and Wong, E.K. Integration of Multimodal Features for Video Scene Classification Based on HMM. IEEE Third Workshop on Multimedia Signal Processing, Copenhagen, Denmark, 1999.
|
| |
6
|
|
| |
7
|
|
| |
8
|
Lin, W.-H., Jin. R. and Hauptmann, A. Meta-classification of Multimedia Classifiers. International Workshop on Knowledge Discovery in Multimedia and Complex Data, Taipei, Taiwan, 2002.
|
| |
9
|
Liu, Z., Huang, J. and Wang, Y. Classification of TV Programs Based on Audio Information Using Hidden Markov Model. IEEE Workshop on Multimedia Signal Processing, 1998.
|
 |
10
|
|
| |
11
|
|
| |
12
|
Schölkopf, B., Sung, K.-K., Burges, C., Giroso, F., Niyogi, P., Poggio, T. and Vapnik, V. Comparing Support Vector Machine with Gaussian Kernels to Radial Basis Function Classifiers. IEEE Transactions on Signal Processing, 45(11), 1997.
|
| |
13
|
|
| |
14
|
|
| |
15
|
Wang, Y., Liu, Z. and Huang, J.-C. Multimedia Content Analysis Using Both Audio and Visual Clues. IEEE Signal Processing Magazine, 2000.
|
| |
16
|
M. F. Porter. An algorithm for suffix stripping. Program, 14(3):130--137, July 1980.
|
CITED BY 7
|
|
|
|
|
|
|
|
Ruofei Zhang , Ramesh Sarukkai , Jyh-Herng Chow , Wei Dai , Zhongfei Zhang, Joint categorization of queries and clips for web-based video search, Proceedings of the 8th ACM international workshop on Multimedia information retrieval, October 26-27, 2006, Santa Barbara, California, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|