|
ABSTRACT
Since the emergence of extensive multimedia data, feature fusion has been more and more important for image and video retrieval, indexing and annotation. Existing feature fusion techniques simply concatenate a pair of different features or use canonical correlation analysis based methods for joint dimensionality reduction in the feature space. However, how to fuse multiple features in a generalized way is still an open problem. In this paper, we reformulate the multiple feature fusion as a general subspace learning problem. The objective of the framework is to find a general linear subspace in which the cumulative pairwise canonical correlation between every pair of feature sets is maximized after the dimension normalization and subspace projection. The learned subspace couples dimensionality reduction and feature fusion together, which can be applied to both unsupervised and supervised learning cases. In the supervised case, the pairwise canonical correlations of feature sets within the same classes are also counted in the objective function for maximization. To better model the high-order feature structure and overcome the computational difficulty, the features extracted from the same pattern source are represented by a single 2D tensor. The tensor-based dimensionality reduction methods are used to further extract low-dimensional discriminative features from the fused feature ensemble. Extensive experiments on visual data classification demonstrate the effectiveness and robustness of the proposed methods.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
T. Ahonen, A. Hadid, and M. Pietikainen. Face recognition with local binary patterns. In European Conference on Computer Vision, pages 469--481, 2004.
|
| |
2
|
|
| |
3
|
M. Barker and W. Rayens. Partial least squares for discrimination. Journal of Chemometrics, 17(3):166--173, 2003.
|
| |
4
|
|
 |
5
|
|
| |
6
|
|
| |
7
|
|
| |
8
|
N. Dalal and B. Triggs. Object detection using histograms of oriented gradients. In European Conference on Computer Vision, Workshop on Pascal VOC'06, 2006.
|
| |
9
|
Y. Fang, T. Tan, and Y. Wang. Fusion of global and local features for face verification. In IEEE Conf. on ICPR, pages 382--385, 2002.
|
| |
10
|
Y. Fu and T. S. Huang. Image classification using correlation tensor analysis. IEEE Trans. on Image Processing, 17(2):226--234, 2008.
|
| |
11
|
Y. Fu, M. Liu, and T. Huang. Conformal embedding analysis with local graph modeling on the unit hypersphere. In IEEE Conf. on CVPR, workshop on Component Analysis, 2007.
|
| |
12
|
|
| |
13
|
T.-K. Kim, O. Arandjelovic, and R. Cipolla. Learning over sets using boosted manifold principal angles (bompa). In British Machine Vision Conference, pages 779--788, 2005.
|
| |
14
|
|
| |
15
|
T.-K. Kim, S.-F. Wong, and R. Cipolla. Tensor canonical correlation analysis for action classification. In IEEE Conf. on CVPR, 2007.
|
| |
16
|
|
| |
17
|
M. Liu, Y. Fu, and T. S. Huang. An audio-visual fusion framework with joint dimensionality reduction. In IEEE Conf. on ICASSP, 2008.
|
| |
18
|
|
| |
19
|
P. Jonathon Phillips , Patrick J. Flynn , Todd Scruggs , Kevin W. Bowyer , Jin Chang , Kevin Hoffman , Joe Marques , Jaesik Min , William Worek, Overview of the Face Recognition Grand Challenge, Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1, p.947-954, June 20-26, 2005
[doi> 10.1109/CVPR.2005.268]
|
| |
20
|
|
| |
21
|
M. Sargin, Y. Yemez, E. Erzin, and A. Tekalp. Audiovisual synchronization and fusion using canonical correlation analysis. IEEE Trans. on Multimedia, 9(7):1396--1403, 2007.
|
| |
22
|
|
| |
23
|
Q.-S. Sun, Z. Jin, P.-A. Heng, and D.-S. Xia. A novel feature fusion method based on partial least squares regression. Lecture Notes in Computer Science 3686, 3686/2005:268--277, 2005.
|
| |
24
|
Q.-S. Sun, S.-G. Zeng, Y. Liu, P.-A. Heng, and D.-S. Xia. A new method of feature fusion and its application in image recognition. Pattern Recognition, 38(12):2437--2448, 2005.
|
| |
25
|
X. Tan and B. Triggs. Enhanced local texture feature sets for face recognition under difficult lighting conditions. In IEEE Conf. on AMFG, pages 168--182, 2007.
|
| |
26
|
M. Turk and A. Pentland. Face recognition using eigenfaces. In IEEE Conf. on CVPR, pages 586--591, 1991.
|
| |
27
|
|
| |
28
|
X. Wang and X. Tang. Using random subspace to combine multiple features for face recognition. In IEEE Conf. on FGR, pages 284--289, 2004.
|
| |
29
|
|
| |
30
|
Shuicheng Yan , Dong Xu , Qiang Yang , Lei Zhang , Xiaoou Tang , Hong-Jiang Zhang, Discriminant Analysis with Tensor Representation, Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1, p.526-532, June 20-26, 2005
[doi> 10.1109/CVPR.2005.131]
|
| |
31
|
Shuicheng Yan , Dong Xu , Benyu Zhang , Hong-Jiang Zhang , Qiang Yang , Stephen Lin, Graph Embedding and Extensions: A General Framework for Dimensionality Reduction, IEEE Transactions on Pattern Analysis and Machine Intelligence, v.29 n.1, p.40-51, January 2007
[doi> 10.1109/TPAMI.2007.12]
|
| |
32
|
J. Yang, J.-Y. Yang, D. Zhang, and J.-F. Lu. Feature fusion: parallel strategy vs. serial strategy. Pattern Recognition, 36(6):1369--1381, 2003.
|
| |
33
|
|
| |
34
|
|
| |
35
|
|
|