ACM Home Page
Please provide us with feedback. Feedback
Invariant optimal feature selection: A distance discriminant and feature ranking based solution
Source Pattern Recognition archive
Volume 41 ,  Issue 5  (May 2008) table of contents
Pages 1429-1439  
Year of Publication: 2008
ISSN:0031-3203
Authors
Jianning Liang  Shanghai Key Laboratory of Intelligent Information Processing, Department of Computer Science and Engineering, Fudan University, Shanghai 200433, China
Su Yang  Shanghai Key Laboratory of Intelligent Information Processing, Department of Computer Science and Engineering, Fudan University, Shanghai 200433, China
Adam Winstanley  National Centre for Geocomputation, Department of Computer Science, National University of Ireland, Maynooth, Co. Kildare, Ireland
Publisher
Elsevier Science Inc.  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: 10.1016/j.patcog.2007.10.018

ABSTRACT

The goal of feature selection is to find the optimal subset consisting of m features chosen from the total n features. One critical problem for many feature selection methods is that an exhaustive search strategy has to be applied to seek the best subset among all the possible nm feature subsets, which usually results in a considerably high computational complexity. The alternative suboptimal feature selection methods provide more practical solutions in terms of computational complexity but they cannot promise that the finally selected feature subset is globally optimal. We propose a new feature selection algorithm based on a distance discriminant (FSDD), which not only solves the problem of the high computational costs but also overcomes the drawbacks of the suboptimal methods. The proposed method is able to find the optimal feature subset without exhaustive search or Branch and Bound algorithm. The most difficult problem for optimal feature selection, the search problem, is converted into a feature ranking problem following rigorous theoretical proof such that the computational complexity can be greatly reduced. The proposed method is invariant to the linear transformation of data when a diagonal transformation matrix is applied. FSDD was compared with ReliefF and mrmrMID based on mutual information on 8 data sets. The experiment results show that FSDD outperforms the other two methods and is highly efficient.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
[1]
 
[2]
 
[3]
Dash, M. and Liu, H., Feature selection for classification. Intelligent Data Anal. v1. 131-156.
[4]
 
[5]
Quah, K.H. and Quek, C., MCES: a novel Monte Carlo evaluative selection approach for objective feature selections. IEEE Trans. Neural Networks. v18 i2.
 
[6]
[7]
 
[8]
 
[9]
 
[10]
 
[11]
Kumar, R., Jayaraman, V.K. and Kulkarni, B.D., An SVM classifier incorporating simultaneous noise reduction and feature selection: illustrative case examples. Pattern Recognition. v38 i1. 41-49.
 
[12]
 
[13]
 
[14]
 
[15]
Kudo, M. and Sklanshy, J., Comparison of algorithm that select features for pattern classifiers. Pattern Recognition. v33. 25-41.
 
[16]
 
[17]
 
[18]
Liu, H., Li, J. and Wong, L., A comparative study on feature selection and classification methods using gene expression profiles and proteomic patterns. Genome Inf. v13. 51-60.
 
[19]
 
[20]
Available at {http://www.ics.uci.edu/mlearn/databases/}.
 
[21]
Available at {http://lib.stat.cmu.edu/datasets/}.
 
[22]
Friedman, M. and Kandel, A., Introduction to Pattern Recognition: Statistical, Structural, Neural and Fuzzy Logic Approaches. 1999. World Scientific Publishing, Singapore.
 
[23]
Li, H., Jiang, T. and Zhang, K., Efficient and robust feature extraction by maximum margin criterion. IEEE Trans. Neural Networks. v17 i1. 157-165.
 
[24]
 
[25]
Chow, T.W.S. and Huang, D., Estimating optimal feature subsets using efficient estimation of high-dimensional mutual information. IEEE Trans. Neural Networks. v16 i1.
 
[26]
 
[27]
 
[28]
Available at {http://www.csie.ntu.edu.tw/cjlin/libsvm}.


Collaborative Colleagues:
Jianning Liang: colleagues
Su Yang: colleagues
Adam Winstanley: colleagues