|
ABSTRACT
Feature selection aims to reduce dimensionality for building comprehensible learning models with good generalization performance. Feature selection algorithms are largely studied separately according to the type of learning: supervised or unsupervised. This work exploits intrinsic properties underlying supervised and unsupervised feature selection algorithms, and proposes a unified framework for feature selection based on spectral graph theory. The proposed framework is able to generate families of algorithms for both supervised and unsupervised feature selection. And we show that existing powerful algorithms such as ReliefF (supervised) and Laplacian Score (unsupervised) are special cases of the proposed framework. To the best of our knowledge, this work is the first attempt to unify supervised and unsupervised feature selection, and enable their joint study under a general framework. Experiments demonstrated the efficacy of the novel algorithms derived from the framework.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Chapelle, O., Schölkopf, B., and Zien, A. (Eds.) (2006). Semi-supervised learning, chapter Graph-Based Methods. The MIT Press.
|
| |
2
|
Chung, F. (1997). Spectral graph theory. AMS.
|
| |
3
|
|
| |
4
|
|
| |
5
|
He, X., Cai, D., & Niyogi, P. (2005). Laplacian score for feature selection. In NIPS. MIT Press.
|
| |
6
|
|
| |
7
|
|
| |
8
|
|
| |
9
|
Ng, A., Jordan, M., & Weiss, Y. (2001). On spectral clustering: Analysis and an algorithm. NIPS.
|
| |
10
|
|
| |
11
|
|
| |
12
|
|
| |
13
|
Smola, A., & Kondor, I. (2003). Kernels and regularization on graphs. COLT.
|
| |
14
|
|
| |
15
|
Zhang, T., & Ando, R. (2006). Analysis of spectral kernel design based semi-supervised learning. NIPS.
|
| |
16
|
Zhao, Z., & Liu, H. (2007). Semi-supervised Feature Selection via Spectral Analysis. SDM.
|
CITED BY 8
|
|
Jieping Ye , Kewei Chen , Teresa Wu , Jing Li , Zheng Zhao , Rinkal Patel , Min Bae , Ravi Janardan , Huan Liu , Gene Alexander , Eric Reiman, Heterogeneous data fusion for alzheimer's disease study, Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, August 24-27, 2008, Las Vegas, Nevada, USA
|
|
|
|
|
|
|
|
|
Zheng Zhao , Jiangxin Wang , Huan Liu , Jieping Ye , Yung Chang, Identifying biologically relevant genes via multiple heterogeneous data sources, Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, August 24-27, 2008, Las Vegas, Nevada, USA
|
|
|
|
|
|
|
|
|
|
|
|
Feiping Nie , Shiming Xiang , Yangqing Jia , Changshui Zhang , Shuicheng Yan, Trace ratio criterion for feature selection, Proceedings of the 23rd national conference on Artificial intelligence, p.671-676, July 13-17, 2008, Chicago, Illinois
|
|