|
ABSTRACT
Dimensionality reduction via Random Projections has attracted considerable attention in recent years. The approach has interesting theoretical underpinnings and offers computational advantages. In this paper we report a number of experiments to evaluate Random Projections in the context of inductive supervised learning. In particular, we compare Random Projections and PCA on a number of different datasets and using different machine learning methods. While we find that the random projection approach predictively underperforms PCA, its computational advantages may make it attractive for certain applications.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
D. Achlioptas, F. McSherry, and B. Schölkopf. Sampling techniques for kernel methods. In S. B. T. G. Dietterich and Z. Ghahramani, editors, Advances in Neural Information Processing Systems.
|
| |
3
|
U. Alon, N. Barkai, D. A. Notterman, K. Gish, S. Ybarra, D. Mack, and A. J. Levine. Broad patterns of gene expression revealed by clustering of tumor and normal colon tissues probed by oligonucleotide arrays. In Proc. Natl. Acad. Sci. USA, volume 96, pages 6745--6750, June 1999.
|
 |
4
|
|
| |
5
|
C. Blake and C. Merz. (uci) repository of machine learning databases, 1998.
|
| |
6
|
|
| |
7
|
|
| |
8
|
S. Dasgupta and A. Gupta. An elementary proof of the johnson-lindenstrauss lemma. Technical Report TR-99-006, International Computer Science Institute, Berkeley, California, USA, 1999.
|
| |
9
|
|
| |
10
|
T. R. Golub, D. K. Slonim, P. Tamayo, C. H. M. Gaasenbeek, J. P. Mesirov, H. Coller, M. L. Loh, J. R. Downing, M. A. Caligiuri, C. D. Bloomfield, and E. S. Lander. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. In Science, volume 286, pages 531--537, 1999.
|
| |
11
|
T. Hastie, R. Tibshirani, and J. Friedman. The Elements of Statistical Learning. Data Mining, Inference, and Prediction. Springer, New York, 2001.
|
 |
12
|
|
| |
13
|
|
| |
14
|
S. Kaski. Dimensionality reduction by random mapping: Fast similarity computation for clustering. In Proceedings of IJCNN'98, volume 1, pages 413--418. IEEE Service Center, Piscataway, NJ, 1998.
|
| |
15
|
A. Magen. Dimensionality reductions that preserve volumes and distance to affine spaces, and their algorithmic applications.
|
 |
16
|
|
| |
17
|
|
CITED BY 9
|
|
|
|
|
|
|
|
Kaushal Sanghai , Ting Su , Jennifer Dy , David Kaeli, A multinomial clustering model for fast simulation of computer architecture designs, Proceeding of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, August 21-24, 2005, Chicago, Illinois, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|