| Variable selection and ranking for analyzing automobile traffic accident data |
| Full text |
Pdf
(151 KB)
|
| Source
|
Symposium on Applied Computing
archive
Proceedings of the 2005 ACM symposium on Applied computing
table of contents
Santa Fe, New Mexico
SESSION: AI and computational logic and image analysis (AI)
table of contents
Pages: 32 - 37
Year of Publication: 2005
ISBN:1-58113-964-0
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 9, Downloads (12 Months): 35, Citation Count: 0
|
|
|
ABSTRACT
Variable ranking and feature selection are important concepts in data mining and machine learning. This paper introduces a new variable ranking technique named Sum Max Gain Ratio (SMGR). The new technique is evaluated within the domain of traffic accident data and against a more generalized dataset. In certain cases, SMGR is empirically shown to provide similar results to established approaches with significantly better runtime performance.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
|
| |
4
|
|
| |
5
|
|
| |
6
|
|
| |
7
|
T. Furey, N. Cristianini, N. Duffy, D. W. Bednarski, M. Schummer and D. Haussler. Support vector machine classification and validation of cancer tissue samples using microarray expression data. Bioinformatics, 16:906--914, 2000.
|
| |
8
|
T. R. Golub et al. Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring. Science, 286:531--537, October 1999.
|
| |
9
|
D. W. Stockburger. Introductory statistics: Concepts, Models, and Applications.
|
 |
10
|
Donna K. Slonim , Pablo Tamayo , Jill P. Mesirov , Todd R. Golub , Eric S. Lander, Class prediction and discovery using gene expression data, Proceedings of the fourth annual international conference on Computational molecular biology, p.263-272, April 2000, Tokyo, Japan
[doi> 10.1145/332306.332564]
|
| |
11
|
|
| |
12
|
P. Pavlidis, J. Weston, J. Cai, and W. N. Grundy. Learning gene functional classifications from multiple data types. Journal of Computational Biology, 9(2): 401--411, 2002.
|
| |
13
|
|
| |
14
|
|
| |
15
|
W. Hawley. Foundations of statistics. Harcourt Brace & Company, 1996.
|
| |
16
|
|
| |
17
|
M. Grimaldi, P. Cunningham and A. Kokaram. An evaluation of alternative feature selection strategies and ensemble techniques for classifying music. The 14th European Conference on Machine Learning and the 7th European Conference on Principles and Practice of Knowledge Discovery in Databases, Dubrovnik, Croatia 2003.
|
| |
18
|
|
| |
19
|
|
| |
20
|
|
| |
21
|
|
| |
22
|
G. John, R. Kohavi and K. Pfleeger. Irrelevant features and the subset selection problem. In Proceedings of the 11th International Conference on Machine Learning, pp. 121--129, 1994.
|
|