| A novel grammar-based genetic programming approach to clustering |
| Full text |
Pdf
(140 KB)
|
| Source
|
Symposium on Applied Computing
archive
Proceedings of the 2005 ACM symposium on Applied computing
table of contents
Santa Fe, New Mexico
SESSION: Evolutionary computation and optimization (ECO)
table of contents
Pages: 928 - 932
Year of Publication: 2005
ISBN:1-58113-964-0
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 8, Downloads (12 Months): 51, Citation Count: 0
|
|
|
ABSTRACT
Most of the classical methods for clustering analysis require the user setting of number of clusters. To surmount this problem, in this paper a grammar-based Genetic Programming approach to automatic data clustering is presented. An innovative clustering process is conceived strictly linked to a novel cluster representation which provides intelligible information on patterns. The efficacy of the implemented partitioning system is estimated on a medical domain by exploiting expressly defined evaluation indices. Furthermore, a comparison with other clustering tools is performed.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
Babu, G. P. and Marty, M. N. Clustering with evolutionary strategies. Pattern Recognition, 27, 2 (1994), 321--329.
|
| |
3
|
Dempster, A. P., Laird, N. M., and Rubin, D. B. Maximum Likelihood from Incomplete Data via the EM algorithm. Journal of the Royal Statistical Society, Series B., 39, 1 (1977), 1--38.
|
| |
4
|
Fraley, C. and Raftery, A. How many clusters? Which clustering method? Answers via model-based cluster analysis. The Computer Journal, 41, 8 (1998), 578--588.
|
 |
5
|
Sudipto Guha , Rajeev Rastogi , Kyuseok Shim, CURE: an efficient clustering algorithm for large databases, Proceedings of the 1998 ACM SIGMOD international conference on Management of data, p.73-84, June 01-04, 1998, Seattle, Washington, United States
|
| |
6
|
Hall, L. O., Ozyurt, B., and Bezdek, J. C. Clustering with a genetically optimized approach. IEEE Trans. on Evolutionary Computation, 3, 2 (1999), 103--112.
|
| |
7
|
|
| |
8
|
|
| |
9
|
|
 |
10
|
|
| |
11
|
|
| |
12
|
Lee, C. Y. and Antonsson, E. K. Dynamic partitional clustering using evolutionary strategies. In Proc. of the 3rd Asia-Pacific Conference on Simulated Evolution and Learning. IEEE Press, Nagoya, Japan, 2000.
|
| |
13
|
|
| |
14
|
Murphy, P. M. and Aha, D. W. UCI Repository of machine learning databases. University of California, Dept. of Information and Computer Science. Available at www.ics.uci.edu/~mlearn/MLRepository.html
|
| |
15
|
Sarafis, I., Zalzala, A. M. S., and Trinder, P. W. A genetic rule-based data clustering toolkit. In Proc. of the IEEE Congress on Evolutionary Computation, 2002, 1238--1243.
|
| |
16
|
Weka3.4: Data mining software in Java. Available at www.cs.waikato.ac.nz/~ml/weka/
|
| |
17
|
Yip, A. M. A scale dependent data clustering model by direct maximization of homogeneity and separation. In Proc. Mathematical challenges in scientific data mining IPAM. 14-18 January 2002.
|
 |
18
|
Tian Zhang , Raghu Ramakrishnan , Miron Livny, BIRCH: an efficient data clustering method for very large databases, Proceedings of the 1996 ACM SIGMOD international conference on Management of data, p.103-114, June 04-06, 1996, Montreal, Quebec, Canada
|
|