| Descriptive statistics of non-uniform interval symbolic data |
| Full text |
Pdf
(503 KB)
|
Source
|
ACM/SIGEVO Summit on Genetic and Evolutionary Computation
archive
Proceedings of the first ACM/SIGEVO Summit on Genetic and Evolutionary Computation
table of contents
Shanghai, China
POSTER SESSION: Poster sessions
table of contents
Pages 831-834
Year of Publication: 2009
ISBN:978-1-60558-326-6
|
|
Authors
|
|
Jun-peng Guo
|
School of Management, Tianjin University, Tianjin, China
|
|
Wen-hua Li
|
School of Management, Tianjin University, Tianjin, China
|
|
Feng Gao
|
School of Management, Tianjin University, Tianjin, China
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 8, Downloads (12 Months): 25, Citation Count: 0
|
|
|
ABSTRACT
As a new kind of data mining method, symbolic data analysis (SDA) can not only decrease the computational complexity of huge data, but also master the property of the sample integrally by data package technology. Interval number is one of the most important types of symbolic data. Previous studies assumed each individual to be uniformly distributed within the interval, but the fact is not so. Non-uniform interval symbolic data is defined in this paper, and the study is concentrated on their descriptive univariate statistics and bivariate statistics. On the basis of the study on empirical distribution function for non-uniform interval symbolic data, the calculation formula of mean and variance of non-uniform interval variables is achieved. Furthermore, covariance and correlation coefficient between two non-uniform interval variables are solved based on their empirical joint distribution function. Finally an example is given.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Bock, H.H., Diday, E. (Eds.).2000. Analysis of Symbolic Data, Springer-Verlag Berlin, New York.
|
| |
2
|
Billard, L. and Diday, E. 2003. From the Statistics of Data to the Statistics of Knowledge: Symbolic Data Analysis. Journal of the American Statistical Association, 98, 462(Jun. 2003), 470--487.
|
| |
3
|
Hu, Y., Wang, H.W. 2004. A new data mining method based on huge data and its application. Journal of Beijing University of Aeronautics and Astronautics (Social Sciences). 17(Jun. 2004), 40--44.
|
| |
4
|
Hu, Y., Wang, H.W. 2004. An interval data factor analysis method and its application. Application of Statistics and Management. 23, (Jul. 2004), 53--58.
|
| |
5
|
Cazes, P., Chouakria, A., Diday, E. 2000. Symbolic principal components analysis, in: Analysis of Symbolic Data (Eds. H.H. Bock, E. Diday), Springer-Verlag Berlin, New York.
|
| |
6
|
|
| |
7
|
|
|