ACM Home Page
Please provide us with feedback. Feedback
Descriptive statistics of non-uniform interval symbolic data
Full text PdfPdf (503 KB)
Source
ACM/SIGEVO Summit on Genetic and Evolutionary Computation archive
Proceedings of the first ACM/SIGEVO Summit on Genetic and Evolutionary Computation table of contents
Shanghai, China
POSTER SESSION: Poster sessions table of contents
Pages 831-834  
Year of Publication: 2009
ISBN:978-1-60558-326-6
Authors
Jun-peng Guo  School of Management, Tianjin University, Tianjin, China
Wen-hua Li  School of Management, Tianjin University, Tianjin, China
Feng Gao  School of Management, Tianjin University, Tianjin, China
Sponsors
SIGEVO: ACM Special Interest Group on Genetic and Evolutionary Computation
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 25,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1543834.1543954
What is a DOI?

ABSTRACT

As a new kind of data mining method, symbolic data analysis (SDA) can not only decrease the computational complexity of huge data, but also master the property of the sample integrally by data package technology. Interval number is one of the most important types of symbolic data. Previous studies assumed each individual to be uniformly distributed within the interval, but the fact is not so. Non-uniform interval symbolic data is defined in this paper, and the study is concentrated on their descriptive univariate statistics and bivariate statistics. On the basis of the study on empirical distribution function for non-uniform interval symbolic data, the calculation formula of mean and variance of non-uniform interval variables is achieved. Furthermore, covariance and correlation coefficient between two non-uniform interval variables are solved based on their empirical joint distribution function. Finally an example is given.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Bock, H.H., Diday, E. (Eds.).2000. Analysis of Symbolic Data, Springer-Verlag Berlin, New York.
 
2
Billard, L. and Diday, E. 2003. From the Statistics of Data to the Statistics of Knowledge: Symbolic Data Analysis. Journal of the American Statistical Association, 98, 462(Jun. 2003), 470--487.
 
3
Hu, Y., Wang, H.W. 2004. A new data mining method based on huge data and its application. Journal of Beijing University of Aeronautics and Astronautics (Social Sciences). 17(Jun. 2004), 40--44.
 
4
Hu, Y., Wang, H.W. 2004. An interval data factor analysis method and its application. Application of Statistics and Management. 23, (Jul. 2004), 53--58.
 
5
Cazes, P., Chouakria, A., Diday, E. 2000. Symbolic principal components analysis, in: Analysis of Symbolic Data (Eds. H.H. Bock, E. Diday), Springer-Verlag Berlin, New York.
 
6
 
7

Collaborative Colleagues:
Jun-peng Guo: colleagues
Wen-hua Li: colleagues
Feng Gao: colleagues