ACM Home Page
Please provide us with feedback. Feedback
Mining rank-correlated sets of numerical attributes
Full text PdfPdf (816 KB)
Source International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Philadelphia, PA, USA
SESSION: Research track papers table of contents
Pages: 96 - 105  
Year of Publication: 2006
ISBN:1-59593-339-5
Authors
Toon Calders  University of Antwerp
Bart Goethals  University of Antwerp
Szymon Jaroszewicz  Szczecin University of Technology, Warsaw, Poland
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 58,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1150402.1150417
What is a DOI?

ABSTRACT

We study the mining of interesting patterns in the presence of numerical attributes. Instead of the usual discretization methods, we propose the use of rank based measures to score the similarity of sets of numerical attributes. New support measures for numerical data are introduced, based on extensions of Kendall's tau, and Spearman's Footrule and rho. We show how these support measures are related. Furthermore, we introduce a novel type of pattern combining numerical and categorical attributes. We give efficient algorithms to find all frequent patterns for the proposed support measures, and evaluate their performance on real-life datasets.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
3
 
4
F. Bodon and L. Schmidt-Thieme. The relation of closed itemset mining, complete pruning strategies and item ordering in apriori-based fim algorithms. In Proc. PKDD, pages 437--444, 2005.
 
5
T. Calders and B. Goethals. Depth-first non-derivable itemset mining. In Proc. SIAM DM, 2005.
 
6
P. Diaconis and R. Graham. Spearman's footrule as a measure of disarray. J. of the Royal Statistical Society, Series B, 39(2):262--268, 1977.
 
7
S. Dzeroski and L. Todorovski. Discovering dynamics: From inductive logic programming to machine discovery. Journal of Intelligent Information Systems, 4:89--108, 1995.
 
8
 
9
B. Goethals. Frequent set mining. In In The Data Mining and Knowledge Discovery Handbook, chapter 17, pages 377--397. 2005.
 
10
S. Hettich and S. D. Bay. The UCI KDD Archive {http://kdd.ics.uci.edu} Irvine, CA: University of California, Dept. of inf. and comp. science, 1999.
11
 
12
M. Kendall. Rank Correlation Methods. Oxford University Press, 1990.
 
13
A. M. Liebetrau. Measures of Association, volume 32 of Quantitative Applications in the Social Sciences. Sage Publications, Inc., 1983.
14
 
15
G. I. Webb S. Huang. Discarding insignificant rules during impact rule discovery in large databases. In Proc. SIAM DM, pages 541--545, 2005.
16
17


Collaborative Colleagues:
Toon Calders: colleagues
Bart Goethals: colleagues
Szymon Jaroszewicz: colleagues