ACM Home Page
Please provide us with feedback. Feedback
Polynomial association rules with applications to logistic regression
Full text PdfPdf (854 KB)
Source International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Philadelphia, PA, USA
POSTER SESSION: Research track posters table of contents
Pages: 586 - 591  
Year of Publication: 2006
ISBN:1-59593-339-5
Author
Szymon Jaroszewicz  Szczecin University of Technology, Szczecin, Poland & National Institute of Telecommunications, Warsaw, Poland
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 77,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1150402.1150472
What is a DOI?

ABSTRACT

A new class of associations (polynomial itemsets and polynomial association rules) is presented which allows for discovering nonlinear relationships between numeric attributes without discretization. For binary attributes, proposed associations reduce to classic itemsets and association rules. Many standard association rule mining algorithms can be adapted to finding polynomial itemsets and association rules. We applied polynomial associations to add non-linear terms to logistic regression models. Significant performance improvement was achieved over stepwise methods, traditionally used in statistics, with comparable accuracy.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
A. Agresti. An Introduction to Categorical Data Analysis. John Wiley & Sons, Inc., New York, 1996.
 
3
K. Duan and S. Keerthi. Which is the best multiclass SVM method? An empirical study. In Multiple Classifier Systems, 6th International Workshop (MCS'05), pages 278--285, Seaside, CA, 2005.
 
4
S. Dzeroski and L. Todorovski. Discovering dynamics. In ICML, pages 97--103, 1993.
 
5
 
6
J. Freyberger, N. T. Heffernan, and C. Ruiz. Using association rules to guide a search for best fitting transfer models of student learning. In Workshop on Analyzing Student-Tutor Interaction Logs to Improve Educational Outcomes at the 7th Annual Intelligent Tutoring Systems Conference, Maceio, Brazil, 2004.
 
7
J. Friedman, T. Hastie, and R. Tibshirani. Additive logistic regression: a statistical view of boosting. Technical report, Dept. of Statistics, Stanford University, 1998.
 
8
B. Goethals. Survey on frequent pattern mining. Manuscript, 2003.
 
9
R Development Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria, 2005. http://www.R-project.org.
10
11
12
 
13
I. H. Witten and E. Frank. Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann, 2005.
 
14
X. Yin and J. Han. CPAR: Classification based on predictive association rules. In SIAM International Conference on Data Mining (SDM), 2003.
 
15
R. Zembowicz and J. M. Zytkow. Discovery of equations: Experimental evaluation of convergence. In 10th National Conference on Artificial Intelligence (AAAI), pages 70--75, San Jose, CA, 1992.