| Mining quantitative association rules in large relational tables |
| Full text |
Pdf
(1.22 MB)
|
| Source
|
International Conference on Management of Data
archive
Proceedings of the 1996 ACM SIGMOD international conference on Management of data
table of contents
Montreal, Quebec, Canada
Pages: 1 - 12
Year of Publication: 1996
ISBN:0-89791-794-4
Also published in ...
|
|
Authors
|
|
Ramakrishnan Srikant
|
IBM Almaden Research Center, 650 Harry Road, San Jose, CA and Department of Computer Science, University of Wisconsin, Madison
|
|
Rakesh Agrawal
|
IBM Almaden Research Center, 650 Harry Road, San Jose, CA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 22, Downloads (12 Months): 198, Citation Count: 206
|
|
|
ABSTRACT
We introduce the problem of mining association rules in large relational tables containing both quantitative and categorical attributes. An example of such an association might be "10% of married people between age 50 and 60 have at least 2 cars". We deal with quantitative attributes by fine-partitioning the values of the attribute and then combining adjacent partitions as necessary. We introduce measures of partial completeness which quantify the information lost due to partitioning. A direct application of this technique can generate too many similar rules. We tackle this problem by using a "greater-than-expected-value" interest measure to identify the interesting rules in the output. We give an algorithm for mining such quantitative association rules. Finally, we describe the results of using this approach on a real-life dataset.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
AIS93
|
Rakesh Agrawal , Tomasz Imieliński , Arun Swami, Mining association rules between sets of items in large databases, Proceedings of the 1993 ACM SIGMOD international conference on Management of data, p.207-216, May 25-28, 1993, Washington, D.C., United States
|
| |
AS94
|
|
 |
BKSS90
|
Norbert Beckmann , Hans-Peter Kriegel , Ralf Schneider , Bernhard Seeger, The R*-tree: an efficient and robust access method for points and rectangles, Proceedings of the 1990 ACM SIGMOD international conference on Management of data, p.322-331, May 23-26, 1990, Atlantic City, New Jersey, United States
|
| |
HF95
|
|
| |
HS95
|
|
| |
JD88
|
|
| |
MTV94
|
Heikki Marmila, Harmu Toivonen, and A. Inkeri Verkamo. Efficient algorithms for discovering association rules. In KDD-94: AAAI Workshop on Knowledge Discovery in Databases, pages 181- 192, Seattle, Washington, July 1994.
|
 |
PCY95
|
Jong Soo Park , Ming-Syan Chen , Philip S. Yu, An effective hash-based algorithm for mining association rules, Proceedings of the 1995 ACM SIGMOD international conference on Management of data, p.175-186, May 22-25, 1995, San Jose, California, United States
|
| |
PS91
|
|
| |
SA95
|
|
| |
SON95
|
|
| |
ST95
|
Avi Silberschatz and Alexander Tuzhilin. On Subjective Measures of Interestingness in Knowledge Discovery. In Proc. of the First Int'l Conference on Knowledge Discovery and Data Mining, Montreal, Canada, August 1995.
|
CITED BY 206
|
|
|
|
|
|
|
|
|
|
|
Charų C. Aggarwal , Zheng Sun , Philip S. Yu, Online algorithms for finding profile association rules, Proceedings of the seventh international conference on Information and knowledge management, p.86-95, November 02-07, 1998, Bethesda, Maryland, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Anthony K.H. Tung , Hongjun Lu , Jiawei Han , Ling Feng, Breaking the barrier of transactions: mining inter-transaction association rules, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, p.297-301, August 15-18, 1999, San Diego, California, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Takeshi Fukuda , Yasukiko Morimoto , Shinichi Morishita , Takeshi Tokuyama, Data mining using two-dimensional optimized association rules: scheme, algorithms, and visualization, ACM SIGMOD Record, v.25 n.2, p.13-23, June 1996
|
|
|
|
|
|
Helen Pinto , Jiawei Han , Jian Pei , Ke Wang , Qiming Chen , Umeshwar Dayal, Multi-dimensional sequential pattern mining, Proceedings of the tenth international conference on Information and knowledge management, October 05-10, 2001, Atlanta, Georgia, USA
|
|
|
|
|
|
Ramesh C. Agarwal , Charu C. Aggarwal , V. V. V. Prasad, Depth first generation of long patterns, Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, p.108-118, August 20-23, 2000, Boston, Massachusetts, United States
|
|
|
|
|
|
Jef Wijsen , Raymond T. Ng , Toon Calders, Discovering roll-up dependencies, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, p.213-222, August 15-18, 1999, San Diego, California, United States
|
|
|
|
|
|
Wei Wang , Jiong Yang , Philip S. Yu, Efficient mining of weighted association rules (WAR), Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, p.270-274, August 20-23, 2000, Boston, Massachusetts, United States
|
|
|
|
|
|
|
|
|
Takeshi Fukuda , Yasuhido Morimoto , Shinichi Morishita , Takeshi Tokuyama, Mining optimized association rules for numeric attributes, Proceedings of the fifteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems, p.182-191, June 04-06, 1996, Montreal, Quebec, Canada
|
|
|
Sergey Brin , Rajeev Rastogi , Kyuseok Shim, Mining optimized gain rules for numeric attributes, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, p.135-144, August 15-18, 1999, San Diego, California, United States
|
|
|
|
|
|
|
|
|
|
|
|
Bing Liu , Wynne Hsu , Yiming Ma, Pruning and summarizing the discovered associations, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, p.125-134, August 15-18, 1999, San Diego, California, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jiawei Han , Jian Pei , Behzad Mortazavi-Asl , Qiming Chen , Umeshwar Dayal , Mei-Chun Hsu, FreeSpan: frequent pattern-projected sequential pattern mining, Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, p.355-359, August 20-23, 2000, Boston, Massachusetts, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Ihab F. Ilyas , Volker Markl , Peter Haas , Paul Brown , Ashraf Aboulnaga, CORDS: automatic discovery of correlations and soft functional dependencies, Proceedings of the 2004 ACM SIGMOD international conference on Management of data, June 13-18, 2004, Paris, France
|
|
|
Shian-Hua Lin , Chi-Sheng Shih , Meng Chang Chen , Jan-Ming Ho , Ming-Tat Ko , Yueh-Ming Huang, Extracting classification knowledge of Internet documents with mining term associations: a semantic approach, Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval, p.241-249, August 24-28, 1998, Melbourne, Australia
|
|
|
Amit A. Nanavati , Krishna P. Chitrapura , Sachindra Joshi , Raghu Krishnapuram, Mining generalised disjunctive association rules, Proceedings of the tenth international conference on Information and knowledge management, October 05-10, 2001, Atlanta, Georgia, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Wai-Ho Au , Keith C. C. Chan , Andrew K. C. Wong , Yang Wang, Attribute Clustering for Grouping, Selection, and Classification of Gene Expression Data, IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB), v.2 n.2, p.83-101, April 2005
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Elke Achtert , Christian Böhm , Hans-Peter Kriegel , Peer Kröger , Arthur Zimek, Deriving quantitative models for correlation clusters, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, August 20-23, 2006, Philadelphia, PA, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Riadh Ben Messaoud , Sabine Loudcher Rabaséda , Omar Boussaid , Rokia Missaoui, Enhanced mining of association rules from data cubes, Proceedings of the 9th ACM international workshop on Data warehousing and OLAP, November 10-10, 2006, Arlington, Virginia, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Anna C. Gilbert , Yannis Kotidis , S. Muthukrishnan , Martin J. Strauss, How to summarize the universe: dynamic maintenance of quantiles, Proceedings of the 28th international conference on Very Large Data Bases, p.454-465, August 20-23, 2002, Hong Kong, China
|
|
|
|
|
|
|
|
|
Chenyong Hu , Benyu Zhang , Yongji Wang , Shuicheng Yan , Zheng Chen , Qing Wang , Qiang Yang, Learning quantifiable associations via principal sparse non-negative matrix factorization, Intelligent Data Analysis, v.9 n.6, p.603-620, November 2005
|
|
|
|
|
|
|
|
|
Jun Yan , Ning Liu , Qiang Yang , Benyu Zhang , Qiansheng Cheng , Zheng Chen, Mining Adaptive Ratio Rules from Distributed Data Sources, Data Mining and Knowledge Discovery, v.12 n.2-3, p.249-273, May 2006
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Christoph F. Eick , Rachana Parmar , Wei Ding , Tomasz F. Stepinski , Jean-Philippe Nicot, Finding regional co-location patterns for sets of continuous variables in spatial datasets, Proceedings of the 16th ACM SIGSPATIAL international conference on Advances in geographic information systems, November 05-07, 2008, Irvine, California
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Vilém Novák , Irina Perfilieva , Antonín Dvořák , Guoqing Chen , Qiang Wei , Peng Yan, Mining pure linguistic associations from numerical data, International Journal of Approximate Reasoning, v.48 n.1, p.4-22, April, 2008
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Alex Tze Hiang Sim , Maria Indrawan , Bala Srinivasan, Mining infrequent and interesting rules from transaction records, Proceedings of the 7th WSEAS International Conference on Artificial intelligence, knowledge engineering and data bases, p.515-520, February 20-22, 2008, Cambridge, UK
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
María N. Moreno , Saddys Segrera , Vivian F. López , M. José Polo, A method for mining quantitative association rules, Proceedings of the 6th WSEAS International Conference on Simulation, Modelling and Optimization, p.173-178, September 22-24, 2006, Lisbon, Portugal
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Adam Kirsch , Michael Mitzenmacher , Andrea Pietracaprina , Geppino Pucci , Eli Upfal , Fabio Vandin, An efficient rigorous approach for identifying statistically significant frequent itemsets, Proceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, June 29-July 01, 2009, Providence, Rhode Island, USA
|
|
|
|
|
|
Gaurav Pandey , Gowtham Atluri , Michael Steinbach , Chad L. Myers , Vipin Kumar, An association analysis approach to biclustering, Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, June 28-July 01, 2009, Paris, France
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|