|
ABSTRACT
The world around us changes constantly. Knowing what has changed is an important part of our lives. For businesses, recognizing changes is also crucial. It allows businesses to adapt themselves to the changing market needs. In this paper, we study changes of association rules from one time period to another. One approach is to compare the supports and/or confidences of each rule in the two time periods and report the differences. This technique, however, is too simplistic as it tends to report a huge number of rule changes, and many of them are, in fact, simply the snowball effect of a small subset of fundamental changes. Here, we present a technique to highlight the small subset of fundamental changes. A change is fundamental if it cannot be explained by some other changes. The proposed technique has been applied to a number of real-life datasets. Experiments results show that the number of rules whose changes are unexplainable is quite small (about 20% of the total number of changes discovered), and many of these unexplainable changes reflect some fundamental shifts in the application domain.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Agrawal, R. and Psaila, G. "Active data mining." KDD-95.
|
| |
2
|
|
| |
3
|
Bayardo, R., Agrawal, R, and Gunopulos, D. "Constraintbased rule mining in large, dense databases." ICDE-99, 1999.
|
| |
4
|
|
 |
5
|
|
| |
6
|
Ester, M., Frommelt, A., Kriegel, H-P., and Sander, J. "Algorithms for characterization and trend detection in spatial databases." KDD-98, 1998.
|
| |
7
|
Fayyad, U. M. and Irani, K. B. "Multi-interval discretization of continuous-valued attributes for classification learning." IJCAI-93, 1993.
|
 |
8
|
Venkatesh Ganti , Johannes Gehrke , Raghu Ramakrishnan, A framework for measuring changes in data characteristics, Proceedings of the eighteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, p.126-137, May 31-June 03, 1999, Philadelphia, Pennsylvania, United States
[doi> 10.1145/303976.303989]
|
| |
9
|
Ganti, V., Gehrke, J., and Ramakrishnan, R. "DEMON: mining and monitoring evolving data." ICDE-2000, 2000.
|
| |
10
|
|
| |
11
|
Johnson T. and Dasu, T. "Comparing massive highdimensional data sets," KDD-98, 1998.
|
| |
12
|
Kohavi, R., John, G., Long, R., Manley, D., and Pfleger, K. "MLC++: a machine learning library in C++." Tools with artificial intelligence, 740-743, 1994.
|
| |
13
|
Lane, T. and Brodley, C. "Approaches to online learning and concept drift for user identification in computer security." KDD-98, 1998.
|
| |
14
|
Liu, B., Hsu, W., and Chen, S. "Using general impressions to analyze discovered classification rules." KDD-97, 1997.
|
| |
15
|
Liu, B., Hsu, W. and Ma, Y. Discovering the set of fundamental rule changes. NUS Technical Report, 2001.
|
 |
16
|
Bing Liu , Wynne Hsu , Yiming Ma, Pruning and summarizing the discovered associations, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, p.125-134, August 15-18, 1999, San Diego, California, United States
[doi> 10.1145/312129.312216]
|
| |
17
|
Mann, P. Introductory statistics. John Wiley & Sons, 1998.
|
 |
18
|
|
| |
19
|
|
| |
20
|
|
CITED BY 10
|
|
Peter Fule , John F. Roddick, Experiences in building a tool for navigating association rule result sets, Proceedings of the second workshop on Australasian information security, Data Mining and Web Intelligence, and Software Internationalisation, p.103-108, January 01, 2004, Dunedin, New Zealand
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|