ACM Home Page
Please provide us with feedback. Feedback
Detecting changes in large data sets of payment card data: a case study
Full text PdfPdf (786 KB)
Source
International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
San Jose, California, USA
SESSION: Industrial and government track short papers table of contents
Pages: 1018 - 1022  
Year of Publication: 2007
ISBN:978-1-59593-609-7
Authors
Chris Curry  Open Data Group, River Forest, IL
Robert L. Grossman  Open Data Group, River Forest, IL
David Locke  Open Data Group, River Forest, IL
Steve Vejcik  Open Data Group, River Forest, IL
Joseph Bugajski  Visa International, Foster City, CA
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1281192.1281303
What is a DOI?

ABSTRACT

An important problem in data mining is detecting changes in large datasets. Although there are a variety of change detection algorithms that have been developed, in practice it can be a problem to scale these algorithms to large data sets due to the heterogeneity of the data. In this paper, we describe a case study involving payment card data in which we built and monitored a separate change detection model for each cell in a multi-dimensional data cube. We describe a system that has been in operation for the past two years that builds and monitors over 15,000 separate baseline models and the process that isused for generating and investigating alerts using these baselines.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Alan Agresti, An Introduction to Categorical Data Analysis, John Wiley and Sons, Inc., New York, 1996.
 
2
The Augustus open source data mining system can be downloaded from www.sourceforge.net/projects/augustus.
 
3
 
4
 
5
Joseph Bugajski, Robert Grossman, Eric Sumner, Tao Zhang, A Methodology for Establishing Information Quality Baselines for Complex, Distributed Systems, 10th International Conference on Information Quality (ICIQ), 2005.
6
 
7
Robert L. Grossman, PMML Models for Detecting Changes, Proceedings of the KDD-05 Workshop on Data Mining Standards, Services and Platforms (DM-SSP 05), ACM Press, New York, 2005, pages 6--15.
8


Collaborative Colleagues:
Chris Curry: colleagues
Robert L. Grossman: colleagues
David Locke: colleagues
Steve Vejcik: colleagues
Joseph Bugajski: colleagues