ACM Home Page
Please provide us with feedback. Feedback
Discovery of interesting episodes in sequence data
Full text PdfPdf (69 KB)
Source Symposium on Applied Computing archive
Proceedings of the 2006 ACM symposium on Applied computing table of contents
Dijon, France
SESSION: Data mining (DM) table of contents
Pages: 598 - 602  
Year of Publication: 2006
ISBN:1-59593-108-2
Authors
Ambika Srinivasan  The University of Texas at Arlington
Dhawal Bhatia  The University of Texas at Arlington
Sharma Chakravarthy  The University of Texas at Arlington
Sponsor
SIGAPP: ACM Special Interest Group on Applied Computing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 63,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1141277.1141414
What is a DOI?

ABSTRACT

There is considerable body of work on sequence mining of transactional data. Most of the related work on point data (not significant intervals) makes several passes over the entire dataset in order to discover frequently occurring (sequential) patterns. But Hybrid apriori, proposed in this paper, as the name implies is an apriori-class of mining algorithm in SQL and takes a different approach. Significant intervals for each event (or device) is computed first and used for detecting frequent event patterns. The advantages of this approach are that the data set is compressed to find significant intervals thereby reducing the size of input used. Also, each event/device is processed individually allowing for parallel computation of individual events. Then the hybrid apriori algorithm works on the significant intervals using an apriori-style algorithm adapted to intervals. Our approach has significant advantages over the traditional mining algorithms in terms of its efficiency, scalability and storage requirements.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Mannila, H., H. Toivonen, and I. Verkamo, Discovering Frequent Episodes in Sequences, in Proc. of KDD 1995.
 
3
 
4
A. Srinivasan, S. Shrestha, and S. Chakravarthy. Discovery of Significant Intervals in Sequential Data, in Proc. of ADBIS Workshop on Data Mining and Knowledge Discovery. 2005.
 
5
Joshi, M., G. Karypis, and V. Kumar. A Universal Formulation of Sequential Patterns, in Technical Report. Dept. of Computer Science, University of Minnesota. 1999.
 
6
 
7
8


Collaborative Colleagues:
Ambika Srinivasan: colleagues
Dhawal Bhatia: colleagues
Sharma Chakravarthy: colleagues