ACM Home Page
Please provide us with feedback. Feedback
A fast algorithm for finding frequent episodes in event streams
Full text PdfPdf (751 KB)
Source
International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
San Jose, California, USA
SESSION: Research track papers table of contents
Pages: 410 - 419  
Year of Publication: 2007
ISBN:978-1-59593-609-7
Authors
Srivatsan Laxman  Microsoft Research Labs
P. S. Sastry  Indian Institute of Science
K. P. Unnikrishnan  General Motors R & D Center
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1281192.1281238
What is a DOI?

ABSTRACT

Frequent episode discovery is a popular framework for mining data available as a long sequence of events. An episode is essentially a short ordered sequence of event types and the frequency of an episode is some suitable measure of how often the episode occurs in the data sequence. Recently,we proposed a new frequency measure for episodes based on the notion of non-overlapped occurrences of episodes in the event sequence, and showed that, such a definition, in addition to yielding computationally efficient algorithms, has some important theoretical properties in connecting frequent episode discovery with HMM learning. This paper presents some new algorithms for frequent episode discovery under this non-overlapped occurrences-based frequency definition. The algorithms presented here are better (by a factor of N, where N denotes the size of episodes being discovered) in terms of both time and space complexities when compared to existing methods for frequent episode discovery. We show through some simulation experiments, that our algorithms are very efficient. The new algorithms presented here have arguably the least possible orders of spaceand time complexities for the task of frequent episode discovery.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
G. Casas-Garriga. Discovering unbounded episodes in sequential data. In Proceedings of the 7th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD'03), pages 83--94, Cavtat-Dubvrovnik, Croatia, 2003.
 
3
 
4
R. Gwadera, M. J. Atallah, and W. Szpankowski. Markov models for identification of significant episodes. In Proceedings of the 2005 SIAM International Conference on Data Mining (SDM-05), Newport Beach, California, April 2005.
 
5
S. Laxman. Discovering frequent episodes in event streams: Fast algorithms, connections with HMMs and generalizations. PhD thesis, Dept. of Electrical Engineering, Indian Institute of Science, Bangalore, India, Mar. 2006.
 
6
 
7
S. Laxman, P. S. Sastry, and K. P. Unnikrishnan. Discovering frequent generalized episodes when events persist for different durations. To appear in IEEE Transactions on Knowledge and Data Engineering, 2007.
 
8
 
9
 
10
M. Regnier and W. Szpankowski. On pattern frequency occurrences in a Markovian sequence. Algorithmica, 22:631--649, 1998.


Collaborative Colleagues:
Srivatsan Laxman: colleagues
P. S. Sastry: colleagues
K. P. Unnikrishnan: colleagues