|
ABSTRACT
We introduce a new algorithm for mining sequential patterns. Our algorithm is especially efficient when the sequential patterns in the database are very long. We introduce a novel depth-first search strategy that integrates a depth-first traversal of the search space with effective pruning mechanisms.Our implementation of the search strategy combines a vertical bitmap representation of the database with efficient support counting. A salient feature of our algorithm is that it incrementally outputs new frequent itemsets in an online fashion.In a thorough experimental evaluation of our algorithm on standard benchmark data from the literature, our algorithm outperforms previous work up to an order of magnitude.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
 |
3
|
|
| |
4
|
C. Bettini, X. S. Wang, and S. Jajodia. Mining temporal relationships with multiple granularities in time sequences. Data Engineering Bulletin, 21(1):32--38, 1998.
|
| |
5
|
|
| |
6
|
|
| |
7
|
|
| |
8
|
H. Mannila, H. Toivonen, and A. I. Verkamo. Discovering frequent episodes in sequences. In KDD 1995, pages 210--215, Montreal, Quebec, Canada, 1995.
|
| |
9
|
Jian Pei , Jiawei Han , Behzad Mortazavi-Asl , Helen Pinto , Qiming Chen , Umeshwar Dayal , Meichun Hsu, PrefixSpan: Mining Sequential Patterns by Prefix-Projected Growth, Proceedings of the 17th International Conference on Data Engineering, p.215-224, April 02-06, 2001
|
| |
10
|
|
| |
11
|
|
| |
12
|
|
CITED BY 71
|
|
Jen-Wei Huang , Chi-Yao Tseng , Jian-Chih Ou , Ming-Syan Chen, On progressive sequential pattern mining, Proceedings of the 15th ACM international conference on Information and knowledge management, November 06-11, 2006, Arlington, Virginia, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
V. Kapoor , P. Poncelet , F. Trousset , M. Teisseire, Privacy preserving sequential pattern mining in distributed databases, Proceedings of the 15th ACM international conference on Information and knowledge management, November 06-11, 2006, Arlington, Virginia, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Xu Yusheng , Ma Zhixin , Li Lian , Tharam S. Dillon, Effective pruning strategies for sequential pattern mining, Proceedings of the 1st international conference on Forensic applications and techniques in telecommunications, information, and multimedia and workshop, January 21-23, 2008, Adelaide, Australia
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Lei Chang , Tengjiao Wang , Dongqing Yang , Hua Luan , Shiwei Tang, Efficient algorithms for incremental maintenance of closed sequential patterns in large databases, Data & Knowledge Engineering, v.68 n.1, p.68-106, January, 2009
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Marc Plantevit , Sabine Goutier , Françoise Guisnel , Anne Laurent , Maguelonne Teisseire, Mining unexpected multidimensional rules, Proceedings of the ACM tenth international workshop on Data warehousing and OLAP, November 09-09, 2007, Lisbon, Portugal
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jian Pei , Haixun Wang , Jian Liu , Ke Wang , Jianyong Wang , Philip S. Yu, Discovering Frequent Closed Partial Orders from Strings, IEEE Transactions on Knowledge and Data Engineering, v.18 n.11, p.1467-1481, November 2006
|
|
|
|
|
|
|
|
|
|
|
|
Qihong Shao , Yi Chen , Shu Tao , Xifeng Yan , Nikos Anerousis, Efficient ticket routing by resolution sequence mining, Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, August 24-27, 2008, Las Vegas, Nevada, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jinlin Chen , Subash Shankar , Angela Kelly , Serigne Gningue , Rathika Rajaravivarma, A two stage approach for contiguous sequential pattern mining, Proceedings of the 10th IEEE international conference on Information Reuse & Integration, p.382-387, August 10-12, 2009, Las Vegas, Nevada, USA
|
|
|
|
|
|
|
|
|
Anthony J. T. Lee , Huei-Wen Wu , Tzu-Yu Lee , Ying-Ho Liu , Kuo-Tay Chen, Mining closed patterns in multi-sequence time-series databases, Data & Knowledge Engineering, v.68 n.10, p.1071-1090, October, 2009
|
|
|
Shih-Chuan Chiu , Man-Kwan Shan , Jiun-Long Huang , Hua-Fu Li, Mining polyphonic repeating patterns from music data using bit-string based approaches, Proceedings of the 2009 IEEE international conference on Multimedia and Expo, p.1170-1173, June 28-July 03, 2009, New York, NY, USA
|
|