ACM Home Page
Please provide us with feedback. Feedback
Sliding-window filtering: an efficient algorithm for incremental mining
Full text PdfPdf (1.59 MB)
Source Conference on Information and Knowledge Management archive
Proceedings of the tenth international conference on Information and knowledge management table of contents
Atlanta, Georgia, USA
Session: Sequence Mining table of contents
Pages: 263 - 270  
Year of Publication: 2001
ISBN:1-58113-436-3
Authors
Chang-Hung Lee  National Taiwan University, Taipei, Taiwan, ROC
Cheng-Ru Lin  National Taiwan University, Taipei, Taiwan, ROC
Ming-Syan Chen  National Taiwan University, Taipei, Taiwan, ROC
Sponsors
SIGMIS: ACM Special Interest Group on Management Information Systems
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 58,   Citation Count: 22
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/502585.502630
What is a DOI?

ABSTRACT

We explore in this paper an effective sliding-window filtering (abbreviatedly as SWF) algorithm for incremental mining of association rules. In essence, by partitioning a transaction database into several partitions, algorithm SWF employs a filtering threshold in each partition to deal with the candidate itemset generation. Under SWF, the cumulative information of mining previous partitions is selectively carried over toward the generation of candidate itemsets for the subsequent partitions. Algorithm SWF not only significantly reduces I/O and CPU cost by the concepts of cumulative filtering and scan reduction techniques but also effectively controls memory utilization by the technique of sliding-window partition. Algorithm SWF is particularly powerful for efficient incremental mining for an ongoing time-variant transaction database. By utilizing proper scan reduction techniques, only one scan of the incremented dataset is needed by algorithm SWF. The I/O cost of SWF is, in orders of magnitude, smaller than those required by prior methods, thus resolving the performance bottleneck. Experimental studies are performed to evaluate performance of algorithm SWF. It is noted that the improvement achieved by algorithm SWF is even more prominent as the incremented portion of the dataset increases and also as the size of the database increases.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
 
3
4
5
 
6
 
7
 
8
 
9
 
10
11
12
13
14
 
15
 
16
17
 
18
 
19
 
20
 
21
S. Thomas, S. Bodagala, K. Alsabti, and S. Ranka. An Efficient Algorithm for the Incremental Updation of Association Rules in Large Databases. Proc. of 1997 Ink Conf. on Knowledge Discovery and Data Mining, 1997.
 
22
 
23
 
24

CITED BY  22

Collaborative Colleagues:
Chang-Hung Lee: colleagues
Cheng-Ru Lin: colleagues
Ming-Syan Chen: colleagues