ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
Distributed approximate mining of frequent patterns
Full text PdfPdf (263 KB)
Source Symposium on Applied Computing archive
Proceedings of the 2005 ACM symposium on Applied computing table of contents
Santa Fe, New Mexico
SESSION: Data mining (DM) table of contents
Pages: 529 - 536  
Year of Publication: 2005
ISBN:1-58113-964-0
Authors
Claudio Silvestri  Universita Ca' Foscari, Via Torino, Venezia, Italy
Salvatore Orlando  Universita Ca' Foscari, Via Torino, Venezia, Italy
Sponsor
SIGAPP: ACM Special Interest Group on Applied Computing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 55,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1066677.1066796
What is a DOI?

ABSTRACT

This paper discusses a novel communication efficient distributed algorithm for approximate mining of frequent patterns from transactional databases. The proposed algorithm consists in the distributed exact computation of locally frequent itemsets and an effective method for inferring the local support of locally unfrequent itemsets. The combination of the two strategies gives a good approximation of the set of the globally frequent patterns and their supports. Several tests on publicly available datasets were conducted, aimed at evaluating the similarity between the exact result set and the approximate ones returned by our distributed algorithm as well as the scalability of the proposed method.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Workshop on frequent itemset mining implementations FIMI'03 in conjunction with ICDM'03. In fimi.cs.helsinki.fi, 2003.
 
2
3
 
4
 
5
6
7
 
8
 
9
 
10
 
11
 
12
Karolien Geurts, Geert Wets, Tom Brijs, and Koen Vanhoof. Profiling high frequency accident locations using association rules. In Proceedings of the 82nd Annual Transportation Research Board, Washington DC. (USA), January 12-16, page 18pp, 2003.
 
13
14
 
15
B. Park and H. Kargupta. Distributed Data Mining: Algorithms, Systems, and Applications. In Data Mining Handbook, pages 341--358. IEA, 2002.
16
 
17
 
18
 
19
 
20
21
 
22
 
23
 
24
S. Orlando, P. Palmerini, R. Perego, C. Lucchese, and F. Silvestri. kdci: a multi-strategy algorithm for mining frequent sets. In Proceedings of the workshop on Frequent Itemset Mining Implementations FIMI'03 in conjunction with ICDM'03, 2003.
 
25
 
26
S. Orlando, P. Palmerini, R. Perego, and F. Silvestri. An efficient parallel and distributed algorithm for counting frequent sets. In Proc. of Int. Conf. VECPAR 2002 - LNCS 2565, pages 197--204. Spinger, 2002.
 
27
 
28
 
29
 
30


Collaborative Colleagues:
Claudio Silvestri: colleagues
Salvatore Orlando: colleagues