ACM Home Page
Please provide us with feedback. Feedback
CloseGraph: mining closed frequent graph patterns
Full text PdfPdf (496 KB)
Source International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Washington, D.C.
SESSION: Research track table of contents
Pages: 286 - 295  
Year of Publication: 2003
ISBN:1-58113-737-0
Authors
Xifeng Yan  University of Illinois at Urbana-Champaign, Urbana, Illinois
Jiawei Han  University of Illinois at Urbana-Champaign, Urbana, Illinois
Sponsors
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 24,   Downloads (12 Months): 155,   Citation Count: 61
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/956750.956784
What is a DOI?

ABSTRACT

Recent research on pattern discovery has progressed form mining frequent itemsets and sequences to mining structured patterns including trees, lattices, and graphs. As a general data structure, graph can model complicated relations among data with wide applications in bioinformatics, Web exploration, and etc. However, mining large graph patterns in challenging due to the presence of an exponential number of frequent subgraphs. Instead of mining all the subgraphs, we propose to mine closed frequent graph patterns. A graph g is closed in a database if there exists no proper supergraph of g that has the same support as g. A closed graph pattern mining algorithm, CloseGraph, is developed by exploring several interesting pruning methods. Our performance study shows that CloseGraph not only dramatically reduces unnecessary subgraphs to be generated but also substantially increases the efficiency of mining, especially in the presence of large graph patterns.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
T. Asai, K. Abe, S. Kawasoe, H. Arimura, H. Satamoto, and S. Arikawa. Efficient substructure discovery from large semi-structured data. In SDM'02.
 
3
 
4
 
5
 
6
L. Dehaspe, H. Toivonen, and R. King. Finding frequent substructures in chemical compounds. In KDD'98.
 
7
I. Eidhammer, I. Jonassen, and W. R. Taylor Structure comparison and structure patterns. In J. of Comp. Bio. 7, 2000.
8
 
9
L. B. Holder, D. J. Cook, and S. Djoko. Substructure discovery in the subdue system. In KDD'94.
 
10
 
11
 
12
 
13
 
14
J. Pei, J. Han, and R. Mao. CLOSET: An efficient algorithm for mining frequent closed itemsets. In DMKD'00.
 
15
 
16
X. Yan and J. Han. gSpan: Graph-based substructure pattern mining. UIUC-CS Tech. Report: R-2002-2296, (a 4-page short version in ICDM'02).
 
17
X. Yan, J. Han, and R. Afshar. CloSpan: Mining closed sequential patterns in large datasets. In SDM'03.
 
18
M. J. Zaki and C. J. Hsiao. CHARM: An efficient algorithm for closed itemset mining. In SDM'02.
19

CITED BY  62