ACM Home Page
Please provide us with feedback. Feedback
Discovery in multi-attribute data with user-defined constraints
Full text PdfPdf (510 KB)
Source ACM SIGKDD Explorations Newsletter archive
Volume 4 ,  Issue 1  (June 2002) table of contents
COLUMN: Constraints in data mining table of contents
Pages: 56 - 64  
Year of Publication: 2002
ISSN:1931-0145
Authors
Chang-Shing Perng  IBM Thomas J. Watson Research Center, Hawthorne, NY
Haixun Wang  IBM Thomas J. Watson Research Center, Hawthorne, NY
Sheng Ma  IBM Thomas J. Watson Research Center, Hawthorne, NY
Joseph L. Hellerstein  IBM Thomas J. Watson Research Center, Hawthorne, NY
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 14,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues   peer to peer  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/568574.568583
What is a DOI?

ABSTRACT

There has been a growing interest in mining frequent itemsets in relational data with multiple attributes. A key step in this approach is to select a set of attributes that group data into transactions and a separate set of attributes that labels data into items. Unsupervised and unrestricted mining, however, is stymied by the combinatorial complexity and the quantity of patterns as the number of attributes grows. In this paper, we focus on leveraging the semantics of the underlying data for mining frequent itemsets. For instance, there are usually taxonomies in the data schema and functional dependencies among the attributes. Domain knowledge and user preferences often have the potential to significantly reduce the exponentially growing mining space. These observations motivate the design of a user-directed data mining framework that allows such domain knowledge to guide the mining process and control the mining strategy. We show examples of tremendous reduction in computation by using domain knowledge in mining relational data with multiple attributes.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
4
 
5
J. Deogun, V. Raghavan, A. Sarkar, and H. Sever. Data mining: Research trends, challenges, and applications, 1997.
 
6
H. B. Enderton. A Mathematical Introduction to Logic. Academic Press, 2nd edition, December 2000.
 
7
 
8
9
 
10
11
 
12
B. Padmanabhan and A. Tuzhilin. Unexpectedness as a measure of interestingness in knowledge discovery, 1999.
 
13
14
 
15
 
16
 
17
R. Srikant, Q. Vu, and R. Agrawal. Mining association rules with item constraints. In Int'l Conf. on Knowledge Discovery and Data Mining (SIGKDD), pages 67-93, 1997.
 
18
A. Tarski. A lattice-theoretical fixpoint theorem and its applications. Pacific J. Math., pages 253-309, 1955.
19


Collaborative Colleagues:
Chang-Shing Perng: colleagues
Haixun Wang: colleagues
Sheng Ma: colleagues
Joseph L. Hellerstein: colleagues

Peer to Peer - Readers of this Article have also read: