ACM Home Page
Please provide us with feedback. Feedback
Clustering transactions using large items
Full text PdfPdf (928 KB)
Source Conference on Information and Knowledge Management archive
Proceedings of the eighth international conference on Information and knowledge management table of contents
Kansas City, Missouri, United States
Pages: 483 - 490  
Year of Publication: 1999
ISBN:1-58113-146-1
Authors
Ke Wang  School of Computing, National University of Singapore
Chu Xu  School of Computing, National University of Singapore
Bing Liu  School of Computing, National University of Singapore
Sponsors
SIGART: ACM Special Interest Group on Artificial Intelligence
SIGIR: ACM Special Interest Group on Information Retrieval
SIGMIS: ACM Special Interest Group on Management Information Systems
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 57,   Citation Count: 26
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/319950.320054
What is a DOI?

ABSTRACT

In traditional data clustering, similarity of a cluster of objects is measured by pairwise similarity of objects in that cluster. We argue that such measures are not appropriate for transactions that are sets of items. We propose the notion of large items, i.e., items contained in some minimum fraction of transactions in a cluster, to measure the similarity of a cluster of transactions. The intuition of our clustering criterion is that there should be many large items within a cluster and little overlapping of such items across clusters. We discuss the rationale behind our approach and its implication on providing a better solution to the clustering problem. We present a clustering algorithm based on the new clustering criterion and evaluate its effectiveness.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
3
 
4
 
5
 
6
M. Ester, H-P Kriegel, J. Sander, and X. Xu. A density-based algorithm for discovering clusters in large spatial databases with noise. KDD 1996
 
7
S. Guba, R. Rastogi, K. Shim. A clustering algorithm for categorical attributes. ICDE 1999
 
8
E.H. Hart, G. Karypis, V. Kumar and B. Mobasher. Clustering based on association rule hypergraphs. SIGMOD workshop on research issues on Data Mining and Knowledge Discovery, 1997
 
9
 
10
L. Kaufman and P.J. Rousseeuw. Finding groups in data: an introduction to cluster analysis, John Wiley 8c Son, 1990
 
11
 
12
 
13
 
14
 
15
O. Zamir, O. Etzioni, O. Madani and R. M. Karp. Fast and intuitive clustering of web documents. KDD 1997, 287-290
16

CITED BY  26