ACM Home Page
Please provide us with feedback. Feedback
Mining influential attributes that capture class and group contrast behaviour
Full text PdfPdf (704 KB)
Source
Conference on Information and Knowledge Management archive
Proceeding of the 17th ACM conference on Information and knowledge management table of contents
Napa Valley, California, USA
SESSION: KM: feature selection table of contents
Pages 971-980  
Year of Publication: 2008
ISBN:978-1-59593-991-3
Authors
Elsa Loekito  University of Melbourne, Melbourne, Australia
James Bailey  University of Melbourne, Melbourne, Australia
Sponsors
ACM: Association for Computing Machinery
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 97,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1458082.1458210
What is a DOI?

ABSTRACT

Contrast data mining is a key tool for finding differences between sets of objects, or classes, and contrast patterns are a popular method for discrimination between two classes. However, such patterns can be limited in two primary ways: i) They do not readily allow second order differentiation - i.e. discovering contrasts of contrasts, ii) Mining contrast patterns often results in an overwhelming volume of output for the user. To address these limitations, this paper proposes a method which can identify contrast behaviour across both classes and also groups of classes. Furthermore, to increase interpretability for the user, it presents a new technique for finding the attributes which represent the key underlying factors behind the contrast behaviour. The associated mining task is computationally challenging and we describe an efficient algorithm to handle it, based on binary decision diagrams. Experimental results demonstrate that our technique can efficiently identify and explain contrast behaviour which would be difficult or impossible to isolate using standard techniques.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
 
3
 
4
H. Fan and K. Ramamohanarao. Noise tolerant classification by chi emerging patterns. In Proc. of PAKDD 2004, pages 201--206, 2004.
 
5
U. Fayyad and K. Irani. Multi-interval discretization of continuous-valued attributes for classification. In Proc. of 13th International Joint Conference on Artificial Intelligence, pages 1022--1029, 1993.
 
6
S. Hettich and S. D. Bay. The UCI KDD archive {http://kdd.ics.uci.edu}, 1999.
7
 
8
J. Li and L. Wong. Identifying good diagnostic gene groups from gene expression profiles using the concept of emerging patterns. Bioinformatics, 18(5):725--34, 2002.
 
9
J. Li and Q. Yang. Strong compound-risk factors: Efficient discovery through emerging patterns and contrast sets. IEEE Trans. on Information Technology in Biomedicine, 11(5):544--552, 2007.
10
 
11
 
12
U. Roessner, J. H. Paterson, M. G. Forbes, G. B. Fincher, P. Langridge, and A. Bacic. An investigation of boron toxicity in barley using metabolomics. Plant Physiology, 142(3):1087--1101, November 2006.
 
13
14
15

Collaborative Colleagues:
Elsa Loekito: colleagues
James Bailey: colleagues