| Capturing truthiness: mining truth tables in binary datasets |
| Full text |
Pdf
(488 KB)
|
Source
|
Symposium on Applied Computing
archive
Proceedings of the 2009 ACM symposium on Applied Computing
table of contents
Honolulu, Hawaii
SESSION: Data mining track
table of contents
Pages 1467-1474
Year of Publication: 2009
ISBN:978-1-60558-166-8
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 6, Downloads (12 Months): 35, Citation Count: 0
|
|
|
ABSTRACT
We introduce a new data mining problem: mining truth tables in binary datasets. Given a matrix of objects and the properties they satisfy, a truth table identifies a subset of properties that exhibit maximal variability (and hence, complete independence) in occurrence patterns over the underlying objects. This problem is relevant in many domains, e.g., in bioinformatics where we seek to identify and model independent components of combinatorial regulatory pathways, and in social/economic demographics where we desire to determine independent behavioral attributes of populations. We outline a family of levelwise approaches adapted to mining truth tables, algorithmic optimizations, and applications to bioinformatics and political datasets.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
C. Owens et al. Capturing truthiness: Mining truth tables in binary datasets. Technical report, Virginia Tech, March 2007. http://eprints.cs.vt.edu/archive/00000948/.
|
| |
3
|
|
| |
4
|
|
| |
5
|
|
 |
6
|
|
| |
7
|
|
| |
8
|
J. Fitzgerald et al. Systems Biology and Combination Therapy in the Quest for Clinical Efficacy. Nature Chemical Biology, Vol. 2(9): 458--466, Sep 2006.
|
 |
9
|
|
| |
10
|
J. K. Seppanen et al. Using and Extending Itemsets in Data Mining. PhD thesis, Helsinki University of Technology, 2006.
|
| |
11
|
J. L. Tuegels et al. Generalized Graphical Models for Discrete Data. Statistics and Probability Letters, Vol. 38: 41--47, May 1998.
|
| |
12
|
|
| |
13
|
L. O. Barrera et al. The transcriptional regulatory code of eukaryotic cells-insights from genome-wide analysis of chromatin organization and transcription factor binding. Curr Opin Cell Biol, 18(3): 291--8, 2006.
|
| |
14
|
M. Natarajan et al. A Global Analysis of Cross-talk in a Mammalian Cellular Signaling Network. Nature Cell Biology, Vol. 8(6): 571--580, June 2006.
|
 |
15
|
|
 |
16
|
Naren Ramakrishnan , Deept Kumar , Bud Mishra , Malcolm Potts , Richard F. Helm, Turning CARTwheels: an alternating algorithm for mining redescriptions, Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, August 22-25, 2004, Seattle, WA, USA
[doi> 10.1145/1014052.1014083]
|
| |
17
|
|
| |
18
|
|
| |
19
|
|
| |
20
|
|
| |
21
|
T. Lee et al. Transcriptional Regulatory Networks in Saccharomyces cerevisiae. Science, 298(5594): 799--804, 2002.
|
| |
22
|
Truthiness. Wikipedia. http://en.wikipedia.org/wiki/Truthiness.
|
|