|
ABSTRACT
Inductive Logic Programming (ILP) systems have been successfully applied to solve complex biological problem by viewing them as binary classification tasks. It remains an open question how an accurate solution to a multi-class problem can be obtained by using a logic based learning method. In this paper we present a novel logic based approach to solve complex and challenging multi-class classification problems in bioinformatics by focusing on a particular task, namely protein fold recognition. Our technique is based on the use of large margin kernel-based methods in conjunction with first order rules induced by an ILP system. The proposed approach learns a multi-class classifier by using a divide and conquer reduction strategy that splits multi-classes into binary groups and solves each individual problem recursively hence generating an underlying decision list structure. The method is applied to assigning protein domains to folds. Experimental evaluation of the method demonstrates the efficacy of the proposed approach to solving complex multi-class classification problems in bioinformatics.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
C. H. Ding and I. Dubchak. Multi-class protein fold recognition using support vector machines and neural networks. Bioinformatics, 17:349--358, 2001.
|
| |
4
|
|
| |
5
|
|
| |
6
|
Niels Landwehr , Andrea Passerini , Luc De Raedt , Paolo Frasconi, kFOIL: learning simple relational kernels, Proceedings of the 21st national conference on Artificial intelligence, p.389-394, July 16-20, 2006, Boston, Massachusetts
|
| |
7
|
S. Muggleton, H. Lodhi, A. Amini, and M. J. E. Sternberg. Support Vector Inductive Logic Programming. In Proceedings of the Eighth International Conference on Discovery Science, volume 735 of LNAI, pages 163--175. Springer Verlag, 2005.
|
| |
8
|
A. G. Murzin, S. E. Brenner, T. Hubbard, and C. Chothia. SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol., 247(536--540), 1995.
|
| |
9
|
|
| |
10
|
|
| |
11
|
|
| |
12
|
M. Turcotte, S. Muggleton, and J. E. Sternberg. Automated discovery of structural signatures of protein fold and function. J. Mol. Biol., 306:591--605, 2001.
|
|