ACM Home Page
Please provide us with feedback. Feedback
Multi-class protein fold recognition using large margin logic based divide and conquer learning
Full text PdfPdf (570 KB)
Source International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the KDD-09 Workshop on Statistical and Relational Learning in Bioinformatics table of contents
Paris, France
Pages 22-26  
Year of Publication: 2009
ISBN:978-1-60558-667-0
Authors
Huma Lodhi  Imperial College London
Stephen Muggleton  Imperial College London
Mike J. E. Sternberg  Imperial College London
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 13,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1562090.1562095
What is a DOI?

ABSTRACT

Inductive Logic Programming (ILP) systems have been successfully applied to solve complex biological problem by viewing them as binary classification tasks. It remains an open question how an accurate solution to a multi-class problem can be obtained by using a logic based learning method. In this paper we present a novel logic based approach to solve complex and challenging multi-class classification problems in bioinformatics by focusing on a particular task, namely protein fold recognition. Our technique is based on the use of large margin kernel-based methods in conjunction with first order rules induced by an ILP system. The proposed approach learns a multi-class classifier by using a divide and conquer reduction strategy that splits multi-classes into binary groups and solves each individual problem recursively hence generating an underlying decision list structure. The method is applied to assigning protein domains to folds. Experimental evaluation of the method demonstrates the efficacy of the proposed approach to solving complex multi-class classification problems in bioinformatics.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
C. H. Ding and I. Dubchak. Multi-class protein fold recognition using support vector machines and neural networks. Bioinformatics, 17:349--358, 2001.
 
4
 
5
 
6
 
7
S. Muggleton, H. Lodhi, A. Amini, and M. J. E. Sternberg. Support Vector Inductive Logic Programming. In Proceedings of the Eighth International Conference on Discovery Science, volume 735 of LNAI, pages 163--175. Springer Verlag, 2005.
 
8
A. G. Murzin, S. E. Brenner, T. Hubbard, and C. Chothia. SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol., 247(536--540), 1995.
 
9
 
10
 
11
 
12
M. Turcotte, S. Muggleton, and J. E. Sternberg. Automated discovery of structural signatures of protein fold and function. J. Mol. Biol., 306:591--605, 2001.

Collaborative Colleagues:
Huma Lodhi: colleagues
Stephen Muggleton: colleagues
Mike J. E. Sternberg: colleagues