ACM Home Page
Please provide us with feedback. Feedback
Predicting the &bgr;-helix fold from protein sequence data
Full text PdfPdf (260 KB)
Source Annual Conference on Research in Computational Molecular Biology archive
Proceedings of the fifth annual international conference on Computational biology table of contents
Montreal, Quebec, Canada
Pages: 59 - 67  
Year of Publication: 2001
ISBN:1-58113-353-7
Authors
Phil Bradley  Department of Mathematics and Lab for Computer Science, MIT, Cambridge, MA
Lenore Cowen  Department of Mathematical Sciences, Johns Hopkins University, Baltimore, MD
Matthew Menke  Department of Mathematics and Lab for Computer Science, MIT, Cambridge, MA
Jonathan King  Department of Biololgy, MIT, Cambridge, MA
Bonnie Berger  Department of Mathematics and Lab for Computer Science, MIT, Cambridge, MA
Sponsor
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 25,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/369133.369171
What is a DOI?

ABSTRACT

A method is presented that uses &bgr;-strand interactions to predict the right-handed &bgr;-helix super-secondary structural motif in protein sequences. A program called BetaWrap implements this method, and is shown to score known &bgr;-helices above non-&bgr;-helices in the Protein Data Bank in cross-validation. It is demonstrated that BetaWrap learns each of the seven known SCOP &bgr;-helix families, when trained on the the known &bgr;-helices from outside the family. BetaWrap also predicts many bacterial proteins of unknown structure that play a role in human infectious disease to &bgr;-helices; in particular, these proteins serve as virulence factors, adhesins and toxins in bacterial pathogenesis, and include cell surface proteins from Chlamydia and the intestinal bacterium Helicobacter pylori. The computational method used here may generalize to other &bgr; structures for which strand topology and profiles of residue accessibility are well conserved.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
S. Altschul, W. Gish, W. Miller, E. Myers, and D. Lipman. Basic local alignment search tool. J. Mol. Biol., 215:403-410, 1990.
 
2
S. Altschul, T. Madden, A. Schaffer, J. Zhang, Z.Zhang, W. Miller, and L. Lipman. Gapped BLAST and PSI-BLAST: a new generation of protein data base search programs. Nucleic Acids Res., 25:3389-3402, 1997.
 
3
A. Bairoch and R. Apweiler. The SWISS-PROT protein database and its supplement TrEMBL in 2000. Nucleic Acids Res., 28:45-48, 2000.
 
4
B. Berger. Algorithms for protein structural motif recognition. J. of Computational Biology, 2:125-138, 1995.
 
5
B. Berger and M. Singh. An iterative method for improved protein structural motif recognition. J. of Computational Biology, 4(3):261-273, Fall 1997.
 
6
B. Berger, D. B. Wilson, E. Wolf, T. Tonchev, M. Milla, and P. S. Kim. Predicting coiled coils by use of pairwise residue correlations. Proc. of the Natl. Academy of Sci., USA, 92:8259-8263, Aug. 1995.
 
7
S. Bryant. Evaluation of threading specificity and accuracy. Proteins, 26:172-185, 1996.
 
8
S. Eddy. Hidden markov models and large-scale genome analysis. Transactions of the American American Crystallographic Association, 1997.
 
9
P. Emsley, I. Charles, N. Fairweather, and N. Isaacs. Structure of bordetella pertussis virulence factor p.69 pertactin. Nature, 381:90-92, 1996.
 
10
D. Engelman, T. Steitz, and A. Goldman. Identifying nonpolar transbilayer helices in amino acid sequences of membrane proteins {review}. Annual Review of Biophysics and Biophysical Chemistry, 15:321-53, 1986.
 
11
D. Frishman and P. Argos. Knowledge-based secondary structure assignment. Proteins: structure, function and genetics, pages 556-579, 1995.
 
12
J. Garnier, J. Gibrat, and B.Robson. GOR secondary structure prediction method version IV. Methods in Enzymology, 266:540-553, 1996.
 
13
C. Haase-Pettingell and J. King. Prevalence of temperature sensitive folding mutations in the parallel beta coil domain of the phage p22 tailspike endorhamnosidase. J. Mol. Biol., 267:88-102, 1997.
 
14
S. Heffron, G. Moe, V. Sieber, J. Mengaud, P. Cossart, J. Vitali, and F. Jurnak. Sequence profile of the parallel helix in the Pectate Lyase superfamily. Journal of Structural Biology, 122:223-235, 1998.
 
15
U. Hobohm and C. Sander. Enlarged representative set of protein structures. Protein Science, 3:522-524, 1994.
 
16
U. Hobohm, M. Scharf, R. Schneider, and C. Sander. Selection of a representative set of structures from the Brookhaven Protein Data bank. Protein Science, 1:409-417, 1992.
 
17
T. Hubbard and J. Park. Fold recognition and ab initio structure predictions using hidden Markov models and beta-strand pair potentials. Proteins, 3:398-402, 1995.
 
18
J. Jenkins, O. Mayans, and R. Pickersgill. Structure and evolution of parallel helix proteins. Journal of Structural Biology, 122:236-246, 1998.
 
19
D. Jones. Genthreader: an efficient and reliable protein fold recognition method for genomic sequences. J. Mol. Biol., 287:797-815, 1999.
 
20
D. Jones. Protein secondary structure prediction based on position-specific scoring matrices. J. Mol. Biol., 292:195-202, 1999.
 
21
D. Jones, W. Taylor, and J. Thornton. A new approach to protein fold recognition. Nature, 358:86-89, 1992.
 
22
K. Karplus, C. Barrett, and R. Hughey. Hidden Markov models for detectin remote protein homologies. Bioinformatics, 14:846-856, 1998.
 
23
L. Kelley, R. MacCallum, and M. Sternberg. Enhanced genome annotation using structure profiles in the program 3D-PSSM. J. Mol. Biol., 299(2):501-522, 2000.
 
24
P. Koehl and M. Levitt. A brighter future for protein structure prediction. Nat. Struct. Biol., 6:108-111, 1999.
 
25
S. Lifson and C. Sander. Specific recognition in the tertiary structure of sheets of proteins. Journal of Molecular Biology, 139:627-629, 1980.
 
26
A. Murzin, S. Brenner, T. Hubbard, and C. Chothia. Scop: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol., 297:536-540, 1995.
 
27
W. Pearson and D. Lipman. Improved tools for biological sequence comparison. Proc. Natl. Acad. Sci. USA, 85:2444-8, 1988.
 
28
B. Rost and C. Sander. Prediction of protein secondary structure at better than 70% accuracy. J. Mol. Biol., 232:584-599, 1993.
 
29
M. Singh, B. Berger, and P. S. Kim. Learncoil-VMF: Computational evidence for coiled coil-like motifs in many viral membrane fussion proteins. J. of Molecular Biology, 290(1):241-251, 1999.
 
30
M. Singh, B. Berger, P. S. Kim, J. Berger, and A. Cochran. Computational learning reveals coiled coil-like motifs in histidine kinase linker domains. Proc. of the Natl. Academy of Sci., USA, 95(6):2738-2743, 1998.
 
31
M. Sippl and S. Weitckus. Detection of nativelike models for amino acid sequences of unknown three-dimensional structure in a data base of known protein conformations. Proteins, 13:258-271, 1992.
 
32
E. Sonnhamer, S. Eddy, E. Birney, A. Bateman, and R. Durbin. Pfam: multiple sequence alignments and HMM-profiles of protein domains. Nucleic Acids Res, 26(1):320-322, 1998.
 
33
M. Sternberg, P. Bates, K. A. Kelley, and R. M. MacCallum. Progress in protein structure prediction: Assessment of CASP3. Curt. Opin. Struct. Biol., 9:368-373, 1999.
 
34
E. Wolf, P. S. Kim, and B. Berger. MultiCoil: a program for predicting two and three stranded coiled coils. Protein Science, 6(6):1179-1189, June 1997.
 
35
M. D. Yoder, N. T. Keen, and F. Jurnak. New domain motif: structure of pectate lyase C, a secreted plant virulence factor. Science, 260:1503-1507, 1993.
 
36
H. Zhu and W. Braun. Sequence specificity, statistical potentials and 3D structure prediction with self-correcting distance geometry calculations of beta-sheet formation in proteins. Protein Science, 8:326-342, 1999.


Collaborative Colleagues:
Phil Bradley: colleagues
Lenore Cowen: colleagues
Matthew Menke: colleagues
Jonathan King: colleagues
Bonnie Berger: colleagues