| On de novo interpretation of tandem mass spectra for peptide identification |
| Full text |
Pdf
(201 KB)
|
| Source
|
Annual Conference on Research in Computational Molecular Biology
archive
Proceedings of the seventh annual international conference on Research in computational molecular biology
table of contents
Berlin, Germany
Pages: 9 - 18
Year of Publication: 2003
ISBN:1-58113-635-8
|
|
Authors
|
|
Vineet Bafna
|
The Center for Advancement of Genomics, Rockville, MD
|
|
Nathan Edwards
|
Informatics Research, Celera Genomics, Rockville, MD
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 11, Downloads (12 Months): 59, Citation Count: 4
|
|
|
ABSTRACT
The correct interpretation of tandem mass spectra is a difficult problem, even when it is limited to scoring peptides against a database. De novo sequencing is considerably harder, but critical when sequence databases are incomplete or not available. In this paper we build upon earlier work due to Dancik et al., and Chen et al. to provide a dynamic programming algorithm for interpreting de novo spectra. Our method can handle most of the commonly occurring ions, including a; b; y, and their neutral losses. Additionally, we shift the emphasis away from sequencing to assigning ion types to peaks. In particular, we introduce the notion of core interpretations, which allow us to give confidence values to individual peak assignments, even in the absence of a strong interpretation. Finally, we introduce a systematic approach to evaluating de novo algorithms as a function of spectral quality. We show that our algorithm, in particular the core-interpretation, is robust in the presence of measurement error, and low fragmentation probability.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
V. Bafna and N. Edwards. SCOPE: a probabilistic model for scoring tandem mass spectra against a peptide database. Bioinformatics, 17 Suppl 1:S13--21, June 2001. Appeared in Intl. Conference on Intelligent Systems for Molecular Biology.
|
| |
2
|
C. Bartels. Fast algorithm for peptide sequencing by mass spectrometry. Biomedical and Environmental Mass Spectrometry, 19:363--368, 1990.
|
| |
3
|
T. Chen, M. Y. Kao, M. Tepel, J. Rush, and G.M. Church. A dynamic programming approach to de novo peptide sequencing via tandem mass spectrometry. Journal of Computational Biology, 8(6):571--83, 2001.
|
| |
4
|
V. Dancik, T. Addona, K. Clauser, J. Vath, and P.A. Pevzner. De novo peptide sequencing via tandem mass spectrometry. Journal of Computational Biology, 6:327--342, 1999.
|
| |
5
|
J. Fernandez de Cossio, J. Gonzales, and V. Besada. Protein identification using mass spectrometric information. Comput. Appl. Biosci., 11:427--434, 1995.
|
| |
6
|
J. Eng, A. McCormack, and J. Yates. An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. Journal of American Society of Mass Spectrometry, 5:976--989, 1994.
|
| |
7
|
D. Fenyo, J. Qin, and B.T. Chait. Protein identification using mass spectrometric information. Electrophoresis, 19(6):998--1005, 1998.
|
| |
8
|
R.J. Johnson and K. Biemann. Computer program (seqpep) to aid in the interpretation of high-energy collision tandem mass spectra of peptides. Biomedical and Environmental Mass Spectrometry, 18:945--957, 1989.
|
| |
9
|
D.J. Lipman and W.R. Pearson. Rapid and sensitive protein similarity searches. Science, 227:1435--1441, 1985.
|
| |
10
|
M. Mann and M. Wilm. Error-tolerant identification of peptides in sequence databases by peptide sequence tags. Analytical Chemistry, 66:4390--4399, 1994.
|
| |
11
|
P. A. Pevzner. Computational Molecular Biology: An Algorithmic Approach. MIT Press, 2000.
|
 |
12
|
|
| |
13
|
J.A. Taylor and R.S. Johnson. Sequence database searches via de novo peptide sequencing by mass spectrometry. Rapid Communications in Mass Spectrometry, 11:1067--1075, 1997.
|
|