Bioinformatics Vol. 17 no. 90001 2001
Pages S13-S21
© 2001 Oxford University Press
SCOPE: a probabilistic model for scoring tandem mass spectra against a peptide database
1 Informatics Research, Celera Genomics, 45 W. Gude Drive, Rockville, MD, 20850, USA
Received on February 5, 2001
; revised on April 2, 2001
; accepted on April 2, 2001
Proteomics, or the direct analysis of the expressed protein components of a cell, is critical to our understanding of cellular biological processes in normal and diseased tissue. A key requirement for its success is the ability to identify proteins in complex mixtures. Recent technological advances in tandem mass spectrometry has made it the method of choice for high-throughput identification of proteins. Unfortunately, the software for unambiguously identifying peptide sequences has not kept pace with the recent hardware improvements in mass spectrometry instruments. Critical for reliable high-throughput protein identification, scoring functions evaluate the quality of a match between experimental spectra and a database peptide. Current scoring function technology relies heavily on ad-hoc parameterization and manual curation by experienced mass spectrometrists. In this work, we propose a two-stage stochastic model for the observed MS/MS spectrum, given a peptide. Our model explicitly incorporates fragment ion probabilities, noisy spectra, and instrument measurement error. We describe how to compute this probability based score efficiently, using a dynamic programming technique. A prototype implementation demonstrates the effectiveness of the model.
Contact: Vineet.Bafna{at}Celera.Com
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. Khatun, E. Hamlett, and M. C. Giddings Incorporating sequence information into the scoring function: a hidden Markov model for improved peptide identification Bioinformatics, March 1, 2008; 24(5): 674 - 681. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Shen, Z. Wang, G. Shankar, X. Zhang, and L. Li A hierarchical statistical model to assess the confidence of peptides and proteins inferred from tandem mass spectrometry Bioinformatics, January 15, 2008; 24(2): 202 - 208. [Abstract] [Full Text] [PDF] |
||||
![]() |
B.-J. M. Webb-Robertson and W. R. Cannon Current trends in computational inference from mass spectrometry-based proteomics Brief Bioinform, September 1, 2007; 8(5): 304 - 317. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. B. Weatherly, J. A. Atwood III, T. A. Minning, C. Cavola, R. L. Tarleton, and R. Orlando A Heuristic Method for Assigning a False-discovery Rate for Protein Identifications from Mascot Database Search Results Mol. Cell. Proteomics, June 1, 2005; 4(6): 762 - 772. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Giddings, A. A. Shah, R. Gesteland, and B. Moore Genome-based peptide fingerprint scanning PNAS, January 7, 2003; 100(1): 20 - 25. [Abstract] [Full Text] [PDF] |
||||



