A computational approach toward label-free protein quantification using predicted peptide detectability
1 School of Informatics, Indiana University Bloomington, IN, USA
2 Center for Genomics and Bioinformatics, Indiana University Bloomington, IN, USA
3 National Center for Glycomics and Glycoproteomics, Indiana University Bloomington, IN, USA
4 Department of Chemistry, Indiana University Bloomington, IN, USA
*To whom correspondence should be addressed at School of Informatics, Indiana University, 901 East 10th Street, Bloomington, IN 47408, USA
Summary: We propose here a new concept of peptide detectability which could be an important factor in explaining the relationship between a protein's quantity and the peptides identified from it in a high-throughput proteomics experiment. We define peptide detectability as the probability of observing a peptide in a standard sample analyzed by a standard proteomics routine and argue that it is an intrinsic property of the peptide sequence and neighboring regions in the parent protein. To test this hypothesis we first used publicly available data and data from our own synthetic samples in which quantities of model proteins were controlled. We then applied machine learning approaches to demonstrate that peptide detectability can be predicted from its sequence and the neighboring regions in the parent protein with satisfactory accuracy. The utility of this approach for protein quantification is demonstrated by peptides with higher detectability generally being identified at lower concentrations over those with lower detectability in the synthetic protein mixtures. These results establish a direct link between protein concentration and peptide detectability. We show that for each protein there exists a level of peptide detectability above which peptides are detected and below which peptides are not detected in an experiment. We call this level the minimum acceptable detectability for identified peptides (MDIP) which can be calibrated to predict protein concentration. Triplicate analysis of a biological sample showed that these MDIP values are consistent among the three data sets.
Contact: predrag{at}indiana.edu
This article has been cited by other articles:
![]() |
V. Lange, J. A. Malmstrom, J. Didion, N. L. King, B. P. Johansson, J. Schafer, J. Rameseder, C.-H. Wong, E. W. Deutsch, M.-Y. Brusniak, et al. Targeted Quantitative Analysis of Streptococcus pyogenes Virulence Factors by Multiple Reaction Monitoring Mol. Cell. Proteomics, August 1, 2008; 7(8): 1489 - 1500. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Gupta, J. Benhamida, V. Bhargava, D. Goodman, E. Kain, I. Kerman, N. Nguyen, N. Ollikainen, J. Rodriguez, J. Wang, et al. Comparative proteogenomics: Combining mass spectrometry and comparative genomics to analyze multiple genomes Genome Res., July 1, 2008; 18(7): 1133 - 1142. [Abstract] [Full Text] [PDF] |
||||
![]() |
B.-J. M. Webb-Robertson, W. R. Cannon, C. S. Oehmen, A. R. Shah, V. Gurumoorthi, M. S. Lipton, and K. M. Waters A support vector machine model for the prediction of proteotypic peptides for accurate mass and time proteomics Bioinformatics, July 1, 2008; 24(13): 1503 - 1509. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Shen, Z. Wang, G. Shankar, X. Zhang, and L. Li A hierarchical statistical model to assess the confidence of peptides and proteins inferred from tandem mass spectrometry Bioinformatics, January 15, 2008; 24(2): 202 - 208. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Alves, R. J. Arnold, D. E. Clemmer, Y. Li, J. P. Reilly, Q. Sheng, H. Tang, Z. Xun, R. Zeng, and P. Radivojac Fast and accurate identification of semi-tryptic peptides in shotgun proteomics Bioinformatics, January 1, 2008; 24(1): 102 - 109. [Abstract] [Full Text] [PDF] |
||||
![]() |
B.-J. M. Webb-Robertson and W. R. Cannon Current trends in computational inference from mass spectrometry-based proteomics Brief Bioinform, September 1, 2007; 8(5): 304 - 317. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Gupta, S. Tanner, N. Jaitly, J. N. Adkins, M. Lipton, R. Edwards, M. Romine, A. Osterman, V. Bafna, R. D. Smith, et al. Whole proteome analysis of post-translational modifications: Applications of mass-spectrometry for proteogenomic annotation Genome Res., September 1, 2007; 17(9): 1362 - 1377. [Abstract] [Full Text] [PDF] |
||||



