Skip Navigation



Bioinformatics Advance Access published online on March 24, 2009

Bioinformatics, doi:10.1093/bioinformatics/btp168
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrow Supplementary Data
Right arrowOA All Versions of this Article:
25/11/1397    most recent
btp168v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Ramakrishnan, S. R.
Right arrow Articles by Miranker, D. P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ramakrishnan, S. R.
Right arrow Articles by Miranker, D. P.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

©2009 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Integrating Shotgun Proteomics and mRNA expression data to Improve Protein Identification

Smriti R. Ramakrishnan 1,*, Christine Vogel 2,*, John T. Prince 2, Zhihua Li 2, Luiz O. Penalva 3, Margaret Myers 1, Edward M. Marcotte 2,** and Daniel P. Miranker 1,**

1 Department of Computer Sciences, 1 University Station C0500, The University of Texas at Austin, Austin, TX 78712
2 Center for Systems and Synthetic Biology, Department of Chemistry and Biochemistry & Institute for Cellular and Molecu-lar Biology, 2500 Speedway, The University of Texas at Austin, Austin, TX 78712
3 Children's Cancer Research Institute; The University of Texas Health Science Center at San Antonio; San Antonio, TX 78229

**To whom correspondence should be addressed., E-mail: marcotte{at}icmb.utexas.edu, miranker{at}cs.utexas.edu


   Abstract

Motivation: Tandem mass spectrometry (MS/MS) offers fast and reliable characterization of complex protein mixtures, but suffers from low sensitivity in protein identification. In a typical shotgun-proteomics experiment, it is assumed that all proteins are equally likely to be present. However, there is often other information avail-able, e.g. the probability of a protein's presence is likely to correlate with its mRNA concentration.

Results: We develop a Bayesian score that estimates the posterior probability of a protein's presence in the sample given its identifica-tion in an MS/MS experiment and its mRNA concentration measured under similar experimental conditions. Our method, MSpresso, sub-stantially increases the number of proteins identified in an MS/MS experiment at the same error rate, e.g. in yeast, MSpresso in-creases the number of proteins identified by ~40%. We apply MSpresso to data from different MS/MS instruments, experimental conditions, and organisms (E.coli, human), and predict 19 to 63% more proteins across the different datasets. MSpresso demonstrates that incorporating prior knowledge of protein presence into shotgun-proteomics experiments can substantially improve protein identifica-tion scores.

Availability and Implementation: Software is available upon re-quest from the authors. Mass spectrometry datasets are available from http://marcottelab.org/MSdata/.

Contact: marcotte{at}icmb.utexas.edu, miranker{at}cs.utexas.edu

Supplementary Information: Supplementary data website: http://www.marcottelab.org/MSpresso/.

Associate Editor: Prof. Martin Bishop

*Equally contributing authors.


Received on December 24, 2008; revised on February 19, 2009; accepted on March 18, 2009

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
S. R. Ramakrishnan, C. Vogel, T. Kwon, L. O. Penalva, E. M. Marcotte, and D. P. Miranker
Mining gene functional networks to improve mass-spectrometry-based protein identification
Bioinformatics, November 15, 2009; 25(22): 2955 - 2961.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
M. F. Ochs
Knowledge-based data analysis comes of age
Brief Bioinform, October 23, 2009; (2009) bbp044v1.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.