Skip Navigation


Bioinformatics Advance Access originally published online on May 4, 2006
Bioinformatics 2006 22(13):1641-1647; doi:10.1093/bioinformatics/btl134
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
22/13/1641    most recent
btl134v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (6)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Nie, L.
Right arrow Articles by Zhang, W.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Nie, L.
Right arrow Articles by Zhang, W.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2006. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

Integrated analysis of transcriptomic and proteomic data of Desulfovibrio vulgaris: zero-inflated Poisson regression models to predict abundance of undetected proteins

Lei Nie 1, Gang Wu 2, Fred J. Brockman 3 and Weiwen Zhang 3,*

1 Department of Biostatistics, Bioinformatics and Biomathematics, Georgetown University Washington DC 20057, USA
2 Department of Biological Sciences, University of Maryland at Baltimore County Baltimore, MD 21250, USA
3 Microbiology Department, Pacific Northwest National Laboratory PO Box 999, Mail Stop P7-50, Richland, WA 99352, USA

*To whom correspondence should be addressed.

Motivation: Integrated analysis of global scale transcriptomic and proteomic data can provide important insights into the metabolic mechanisms underlying complex biological systems. However, because the relationship between protein abundance and mRNA expression level is complicated by many cellular and physical processes, sophisticated statistical models need to be developed to capture their relationship.

Results: In this study, we describe a novel data-driven statistical model to integrate whole-genome microarray and proteomic data collected from Desulfovibrio vulgaris grown under three different conditions. Based on the Poisson distribution pattern of proteomic data and the fact that a large number of proteins were undetected (excess zeros), zero-inflated Poisson (ZIP)-based models were proposed to define the correlation pattern between mRNA and protein abundance. In addition, by assuming that there is a probability mass at zero representing unexpressed genes and expressed proteins that were undetected owing to technical limitations, a Potential ZIP model was established. Two significant improvements introduced by this approach are (1) the predicted protein abundance level values for experimentally detected proteins are corrected by considering their mRNA levels and (2) protein abundance values can be predicted for undetected proteins (in the case of this study, ~83% of the proteins in the D.vulgaris genome) for better biological interpretation. We demonstrated the use of these statistical models by comparatively analyzing proteomic and microarray results from D.vulgaris grown on lactate-based versus formate-based media. These models correctly predicted increased expression of Ech hydrogenase and decreased expression of Coo hydrogenase for D.vulgaris grown on formate.

Contact: Weiwen.Zhang{at}pnl.gov

Supplementary information: Supplementary data are available at Bioinformatics online.


Received on December 14, 2005; revised on March 31, 2006; accepted on April 1, 2006

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
MicrobiologyHome page
W. Zhang, F. Li, and L. Nie
Integrating multiple 'omics' analysis for microbial biology: application and methodologies
Microbiology, February 1, 2010; 156(2): 287 - 301.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
W. Torres-Garcia, W. Zhang, G. C. Runger, R. H. Johnson, and D. R. Meldrum
Integrative analysis of transcriptomic and proteomic data of Desulfovibrio vulgaris: a non-linear model to predict abundance of undetected proteins
Bioinformatics, August 1, 2009; 25(15): 1905 - 1914.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
L. Nie, G. Wu, and W. Zhang
Correlation of mRNA Expression and Protein Abundance Affected by Multiple Sequence Features Related to Translational Efficiency in Desulfovibrio vulgaris: A Quantitative Analysis
Genetics, December 1, 2006; 174(4): 2229 - 2243.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.