Bioinformatics Advance Access published online on October 23, 2006
Bioinformatics, doi:10.1093/bioinformatics/btl541
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Department of Chemical and Biomolecular Engineering, University of California, Los Angeles, CA 90095, USA
* To whom correspondence should be addressed.
Motivation: Data from DNA microarrays and ChIP-chip binding assays often form the basis of transcriptional regulatory analyses. However, experimental noise in both data types combined with environmental dependence and uncorrelation between binding and regulation in ChIP-chip binding data complicate analyses that utilize these complimentary data sources. Therefore, to minimize the impact of these inaccuracies on transcription analyses it is desirable to identify instances of gene expression-ChIP-chip agreement, under the premise that inaccuracies are less likely to be present when separate data sources corroborate each other. Current methods for such identification either make key assumptions that limit their applicability, and/or yield high false positive and false negative rates. The goal of this work was to develop a method with a minimal amount of assumptions, and thus widely applicable, that can identify agreement between gene expression and ChIP-chip data at a higher confidence level than current methods. Results: We demonstrate in Saccharomyces cerevisiae that currently available ChIP-chip binding data explains microarray data from a variety of environments only as well as randomized networks with the same connectivity density. This suggests a high degree of inconsistency between the two data types, and illustrates the need for a method that can identify consistency between the two data sources. Here we have developed a Gibbs sampling technique to identify genes whose expression and ChIP-chip binding data are mutually consistent. Compared to current methods that could perform the same task, the Gibbs sampling method developed here exceeds their ability at high levels (>50%) of transcription network and gene expression error, while performing similarly at lower levels. Using this technique, we show that on average 73% more gene expression features can be captured per gene as compared to the unfiltered use of gene expression and ChIP-chip derived network connectivity data. Availability: Our algorithm is available on request from the authors, and soon to be posted on the web. See author's homepage for details, http://www.seas.ucla.edu/~liaoj/. Supplementary Information: at Bioinformatics online.
Received August 31, 2006
Revised October 5, 2006
Accepted October 17, 2006
Article
A Gibbs sampler for the identification of gene expression and network connectivity consistency
Mark P. Brynildsen 1, Linh M. Tran 1, and James C. Liao 1 *
James C. Liao, E-mail: liaoj{at}ucla.edu
![]()
Abstract
Associate Editor: Martin Bishop
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
M. P. Brynildsen, T.-Y. Wu, S.-S. Jang, and J. C. Liao Biological network mapping and source signal deduction Bioinformatics, July 15, 2007; 23(14): 1783 - 1791. [Abstract] [Full Text] [PDF] |
||||
