Bioinformatics Advance Access originally published online on June 28, 2007
Bioinformatics 2007 23(17):2322-2330; doi:10.1093/bioinformatics/btm332
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Context-sensitive data integration and prediction of biological networks
1Department of Computer Science, Princeton University, 35 Olden Street and 2Lewis-Sigler Institute for Integrative Genomics, Princeton University, Carl Icahn Laboratory, Princeton, NJ, USA
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: Several recent methods have addressed the problem of heterogeneous data integration and network prediction by modeling the noise inherent in high-throughput genomic datasets, which can dramatically improve specificity and sensitivity and allow the robust integration of datasets with heterogeneous properties.
However, experimental technologies capture different biological processes with varying degrees of success, and thus, each source of genomic data can vary in relevance depending on the biological process one is interested in predicting. Accounting for this variation can significantly improve network prediction, but to our knowledge, no previous approaches have explicitly leveraged this critical information about biological context.
Results: We confirm the presence of context-dependent variation in functional genomic data and propose a Bayesian approach for context-sensitive integration and query-based recovery of biological process-specific networks. By applying this method to Saccharomyces cerevisiae, we demonstrate that leveraging contextual information can significantly improve the precision of network predictions, including assignment for uncharacterized genes. We expect that this general context-sensitive approach can be applied to other organisms and prediction scenarios.
Availability: A software implementation of our approach is available on request from the authors.
Contact: ogt{at}genomics.princeton.edu
Supplementary information: Supplementary data are available at http://avis.princeton.edu/contextPIXIE/
Associate Editor: Martin Bishop
Received on April 10, 2007; revised on June 1, 2007; accepted on June 18, 2007
This article has been cited by other articles:
![]() |
C. Huttenhower, M. A. Hibbs, C. L. Myers, A. A. Caudy, D. C. Hess, and O. G. Troyanskaya The impact of incomplete knowledge on evaluation: an experimental benchmark for protein function prediction Bioinformatics, September 15, 2009; 25(18): 2404 - 2410. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Huttenhower, E. M. Haley, M. A. Hibbs, V. Dumeaux, D. R. Barrett, H. A. Coller, and O. G. Troyanskaya Exploring the human genome with functional maps Genome Res., June 1, 2009; 19(6): 1093 - 1106. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Y. Yip and M. Gerstein Training set expansion: an approach to improving the reconstruction of biological networks from limited and uneven reliable interactions Bioinformatics, January 15, 2009; 25(2): 243 - 250. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Huttenhower, M. Schroeder, M. D Chikina, and O. G. Troyanskaya The Sleipnir library for computational functional genomics Bioinformatics, July 1, 2008; 24(13): 1559 - 1561. [Abstract] [Full Text] [PDF] |
||||
![]() |
T.A. Knijnenburg, L.F.A. Wessels, and M.J.T. Reinders Combinatorial influence of environmental parameters on transcription factor activity Bioinformatics, July 1, 2008; 24(13): i172 - i181. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Huttenhower and O.G. Troyanskaya Assessing the functional structure of genomic data Bioinformatics, July 1, 2008; 24(13): i330 - i338. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Ideker and R. Sharan Protein networks in disease Genome Res., April 1, 2008; 18(4): 644 - 652. [Abstract] [Full Text] [PDF] |
||||

