Bioinformatics Advance Access originally published online on February 10, 2004
Bioinformatics 20(8) © Oxford University Press 2004; all rights reserved.
Probabilistic inference of molecular networks from noisy data sources
1 Department of Medical Informatics, 2 Columbia Genome Center, Columbia University, New York, NY 10032, USA, 3 Department of Computer Science, Columbia University, New York, NY 10027, USA, 4 Department of Genetics, Yale University School of Medicine, New Haven, CT 06520, USA and 5 CuraGen Corporation, New Haven, CT 06511, USA
Received on July 29, 2003; accepted on September 3, 2003
Advance Access Publication February 10, 2004
Summary: Information on molecular networks, such as networks of interacting proteins, comes from diverse sources that differ markedly in the distribution and quantity of their errors. Here, we introduce a probabilistic model useful for predicting protein interactions from heterogeneous data sources. The model describes stochastic generation of protein-protein interaction networks with real-world properties, as well as generation of two heterogeneous sources of protein-interaction information: research results automatically extracted from the literature and yeast two-hybrid experiments. Based on the domain composition of proteins, we use the model to predict protein interactions for pairs of proteins for which no experimental data are available. We further explore the prediction limits, given experimental data that cover only part of the underlying protein networks. This approach can be extended naturally to include other types of biological data sources.
Contact: iossifov{at}dbmi.columbia.edu
* To whom correspondence should be addressed.


