Bioinformatics Advance Access originally published online on June 30, 2005
Bioinformatics 2005 21(17):3482-3489; doi:10.1093/bioinformatics/bti564
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
The inference of proteinprotein interactions by co-evolutionary analysis is improved by excluding the information about the phylogenetic relationships
1Bioinformatics Center, Institute for Chemical Research, Kyoto University Gokasho, Uji, Kyoto 611-0011, Japan
2Centre de Géostatistique Ecole des Mines de Paris, 35 rue Saint-Honoré, 77305 Fontainebleau cedex, France
3Division of Bioinformatics, Medical Institute of Bioregulation, Kyushu University Fukuoka, Fukuoka 812-8582, Japan
*To whom correspondence should be addressed.
Motivation: The prediction of proteinprotein interactions is currently an important issue in bioinformatics. The mirror tree method uses evolutionary information to predict proteinprotein interactions. However, it has been recognized that predictions by the mirror tree method lead to many false positives. The incentive of our study was to solve this problem by improving the method of extracting the co-evolutionary information regarding the protein pairs.
Results: We developed a novel method to predict proteinprotein interactions from co-evolutionary information in the framework of the mirror tree method. The originality is the use of the projection operator to exclude the information about the phylogenetic relationships among the source organisms from the distance matrix. Each distance matrix was transformed into a vector for the operation. The vector is referred to as a phylogenetic vector. We have proposed three ways to extract the phylogenetic information: (1) using the 16S rRNA from the same source organisms as the proteins under consideration, (2) averaging the phylogenetic vectors and (3) analyzing the principal components of the phylogenetic vectors. We examined the performance of the proposed methods to predict interacting protein pairs from Escherichia coli, using experimentally verified data. Our method was successful, and it drastically reduced the number of false positives in the prediction.
Availability: The R script for the prediction of proteinprotein interactions reported in this manuscript is available at http://timpani.genome.ad.jp/~proj/
Contact: sato{at}kuicr.kyoto-u.ac.jp
Supplementary information: The information is also available at the same site as the R script.
Received on April 23, 2005; revised on June 23, 2005; accepted on June 28, 2005
This article has been cited by other articles:
![]() |
E. R.M. Tillier and R. L. Charlebois The human protein coevolution network Genome Res., October 1, 2009; 19(10): 1861 - 1871. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. R Kensche, V. van Noort, B. E Dutilh, and M. A Huynen Practical and theoretical advances in predicting the function of a protein by its phylogenetic distribution J R Soc Interface, February 6, 2008; 5(19): 151 - 170. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Juan, F. Pazos, and A. Valencia High-confidence prediction of global interactomes based on genome-wide coevolutionary networks PNAS, January 22, 2008; 105(3): 934 - 939. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. G. Kann Protein interactions and disease: computational approaches to uncover the etiology of diseases Brief Bioinform, September 1, 2007; 8(5): 333 - 346. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Hakes, S. C. Lovell, S. G. Oliver, and D. L. Robertson Specificity in protein interactions and its relationship with sequence diversity and coevolution PNAS, May 8, 2007; 104(19): 7999 - 8004. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Sato, Y. Yamanishi, K. Horimoto, M. Kanehisa, and H. Toh Partial correlation coefficient between distance matrices as a new indicator of protein-protein interactions Bioinformatics, October 15, 2006; 22(20): 2488 - 2492. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. G. Izarzugaza, D. Juan, C. Pons, J. A. G. Ranea, A. Valencia, and F. Pazos TSEMA: interactive prediction of protein pairings between interacting families. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W315 - W319. [Abstract] [Full Text] [PDF] |
||||





