Bioinformatics Advance Access published online on November 27, 2007
Bioinformatics, doi:10.1093/bioinformatics/btm576
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
An assessment of the uses of homologous interactions
Department of Statistics, University of Oxford, 1 South Parks Road, Oxford OX1 3TG, UK
*To whom correspondence should be addressed. Charlotte Deane, E-mail: deane{at}stats.ox.ac.uk
| Abstract |
|---|
Motivation: Protein-protein interactions have proved to be a valuable starting point for understanding the inner workings of the cell. Computational methodologies have been built which both predict interactions and use interaction datasets in order to predict other protein features. Such methods require gold standard positive (GSP) and negative (GSN) interaction sets. Here we examine and demonstrate the usefulness of homologous interactions in predicting good quality positive and negative interaction datasets.
Results: We generate GSP interaction sets as subsets from experimental data using only interaction and sequence information. We can therefore produce sets for several species (many of which at present have no identified GSPs). Comprehensive error rate testing demonstrates the power of the method. We also show how the use of our datasets significantly improves the predictive power of algorithms for interaction prediction and function prediction.
Furthermore we generate GSN interaction sets for yeast and examine the use of homology along with other protein properties such as localisation, expression and function. Using a novel method to assess the accuracy of a negative interaction set we find that the best single selector for negative interactions is a lack of co-function. However, an integrated method using all the characteristics shows significant improvement over any current method for identifying GSN interactions. The nature of homologous interactions is also examined and we demonstrate that interologs are found more commonly within species than across species.
Conclusion: GSP sets built using our homologous verification method are demonstrably better than standard sets in terms of predictive ability. We can build such GSP sets for several species. When generating GSNs we show a combination of protein features and lack of homologous interactions gives the highest quality interaction sets.
Availability: GSP and GSN datasets for all the studied species can be downloaded from (http://www.stats.ox.ac.uk/~deane/HPIV).
Contact: deane{at}stats.ox.ac.uk
Associate Editor: Dr. Limsoon Wong
Received on August 24, 2007; revised on October 26, 2007; accepted on November 17, 2007
This article has been cited by other articles:
![]() |
W. Ali and C. M. Deane Functionally guided alignment of protein interaction networks for module detection Bioinformatics, December 1, 2009; 25(23): 3166 - 3173. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-C. Chen, C.-Y. Lin, Y.-S. Lo, and J.-M. Yang PPISearch: a web server for searching homologous protein-protein interactions across multiple species Nucleic Acids Res., July 1, 2009; 37(suppl_2): W369 - W375. [Abstract] [Full Text] [PDF] |
||||

