Bioinformatics Advance Access originally published online on August 7, 2006
Bioinformatics 2006 22(20):2493-2499; doi:10.1093/bioinformatics/btl427
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
© 2006 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Robust inference of positive selection from recombining coding sequences
Computational Biology Group, Institute of Infectious Disease and Molecular Medicine University of Cape Town, Private Bag, Rondebosch 7701, South Africa
*To whom correspondence should be addressed.
Motivation: Accurate detection of positive Darwinian selection can provide important insights to researchers investigating the evolution of pathogens. However, many pathogens (particularly viruses) undergo frequent recombination and the phylogenetic methods commonly applied to detect positive selection have been shown to give misleading results when applied to recombining sequences. We propose a method that makes maximum likelihood inference of positive selection robust to the presence of recombination. This is achieved by allowing tree topologies and branch lengths to change across detected recombination breakpoints. Further improvements are obtained by allowing synonymous substitution rates to vary across sites.
Results: Using simulation we show that, even for extreme cases where recombination causes standard methods to reach false positive rates >90%, the proposed method decreases the false positive rate to acceptable levels while retaining high power. We applied the method to two HIV-1 datasets for which we have previously found that inference of positive selection is invalid owing to high rates of recombination. In one of these (env gene) we still detected positive selection using the proposed method, while in the other (gag gene) we found no significant evidence of positive selection.
Availability: A HyPhy batch language implementation of the proposed methods and the HIV-1 datasets analysed are available at http://www.cbio.uct.ac.za/pub_support/bioinf06. The HyPhy package is available at http://www.hyphy.org, and it is planned that the proposed methods will be included in the next distribution. RDP2 is available at http://darwin.uvigo.es/rdp/rdp.html.
Contact: konrad{at}cbio.uct.ac.za, cathal{at}science.uct.ac.za
Received on June 26, 2006; revised on July 31, 2006; accepted on August 1, 2006
This article has been cited by other articles:
![]() |
M. Arenas and D. Posada Coalescent Simulation of Intracodon Recombination Genetics, February 1, 2010; 184(2): 429 - 437. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. S. Steiger, A. E. Fidler, J. C. Mueller, and B. Kempenaers Evidence for Adaptive Evolution of Olfactory Receptor Genes in 9 Bird Species J. Hered., December 4, 2009; (2009) esp105v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Medina, F. Torres-Perez, H. Galeno, M. Navarrete, P. A. Vial, R. E. Palma, M. Ferres, J. A. Cook, and B. Hjelle Ecology, Genetic Diversity, and Phylogeographic Structure of Andes Virus in Humans and Rodents in Chile J. Virol., March 15, 2009; 83(6): 2446 - 2459. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. van der Walt, E. P. Rybicki, A. Varsani, J. E. Polston, R. Billharz, L. Donaldson, A. L. Monjane, and D. P. Martin Rapid host adaptation by extensive recombination J. Gen. Virol., March 1, 2009; 90(3): 734 - 746. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Anisimova and C. Kosiol Investigating Protein-Coding Sequence Evolution with Probabilistic Codon Substitution Models Mol. Biol. Evol., February 1, 2009; 26(2): 255 - 271. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Delport, K. Scheffler, and C. Seoighe Models of coding sequence evolution Brief Bioinform, January 1, 2009; 10(1): 97 - 109. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. F. Y. Poon, F. I. Lewis, S. D. W. Frost, and S. L. Kosakovsky Pond Spidermonkey: rapid detection of co-evolving sites using Bayesian graphical models Bioinformatics, September 1, 2008; 24(17): 1949 - 1950. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Strain, L. A. Kelley, S. Schultz-Cherry, S. V. Muse, and M. D. Koci Genomic Analysis of Closely Related Astroviruses J. Virol., May 15, 2008; 82(10): 5099 - 5103. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. McCauley, S. de Groot, T. Mailund, and J. Hein Annotation of selection strengths in viral genomes Bioinformatics, November 15, 2007; 23(22): 2978 - 2986. [Abstract] [Full Text] [PDF] |
||||






