Skip Navigation



Bioinformatics Advance Access published online on August 7, 2006

Bioinformatics, doi:10.1093/bioinformatics/btl427
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrowOA All Versions of this Article:
22/20/2493    most recent
btl427v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Scheffler, K.
Right arrow Articles by Seoighe, C.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Scheffler, K.
Right arrow Articles by Seoighe, C.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2006 The Author(s)
Received June 26, 2006
Revised July 31, 2006
Accepted August 1, 2006

Article

Robust inference of positive selection from recombining coding sequences

Konrad Scheffler 1 *, Darren P. Martin 1, and Cathal Seoighe 1

1 Computational Biology Group, Institute of Infectious Disease and Molecular Medicine, University of Cape Town, Private Bag, Rondebosch 7701, South Africa

* To whom correspondence should be addressed.
Konrad Scheffler, E-mail: konrad{at}cbio.uct.ac.za


   Abstract

Motivation: Accurate detection of positive Darwinian selection can provide important insights to researchers investigating the evolution of pathogens. However, many pathogens (particularly viruses) undergo frequent recombination and the phylogenetic methods commonly applied to detect positive selection have been shown to give misleading results when applied to recombining sequences. We propose a method that makes maximum likelihood inference of positive selection robust to the presence of recombination. This is achieved by allowing tree topologies and branch lengths to change across detected recombination breakpoints. Further improvements are obtained by allowing synonymous substitution rates to vary across sites.

Results: Using simulation we show that, even for extreme cases where recombination causes standard methods to reach false positive rates above 90%, the proposed method decreases the false positive rate to acceptable levels while retaining high power. We applied the method to two HIV-1 data sets for which we have previously found that inference of positive selection is invalid due to high rates of recombination. In one of these (env gene) we still detected positive selection using the proposed method, while in the other (gag gene) we found no significant evidence of positive selection.

Availability: A HyPhy batch language implementation of the proposed methods and the HIV-1 data sets analysed are available at http://www.cbio.uct.ac.za/pub_support/bioinf06. The HyPhy package is available at http://www.hyphy.org, and it is planned that the proposed methods will be included in the next distribution. RDP2 is available at http://darwin.uvigo.es/rdp/rdp.html.


Associate Editor: Keith A Crandall
Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
J. Virol.Home page
R. A. Medina, F. Torres-Perez, H. Galeno, M. Navarrete, P. A. Vial, R. E. Palma, M. Ferres, J. A. Cook, and B. Hjelle
Ecology, Genetic Diversity, and Phylogeographic Structure of Andes Virus in Humans and Rodents in Chile
J. Virol., March 15, 2009; 83(6): 2446 - 2459.
[Abstract] [Full Text] [PDF]


Home page
J. Gen. Virol.Home page
E. van der Walt, E. P. Rybicki, A. Varsani, J. E. Polston, R. Billharz, L. Donaldson, A. L. Monjane, and D. P. Martin
Rapid host adaptation by extensive recombination
J. Gen. Virol., March 1, 2009; 90(3): 734 - 746.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. Anisimova and C. Kosiol
Investigating Protein-Coding Sequence Evolution with Probabilistic Codon Substitution Models
Mol. Biol. Evol., February 1, 2009; 26(2): 255 - 271.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
W. Delport, K. Scheffler, and C. Seoighe
Models of coding sequence evolution
Brief Bioinform, January 1, 2009; 10(1): 97 - 109.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. F. Y. Poon, F. I. Lewis, S. D. W. Frost, and S. L. Kosakovsky Pond
Spidermonkey: rapid detection of co-evolving sites using Bayesian graphical models
Bioinformatics, September 1, 2008; 24(17): 1949 - 1950.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
E. Strain, L. A. Kelley, S. Schultz-Cherry, S. V. Muse, and M. D. Koci
Genomic Analysis of Closely Related Astroviruses
J. Virol., May 15, 2008; 82(10): 5099 - 5103.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. McCauley, S. de Groot, T. Mailund, and J. Hein
Annotation of selection strengths in viral genomes
Bioinformatics, November 15, 2007; 23(22): 2978 - 2986.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.