Bioinformatics 20(Suppl. 1) © Oxford University Press 2004; all rights reserved.
A nucleotide substitution model with nearest-neighbour interactions
Bioinformatics group, Department of Statistics, University of Oxford, 1 South Parks Road, Oxford OX1 3TG, UK
Received on January 15, 2004; accepted on March 1, 2004
Motivation: It is well known that neighbouring nucleotides in DNA sequences do not mutate independently of each other. In this paper, we introduce a context-dependent substitution model and derive an algorithm to calculate the likelihood of sequences evolving under this model. We use this algorithm to estimate neighbour-dependent substitution rates, as well as rates for dinucleotide substitutions, using a Bayesian sampling procedure. The model is irreversible, giving an arrow to time, and allowing the position of the root between a pair of sequences to be inferred without using out-groups.
Results: We applied the model upon aligned humanmouse non-coding data. Clear neighbour dependencies were observed, including 1718-fold increased CpG to TpG/CpA rates compared with other substitutions. Root inference positioned the root halfway the mouse and human tips, suggesting an approximately clock-like behaviour of the irreversible part of the subsitution process.
Contact: lunter{at}stats.ox.ac.uk
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
B. Shapiro, A. Rambaut, O. G. Pybus, and E. C. Holmes A Phylogenetic Method for Detecting Positive Epistasis in Gene Sequences and Its Application to RNA Virus Evolution Mol. Biol. Evol., September 1, 2006; 23(9): 1724 - 1730. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Hobolth, R. Nielsen, Y. Wang, F. Wu, and S. D. Tanksley CpG + CpNpG Analysis of Protein-Coding Sequences from Tomato Mol. Biol. Evol., June 1, 2006; 23(6): 1318 - 1323. [Abstract] [Full Text] [PDF] |
||||
