Bioinformatics, Vol 14, 608-616, Copyright © 1998 by Oxford University Press
PH Sneath
MOTIVATION: A modified Sherman statistic can be used to test whether the
differences between two aligned sequences are distributed at random along
the sequences, or whether they are clustered, which suggests anomalies of
evolution such as partial gene recombination or functional constraints. The
presence of evenly spaced constant sites (such as constancy at the second
codon position in genes coding for proteins) lowers the statistic and makes
the significance less than it should be. RESULTS: The magnitude of the
constant-site effect is shown by simulation to depend mainly on the
proportion of differences between two sequences and on the number of
constant sites that are added after each variable site. This latter number
can be estimated from the variance of sites in a sequence matrix at the
first, second and third codon positions, to obtain a ratio that corrects
the statistic. When expressed as standard errors, the uncorrected results
are too low (typically half to one unit when almost all the variation is at
the third codon position). Correction raises the standard errors to levels
close to expectation. If the data show no marked ternary periodicity, the
correction is very small. The method is illustrated with biological data
that show close to random behaviour, and with data that exhibit strong
clustering. AVAILABILITY: The software is available from the author and has
also been placed on the EMBL file server (Software@embl- ebi.ac.uk).
CONTACT: phas1@le.ac.uk
ARTICLES
The effect of evenly spaced constant sites on the distribution of the random division of a molecular sequence
Department of Microbiology and Immunology, Leicester University, Leicester LE1 9HN, UK. phas1@le.ac.uk
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
M. F. Boni, D. Posada, and M. W. Feldman An Exact Nonparametric Method for Inferring Mosaic Structure in Sequence Triplets Genetics, June 1, 2007; 176(2): 1035 - 1047. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Posada Evaluation of Methods for Detecting Recombination from DNA Sequences: Empirical Data Mol. Biol. Evol., May 1, 2002; 19(5): 708 - 717. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Posada and K. A. Crandall Evaluation of methods for detecting recombination from DNA sequences: Computer simulations PNAS, November 20, 2001; 98(24): 13757 - 13762. [Abstract] [Full Text] [PDF] |
||||


