Bioinformatics Vol. 19 no. 4 2003
Pages 490-499
© 2003 Oxford University Press
Statistical alignment based on fragment insertion and deletion models
Johann Wolfgang Goethe-Universität, Fachbereich Mathematik, Frankfurt am Main, Germany
Received on June 6, 2002
; revised on October 7, 2002
; accepted on October 10, 2002
Motivation: The topic of this paper is the estimation of alignments and mutation rates based on stochastic sequenceevolution models that allow insertions and deletions of subsequences (fragments) and not just single bases. The model we propose is a variant of a model introduced by Thorne et al., (J. Mol. Evol., 34, 316, 1992). The computational tractability of the model depends on certain restrictions in the insertion/deletion process; possible effects we discuss.
Results: The process of fragment insertion and deletion in the sequenceevolution model induces a hidden Markov structure at the level of alignments and thus makes possible efficient statistical alignment algorithms. As an example we apply a sampling procedure to assess the variability in alignment and mutation parameter estimates for HVR1 sequences of human and orangutan, improving results of previous work. Simulation studies give evidence that estimation methods based on the proposed model also give satisfactory results when applied to data for which the restrictions in the insertion/deletion process do not hold.
Availability: The source code of the software for sampling alignments and mutation rates for a pair of DNA sequences according to the fragment insertion and deletion model is freely available from http://www.math.uni-frankfurt.de/~stoch/software/mcmcsalut under the terms of the GNU public license (GPL, 2000).
Contact: dmetzler{at}math.uni-frankfurt.de
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
I. Miklos, A. Novak, R. Satija, R. Lyngso, and J. Hein Stochastic models of sequence evolution including insertion--deletion events Statistical Methods in Medical Research, October 1, 2009; 18(5): 453 - 485. [Abstract] [PDF] |
||||
![]() |
R. A. Cartwright Problems and Solutions for Estimating Indel Rates and Length Distributions Mol. Biol. Evol., February 1, 2009; 26(2): 473 - 480. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. W. Mount The Maximum Likelihood Approach for Phylogenetic Prediction CSH Protocols, April 1, 2008; 2008(5): pdb.top34 - pdb.top34. [Abstract] [Full Text] |
||||
![]() |
G. Lunter, A. Rocco, N. Mimouni, A. Heger, A. Caldeira, and J. Hein Uncertainty in homology inferences: Assessing and improving genomic sequence alignment Genome Res., February 1, 2008; 18(2): 298 - 309. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Benavides, R. Baum, D. McClellan, and J. W. Sites Molecular Phylogenetics of the Lizard Genus Microlophus (Squamata:Tropiduridae): Aligning and Retrieving Indel Signal from Nuclear Introns Syst Biol, October 1, 2007; 56(5): 776 - 797. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Kim and S. Sinha Indelign: a probabilistic framework for annotation of insertions and deletions in a multiple alignment Bioinformatics, February 1, 2007; 23(3): 289 - 297. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. H. Ogden and M. S. Rosenberg Multiple Sequence Alignment Accuracy and Phylogenetic Inference Syst Biol, April 1, 2006; 55(2): 314 - 328. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Fleissner, D. Metzler, and A. von Haeseler Simultaneous Statistical Multiple Alignment and Phylogeny Reconstruction Syst Biol, August 1, 2005; 54(4): 548 - 561. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Miklos, G. A. Lunter, and I. Holmes A "Long Indel" Model For Evolutionary Sequence Alignment Mol. Biol. Evol., March 1, 2004; 21(3): 529 - 540. [Abstract] [Full Text] [PDF] |
||||





