Bioinformatics Advance Access originally published online on September 16, 2008
Bioinformatics 2008 24(23):2677-2683; doi:10.1093/bioinformatics/btn495
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Specific alignment of structured RNA: stochastic grammars and sequence annealing
1Biophysics Graduate Group, 2Department of Mathematics and 3Department of Bioengineering, University of California, Berkeley, CA 94720, USA
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: Whole-genome screens suggest that eukaryotic genomes are dense with non-coding RNAs (ncRNAs). We introduce a novel approach to RNA multiple alignment which couples a generative probabilistic model of sequence and structure with an efficient sequence annealing approach for exploring the space of multiple alignments. This leads to a new software program, Stemloc-AMA, that is both accurate and specific in the alignment of multiple related RNA sequences.
Results: When tested on the benchmark datasets BRalibase II and BRalibase 2.1, Stemloc-AMA has comparable sensitivity to and better specificity than the best competing methods. We use a large-scale random sequence experiment to show that while most alignment programs maximize sensitivity at the expense of specificity, even to the point of giving complete alignments of non-homologous sequences, Stemloc-AMA aligns only sequences with detectable homology and leaves unrelated sequences largely unaligned. Such accurate and specific alignments are crucial for comparative-genomics analysis, from inferring phylogeny to estimating substitution rates across different lineages.
Availability: Stemloc-AMA is available from http://biowiki.org/StemLocAMA as part of the dart software package for sequence analysis.
Contact: lpachter{at}math.berkeley.edu; ihh{at}berkeley.edu
Supplementary information: Supplementary data are available at Bioinformatics online.
Associate Editor: Ivo Hofacker
Received on June 20, 2008; revised on September 15, 2008; accepted on September 15, 2008
This article has been cited by other articles:
![]() |
M. Hamada, K. Sato, H. Kiryu, T. Mituyama, and K. Asai CentroidAlign: fast and accurate aligner for structured RNAs by maximizing expected sum-of-pairs score Bioinformatics, December 15, 2009; 25(24): 3236 - 3243. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Hamada, K. Sato, H. Kiryu, T. Mituyama, and K. Asai Predictions of RNA secondary structure by combining homologous sequence information Bioinformatics, June 15, 2009; 25(12): i330 - i338. [Abstract] [Full Text] [PDF] |
||||
