Skip Navigation



Bioinformatics Advance Access published online on January 18, 2005

Bioinformatics, doi:10.1093/bioinformatics/bti279
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrow All Versions of this Article:
21/9/1815    most recent
bti279v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Havgaard, J. H.
Right arrow Articles by Gorodkin, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Havgaard, J. H.
Right arrow Articles by Gorodkin, J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics © Oxford University Press 2005; all rights reserved.
Received September 2, 2004
Revised January 6, 2005
Accepted January 17, 2005

Article

Pairwise local structural alignment of RNA sequences with sequence similarity less than 40%

Jakob Hull Havgaard 1, Rune Lyngsø 2, Gary D. Stormo 3, and Jan Gorodkin 1*

1 Center for Bioinformatics and Division of Genetics, IBHV, The Royal Veterinary and Agricultural University, Grønnegårdsvej 3, DK-1870 Frederiksberg C, Denmark
2 Department of Statistics, Oxford University, 1 South Parks Road, Oxford, OX1 3TG, United Kingdom
3 Department of Genetics, Washington University School of Medicine, Campus Box 8232, 4566 Scott Avenue, St. Louis, MO 63110, USA

* To whom correspondence should be addressed.
Jan Gorodkin, E-mail: gorodkin{at}bioinf.kvl.dk


   Abstract

Motivation: Searching for non-coding RNA (ncRNA) genes and structural RNA elements (eleRNA) are major challenges in todays gene finding as these often are conserved in structure rather than in sequence. Even though the number of available methods is growing, it is still of interest to pairwise detect two genes with low sequence similarity, where the genes are part of a larger genomic region.

Results: Here we present such an approach for pairwise local alignment which is based on FOLDALIGN and the Sankoff algorithm for simultaneous structural alignment of multiple sequences. We include the ability to conduct mutual scan of two sequences of arbitrary length while searching for common local structural motifs of some maximum length. This drastically reduces the complexity of the algorithm. The scoring scheme includes structural parameters corresponding to those available for free energy as well as for substitution matrices similar to RIBOSUM. The new FOLDALIGN implementation is tested on a data set where the ncRNAs and eleRNAs have sequence similarity less than 40% and where the ncRNAs and eleRNAs cannot energetically be distinguished from the surrounding genomic sequence context. The method is tested in two ways: (1) its pure ability to find the common structure between the genes only, and (2) its ability to locate ncRNAs and eleRNAs in a genomic context. In case (1) it make sense to compare to methods like Dynalign, and the performances are very similar, but FOLDALIGN is substantially faster. The structure prediction performance for a family is typically around 0.7 using Matthews correlation coefficient. In case (2) the algorithm is fairly successful in locating most of the considered RNA families with an average sensitivity at 0.8 and a positive predictive value of 0.9 using a BLAST like hit selection scheme.

Availability: The program is available online at http://foldalign.kvl.dk/.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Genome ResHome page
P. Larsson, A. Hinas, D. H. Ardell, L. A. Kirsebom, A. Virtanen, and F. Soderbom
De novo search for non-coding RNA genes in the AT-rich genome of Dictyostelium discoideum: Performance of Markov-dependent genome feature scoring
Genome Res., June 1, 2008; 18(6): 888 - 899.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Wilm, D. G. Higgins, and C. Notredame
R-Coffee: a method for multiple alignment of non-coding RNA
Nucleic Acids Res., May 1, 2008; 36(9): e52 - e52.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Lindgreen, P. P. Gardner, and A. Krogh
MASTR: multiple alignment and structure prediction of non-coding RNAs using simulated annealing
Bioinformatics, December 15, 2007; 23(24): 3304 - 3311.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
I. M. Meyer
A practical guide to the art of RNA gene prediction
Brief Bioinform, November 1, 2007; 8(6): 396 - 414.
[Abstract] [Full Text] [PDF]


Home page
RNAHome page
E. S. Andersen, A. Lind-Thomsen, B. Knudsen, S. E. Kristensen, J. H. Havgaard, E. Torarinsson, N. Larsen, C. Zwieb, P. Sestoft, J. Kjems, et al.
Semiautomated improvement of RNA alignments
RNA, November 1, 2007; 13(11): 1850 - 1859.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
S. Washietl, J. S. Pedersen, J. O. Korbel, C. Stocsits, A. R. Gruber, J. Hackermuller, J. Hertel, M. Lindemeyer, K. Reiche, A. Tanzer, et al.
Structured RNAs in the ENCODE selected regions of the human genome
Genome Res., June 1, 2007; 17(6): 852 - 864.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
E. Torarinsson, J. H. Havgaard, and J. Gorodkin
Multiple structural alignment and clustering of RNA sequences
Bioinformatics, April 15, 2007; 23(8): 926 - 932.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
E. K. Freyhult, J. P. Bollback, and P. P. Gardner
Exploring genomic dark matter: A critical assessment of the performance of homology search methods on noncoding RNA
Genome Res., January 1, 2007; 17(1): 117 - 125.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Lindgreen, P. P. Gardner, and A. Krogh
Measuring covariation in RNA alignments: physical realism improves information measures
Bioinformatics, December 15, 2006; 22(24): 2988 - 2995.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
B. Voss
Structural analysis of aligned RNAs
Nucleic Acids Res., November 14, 2006; 34(19): 5471 - 5481.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Y. Tabei, K. Tsuda, T. Kin, and K. Asai
SCARNA: fast and accurate structural alignment of RNA sequences by matching fixed-length stem fragments
Bioinformatics, July 15, 2006; 22(14): 1723 - 1729.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
E. Torarinsson, M. Sawera, J. H. Havgaard, M. Fredholm, and J. Gorodkin
Thousands of corresponding human and mouse genomic regions unalignable in primary sequence contain common RNA structure
Genome Res., July 1, 2006; 16(7): 885 - 889.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D. Dalli, A. Wilm, I. Mainz, and G. Steger
STRAL: progressive alignment of non-coding RNA using base pairing probability vectors in quadratic time
Bioinformatics, July 1, 2006; 22(13): 1593 - 1599.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Z. Yao, Z. Weinberg, and W. L. Ruzzo
CMfinder--a covariance model based RNA motif finding algorithm
Bioinformatics, February 15, 2006; 22(4): 445 - 452.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
P. Steffen, B. Voss, M. Rehmsmeier, J. Reeder, and R. Giegerich
RNAshapes: an integrated RNA analysis package based on abstract shapes
Bioinformatics, February 15, 2006; 22(4): 500 - 503.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. H. Havgaard, R. B. Lyngso, and J. Gorodkin
The FOLDALIGN web server for pairwise structural RNA alignment and mutual motif search
Nucleic Acids Res., July 1, 2005; 33(suppl_2): W650 - W653.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.