Bioinformatics Advance Access published online on November 14, 2006
Bioinformatics, doi:10.1093/bioinformatics/btl570
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Department of Computer and Information Science, Norwegian University of Science and Technology, NO-7052 Trondheim, Norway
* To whom correspondence should be addressed.
Motivation: Mature microRNAs (miRNAs) are processed from long hairpin transcripts. Even though it is only the first of several steps, the initial Drosha processing defines the mature product and is characteristic for all miRNA genes. Methods that can separate between true and false processing sites are therefore essential to miRNA gene discovery. Results: We present a classifier that predicts 5' Drosha processing sites in hairpins that are candidate miRNAs. The classifier, called Microprocessor SVM, correctly predicts the processing site for 50 percent of known human 5' miRNAs, and 90 percent of its predictions are within two nucleotides of the true site. Another classifier that is trained on the output from the Microprocessor SVM outperforms existing methods for prediction of unconserved miRNAs. Reanalysis of characteristics and supporting evidence for a set of newly annotated miRNAs shows that some miRNAs may be misannotated. This suggests that expressed hairpins should not be annotated as miRNAs until they are verified to be Drosha and Dicer substrates. Availability: The classifiers are publicly available at https://demo1.interagon.com/miRNA/. Supplementary information: Supplementary data is available at Bioinformatics online.
Received August 13, 2006
Revised November 8, 2006
Accepted November 8, 2006
Article
Reliable prediction of Drosha processing sites improves microRNA gene prediction
Snorre A. Helvik 1, Ola Snøve Jr. 2, and Pål Sætrom 3 *
2 Interagon AS, Laboratoriesenteret, NO-7006 Trondheim, Norway; Department of Cancer Research and Molecular Medicine, Faculty of Medicine, Norwegian University of Science and Technology, N0-7489 Trondheim, Norway
3 Department of Computer and Information Science, Norwegian University of Science and Technology, NO-7052 Trondheim, Norway; Interagon AS, Laboratoriesenteret, NO-7006 Trondheim, Norway
Pål Sætrom, E-mail: paal.saetrom{at}interagon.com
![]()
Abstract
Associate Editor: Chris Stoeckert
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
X. Tang, J. Gal, X. Zhuang, W. Wang, H. Zhu, and G. Tang A simple array platform for microRNA analysis and its application in mouse tissues RNA, October 1, 2007; 13(10): 1803 - 1822. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Jiang, H. Wu, W. Wang, W. Ma, X. Sun, and Z. Lu MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features Nucleic Acids Res., July 13, 2007; 35(suppl_2): W339 - W344. [Abstract] [Full Text] [PDF] |
||||

