Bioinformatics Advance Access published online on March 16, 2006
Bioinformatics, doi:10.1093/bioinformatics/btl094
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 The Wistar Institute, Philadelphia, PA 19104, USA
* To whom correspondence should be addressed.
Motivation: Numerous computational methodologies utilize techniques based on sequence conservation and/or structural similarity for microRNA gene prediction. In this study we describe a new technique, which is applicable across several species, for predicting microRNA genes. This technique is based on machine learning, using the Naïve Bayes classifier. This computational procedure automatically generates a model from the input or training data, which is the sequence and structure of known microRNAs from a variety of species. Results: This study shows that the application of machine learning techniques, along with the integration of data from multiple species is a useful and general approach for microRNA gene prediction. Based on our experiments, we believe that this new technique is applicable to an extensive range of eukaryotes' genomes. Specific structure and sequence features are first used to identify microRNAs followed by a comparative analysis to decrease the number of false positives. The resulting algorithm exhibits higher specificity and similar sensitivity compared to currently used algorithms, which utilize low sensitivity and conserved genomic regions to decrease the rate of false positives. Availability: The BayesMiRNAfind program is available at http://wotan.wistar.upenn.edu/miRNA. Supplementary information:
Received December 1, 2005
Revised February 21, 2006
Accepted March 9, 2006
Article
Combining multi-species genomic data for microRNA identification using a Naïve Bayes classifier machine learning for identification of microRNA genes
Malik Yousef 1,
Michael Nebozhyn 1,
Hagit Shatkay 2,
Stathis Kanterakis 1,
Louise C. Showe 1,
and
Michael K. Showe 1 *
2 School of Computing, Queen's University, Kingston, Ontario
Michael K. Showe, E-mail: showe{at}wistar.org
![]()
Abstract
Associate Editor: Keith A Crandall
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
N. J. Beveridge, P. A. Tooney, A. P. Carroll, E. Gardiner, N. Bowden, R. J. Scott, N. Tran, I. Dedova, and M. J. Cairns Dysregulation of miRNA 181b in the temporal cortex in schizophrenia Hum. Mol. Genet., April 15, 2008; 17(8): 1156 - 1168. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Terai, T. Komori, K. Asai, and T. Kin miRRim: A novel system to find conserved miRNAs with high sensitivity and specificity RNA, December 1, 2007; 13(12): 2081 - 2090. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Yousef, S. Jung, A. V. Kossenkov, L. C. Showe, and M. K. Showe Naive Bayes for microRNA target predictions machine learning for microRNA targets Bioinformatics, November 15, 2007; 23(22): 2987 - 2992. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. M. Meyer A practical guide to the art of RNA gene prediction Brief Bioinform, November 1, 2007; 8(6): 396 - 414. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Jiang, H. Wu, W. Wang, W. Ma, X. Sun, and Z. Lu MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features Nucleic Acids Res., July 13, 2007; 35(suppl_2): W339 - W344. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. L. S. Ng and S. K. Mishra De novo SVM classification of precursor microRNAs from genomic pseudo hairpins using global and intrinsic folding measures Bioinformatics, June 1, 2007; 23(11): 1321 - 1330. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. NG Kwang Loong and S. K. Mishra Unique folding of precursor microRNAs: Quantitative evidence and implications for de novo identification RNA, February 1, 2007; 13(2): 170 - 187. [Abstract] [Full Text] [PDF] |
||||




