Bioinformatics Advance Access originally published online on March 16, 2006
Bioinformatics 2006 22(11):1325-1334; doi:10.1093/bioinformatics/btl094
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Combining multi-species genomic data for microRNA identification using a Naïve Bayes classifier
1 The Wistar Institute, Philadelphia PA 19104, USA
2 School of Computing, Queen's University Kingston, Ontario, Canada
*To whom correspondence should be addressed.
Motivation: Most computational methodologies for microRNA gene prediction utilize techniques based on sequence conservation and/or structural similarity. In this study we describe a new technique, which is applicable across several species, for predicting miRNA genes. This technique is based on machine learning, using the Naïve Bayes classifier. It automatically generates a model from the training data, which consists of sequence and structure information of known miRNAs from a variety of species.
Results: Our study shows that the application of machine learning techniques, along with the integration of data from multiple species is a useful and general approach for miRNA gene prediction. Based on our experiments, we believe that this new technique is applicable to an extensive range of eukaryotes' genomes. Specific structure and sequence features are first used to identify miRNAs followed by a comparative analysis to decrease the number of false positives (FPs). The resulting algorithm exhibits higher specificity and similar sensitivity compared to currently used algorithms that rely on conserved genomic regions to decrease the rate of FPs.
Availability: The BayesMiRNAfind program is available at http://wotan.wistar.upenn.edu/miRNA
Contact: showe{at}wistar.org
Supplementary information: Supplementary data are available at Bioinformatics online.
Received on December 1, 2005; revised on February 21, 2006; accepted on March 9, 2006
This article has been cited by other articles:
![]() |
A. Oulas, A. Boutla, K. Gkirtzou, M. Reczko, K. Kalantidis, and P. Poirazi Prediction of novel microRNA genes in cancer-associated genomic regions--a combined computational and experimental approach Nucleic Acids Res., June 1, 2009; 37(10): 3276 - 3287. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. D. Mendes, A. T. Freitas, and M.-F. Sagot Current tools for the identification of miRNA genes and their targets Nucleic Acids Res., May 1, 2009; 37(8): 2419 - 2433. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. J. Beveridge, P. A. Tooney, A. P. Carroll, E. Gardiner, N. Bowden, R. J. Scott, N. Tran, I. Dedova, and M. J. Cairns Dysregulation of miRNA 181b in the temporal cortex in schizophrenia Hum. Mol. Genet., April 15, 2008; 17(8): 1156 - 1168. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Terai, T. Komori, K. Asai, and T. Kin miRRim: A novel system to find conserved miRNAs with high sensitivity and specificity RNA, December 1, 2007; 13(12): 2081 - 2090. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Yousef, S. Jung, A. V. Kossenkov, L. C. Showe, and M. K. Showe Naive Bayes for microRNA target predictions machine learning for microRNA targets Bioinformatics, November 15, 2007; 23(22): 2987 - 2992. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. M. Meyer A practical guide to the art of RNA gene prediction Brief Bioinform, November 1, 2007; 8(6): 396 - 414. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Jiang, H. Wu, W. Wang, W. Ma, X. Sun, and Z. Lu MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features Nucleic Acids Res., July 13, 2007; 35(suppl_2): W339 - W344. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. L. S. Ng and S. K. Mishra De novo SVM classification of precursor microRNAs from genomic pseudo hairpins using global and intrinsic folding measures Bioinformatics, June 1, 2007; 23(11): 1321 - 1330. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. NG Kwang Loong and S. K. Mishra Unique folding of precursor microRNAs: Quantitative evidence and implications for de novo identification RNA, February 1, 2007; 13(2): 170 - 187. [Abstract] [Full Text] [PDF] |
||||




