Bioinformatics Advance Access originally published online on February 20, 2009
Bioinformatics 2009 25(8):989-995; doi:10.1093/bioinformatics/btp107
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
microPred: effective classification of pre-miRNAs for human miRNA gene prediction
Oxford University Computing Laboratory, University of Oxford, Wolfson Building, Parks Road, Oxford, OX1 3QD, UK
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: In this article, we show that the classification of human precursor microRNA (pre-miRNAs) hairpins from both genome pseudo hairpins and other non-coding RNAs (ncRNAs) is a common and essential requirement for both comparative and non-comparative computational recognition of human miRNA genes. However, the existing computational methods do not address this issue completely or successfully. Here we present the development of an effective classifier system (named as microPred) for this classification problem by using appropriate machine learning techniques. Our approach includes the introduction of more representative datasets, extraction of new biologically relevant features, feature selection, handling of class imbalance problem in the datasets and extensive classifier performance evaluation via systematic cross-validation methods.
Results: Our microPred classifier yielded higher and, especially, much more reliable classification results in terms of both sensitivity (90.02%) and specificity (97.28%) than the exiting pre-miRNA classification methods. When validated with 6095 non-human animal pre-miRNAs and 139 virus pre-miRNAs from miRBase, microPred resulted in 92.71% (5651/6095) and 94.24% (131/139) recognition rates, respectively.
Availability: The microPred classifier, the datasets used, and the features extracted are freely available at http://web.comlab.ox.ac.uk/people/ManoharaRukshan.Batuwita/microPred.htm.
Contact: manb{at}comlab.ox.ac.uk; vasile.palade{at}comlab.ox.ac.uk
Supplementary information: Supplementary data are available at Bioinformatics online.
Associate Editor: Dmitrij Frishman
Received on November 27, 2008; revised on February 18, 2009; accepted on February 18, 2009