A genetic algorithm-based weighted ensemble method for predicting transposon-derived piRNAs.

BMC Bioinformatics

State Key Lab of Software Engineering, Wuhan University, Wuhan, 430072, China.

Published: August 2016

Background: Predicting piwi-interacting RNA (piRNA) is an important topic in the small non-coding RNAs, which provides clues for understanding the generation mechanism of gamete. To the best of our knowledge, several machine learning approaches have been proposed for the piRNA prediction, but there is still room for improvements.

Results: In this paper, we develop a genetic algorithm-based weighted ensemble method for predicting transposon-derived piRNAs. We construct datasets for three species: Human, Mouse and Drosophila. For each species, we compile the balanced dataset and imbalanced dataset, and thus obtain six datasets to build and evaluate prediction models. In the computational experiments, the genetic algorithm-based weighted ensemble method achieves 10-fold cross validation AUC of 0.932, 0.937 and 0.995 on the balanced Human dataset, Mouse dataset and Drosophila dataset, respectively, and achieves AUC of 0.935, 0.939 and 0.996 on the imbalanced datasets of three species. Further, we use the prediction models trained on the Mouse dataset to identify piRNAs of other species, and the models demonstrate the good performances in the cross-species prediction.

Conclusions: Compared with other state-of-the-art methods, our method can lead to better performances. In conclusion, the proposed method is promising for the transposon-derived piRNA prediction. The source codes and datasets are available in https://github.com/zw9977129/piRNAPredictor .

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5006569PMC
http://dx.doi.org/10.1186/s12859-016-1206-3DOI Listing

Publication Analysis

Top Keywords

genetic algorithm-based
12
algorithm-based weighted
12
weighted ensemble
12
ensemble method
12
method predicting
8
predicting transposon-derived
8
transposon-derived pirnas
8
pirna prediction
8
datasets three
8
three species
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!