Extremely-randomized-tree-based Prediction of N-Methyladenosine Sites in .

Curr Genomics

1HotSpot Therapeutics, 50 Milk Street, 16 Floor, Boston, MA02109, USA; 2Research and Development Center, In-silicogen Inc., Yongin-si 16954, Gyeonggi-do, Republic of Korea; 3Department of Biotechnology, Dr. N.G.P. Arts and Science College, Coimbatore, Tamil Nadu641048, India; 4Department of Physiology, Ajou University School of Medicine, Suwon, Republic of Korea.

Published: January 2020

Introduction: N-methyladenosine (m6A) is one of the most common post-transcriptional modifications in RNA, which has been related to several biological processes. The accurate prediction of m6A sites from RNA sequences is one of the challenging tasks in computational biology. Several computational methods utilizing machine-learning algorithms have been proposed that accelerate screening of m6A sites, thereby drastically reducing the experimental time and labor costs involved.

Methodology: In this study, we proposed a novel computational predictor termed ERT-m6Apred, for the accurate prediction of m6A sites. To identify the feature encodings with more discriminative capability, we applied a two-step feature selection technique on seven different feature encodings and identified the corresponding optimal feature set.

Results: Subsequently, performance comparison of the corresponding optimal feature set-based extremely randomized tree model revealed that Pseudo k-tuple composition encoding, which includes 14 physicochemical properties significantly outperformed other encodings. Moreover, ERT-m6Apred achieved an accuracy of 78.84% during cross-validation analysis, which is comparatively better than recently reported predictors.

Conclusion: In summary, ERT-m6Apred predicts m6A sites with higher accuracy, thus facilitating biological hypothesis generation and experimental validations.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324895PMC
http://dx.doi.org/10.2174/1389202921666200219125625DOI Listing

Publication Analysis

Top Keywords

m6a sites
16
accurate prediction
8
prediction m6a
8
feature encodings
8
corresponding optimal
8
optimal feature
8
sites
5
m6a
5
feature
5
extremely-randomized-tree-based prediction
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!