An Ensemble Approach to Predict the Pathogenicity of Synonymous Variants.

Genes (Basel)

Department of Healthcare Genetics, Clemson University, Clemson, SC 29634, USA.

Published: September 2020

Single-nucleotide variants (SNVs) are a major form of genetic variation in the human genome that contribute to various disorders. There are two types of SNVs, namely non-synonymous (missense) variants (nsSNVs) and synonymous variants (sSNVs), predominantly involved in RNA processing or gene regulation. sSNVs, unlike missense or nsSNVs, do not alter the amino acid sequences, thereby making challenging candidates for downstream functional studies. Numerous computational methods have been developed to evaluate the clinical impact of nsSNVs, but very few methods are available for understanding the effects of sSNVs. For this analysis, we have downloaded sSNVs from the ClinVar database with various features such as conservation, DNA-RNA, and splicing properties. We performed feature selection and implemented an ensemble random forest (RF) classification algorithm to build a classifier to predict the pathogenicity of the sSNVs. We demonstrate that the ensemble predictor with selected features (20 features) enhances the classification of sSNVs into two categories, pathogenic and benign, with high accuracy (87%), precision (79%), and recall (91%). Furthermore, we used this prediction model to reclassify sSNVs with unknown clinical significance. Finally, the method is very robust and can be used to predict the effect of other unknown sSNVs.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7565489PMC
http://dx.doi.org/10.3390/genes11091102DOI Listing

Publication Analysis

Top Keywords

predict pathogenicity
8
synonymous variants
8
ssnvs
8
ensemble approach
4
approach predict
4
pathogenicity synonymous
4
variants
4
variants single-nucleotide
4
single-nucleotide variants
4
variants snvs
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!