Background: In the process of finding the causative variant of rare diseases, accurate assessment and prioritization of genetic variants is essential. Previous variant prioritization tools mainly depend on the in-silico prediction of the pathogenicity of variants, which results in low sensitivity and difficulty in interpreting the prioritization result. In this study, we propose an explainable algorithm for variant prioritization, named 3ASC, with higher sensitivity and ability to annotate evidence used for prioritization. 3ASC annotates each variant with the 28 criteria defined by the ACMG/AMP genome interpretation guidelines and features related to the clinical interpretation of the variants. The system can explain the result based on annotated evidence and feature contributions.

Results: We trained various machine learning algorithms using in-house patient data. The performance of variant ranking was assessed using the recall rate of identifying causative variants in the top-ranked variants. The best practice model was a random forest classifier that showed top 1 recall of 85.6% and top 3 recall of 94.4%. The 3ASC annotates the ACMG/AMP criteria for each genetic variant of a patient so that clinical geneticists can interpret the result as in the CAGI6 SickKids challenge. In the challenge, 3ASC identified causal genes for 10 out of 14 patient cases, with evidence of decreased gene expression for 6 cases. Among them, two genes (HDAC8 and CASK) had decreased gene expression profiles confirmed by transcriptome data.

Conclusions: 3ASC can prioritize genetic variants with higher sensitivity compared to previous methods by integrating various features related to clinical interpretation, including features related to false positive risk such as quality control and disease inheritance pattern. The system allows interpretation of each variant based on the ACMG/AMP criteria and feature contribution assessed using explainable AI techniques.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10956189PMC
http://dx.doi.org/10.1186/s40246-024-00595-8DOI Listing

Publication Analysis

Top Keywords

genetic variants
12
prioritization genetic
8
machine learning
8
learning algorithms
8
variant prioritization
8
higher sensitivity
8
3asc annotates
8
features clinical
8
clinical interpretation
8
top recall
8

Similar Publications

Background: Aicardi-Goutières Syndrome is a monogenic type 1 interferonopathy with infantile onset, characterized by a variable degree of neurological damage. Approximately 7% of Aicardi-Goutières Syndrome cases are caused by pathogenic variants in the ADAR gene and are classified as Aicardi-Goutières Syndrome type 6. Here, we present a new homozygous pathogenic variant in the ADAR gene.

View Article and Find Full Text PDF

Background: Epistasis, the phenomenon where the effect of one gene (or variant) is masked or modified by one or more other genes, significantly contributes to the phenotypic variance of complex traits. Traditionally, epistasis has been modeled using the Cartesian epistatic model, a multiplicative approach based on standard statistical regression. However, a recent study investigating epistasis in obesity-related traits has identified potential limitations of the Cartesian epistatic model, revealing that it likely only detects a fraction of the genetic interactions occurring in natural systems.

View Article and Find Full Text PDF

Introduction: Cryptorchidism impairs sperm development and increases the risk of infertility and testicular cancer. Estrogen signalling is critical for proper descent of the testicles, and hormonal imbalances play a role in cryptorchidism. CYP19, also known as aromatase, encodes an enzyme that converts testosterone, a male sex hormone, into estradiol, the main form of estrogen.

View Article and Find Full Text PDF

Clinical Spectrum and Prognosis of Atypical Autosomal Dominant Polycystic Kidney Disease Caused by Monoallelic Pathogenic Variants of IFT140.

Am J Kidney Dis

December 2024

Service de Néphrologie, Hémodialyse et Transplantation Rénale, Centre de référence MARHEA, CHRU Brest, Brest, France; Institut de Recherche Expérimentale et Clinique (IREC), UCLouvain, Brussels, Belgium. Electronic address:

Rationale & Objective: Monoallelic predicted Loss-of-Function (pLoF) variants in IFT140 have recently been associated with an autosomal dominant polycystic kidney disease (ADPKD)-like phenotype. This study sought to enhance the characterization of this phenotype.

Study Design: Case series.

View Article and Find Full Text PDF

Loss-of-function SLC25A20 mutation causes carnitine-acylcarnitine translocase deficiency by reducing SLC25A20 protein stability.

Gene

December 2024

Department of Medical Genetics/Experimental Education/Administration Center, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China; Guangdong Provincial Key Laboratory of Single Cell Technology and Application, Guangzhou 510515, China; Department of Fetal Medicine and Prenatal Diagnosis, Zhujiang Hospital, Southern Medical University, Guangzhou 510280, China. Electronic address:

Background/aim: Autosomal-recessive carnitine-acylcarnitine translocase deficiency (CACTD) is a rare disorder of long-chain fatty acid oxidation caused by variants in the SLC25A20 gene. Under fasting conditions, most newborns with severe CACTD experience sudden cardiac arrest and hypotonia, often leading to premature death due to rapid disease progression. Understanding of genetic factors and pathogenic mechanisms in CACTD is essential for its diagnosis, treatment, and prevention.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!