Amyotrophic lateral sclerosis (ALS) is a severe motor neuron disease, with most sporadic cases lacking clear genetic causes. Abnormal pre-mRNA splicing is a fundamental mechanism in neurodegenerative diseases. For example, TAR DNA-binding protein 43 (TDP-43) loss-of-function (LOF) causes widespread RNA mis-splicing events in ALS. Additionally, splicing mutations are major contributors to neurological disorders. However, the role of intronic variants driving RNA mis-splicing in ALS remains poorly understood. To address this, we developed Spliformer to predict RNA splicing. Spliformer is a transformer-based deep learning model trained and tested on splicing events from the GENCODE database, as well as RNA-seq data from blood and central nervous system tissues. We benchmarked Spliformer against SpliceAI and Pangolin using testing datasets and paired whole-genome sequencing (WGS) with RNA-seq data. We further developed the Spliformer-motif model to identify splicing regulatory motifs. We analyzed Clinvar dataset to identify the link of splicing variants with disease pathogenicity. Additionally, we analyzed WGS data of ALS patients and controls to identify common intronic splicing variants linked to ALS risk or disease phenotypes. We also profiled rare intronic splicing variants in ALS patients to identify known or novel ALS-associated genes. Minigene assays were employed to validate candidate splicing variants. Finally, we measured spine density in neurons with a specific gene knockdown or those expressing a TDP-43 disease-causing mutant. Spliformer accurately predicts the possibilities of a nucleotide within a pre-mRNA sequence being a splice donor, acceptor, or neither. Spliformer outperformed SpliceAI and Pangolin in both speed and accuracy in tested splicing events and/or paired WGS/RNA-seq data. Spliformer-motif successfully identified canonical and novel splicing regulatory motifs. In Clinvar dataset, splicing variants are highly related to disease pathogenicity. Genome-wide analyses of common intronic splicing variants nominated one variant linked to ALS progression. Deep learning analyses of WGS data from 1,370 ALS patients revealed rare splicing variants in reported ALS genes (such as PTPRN2 and CFAP410, validated through minigene assays and RNA-seq), and TDP-43 LOF related RNA mis-splicing genes (such as PTPRD). Further genetic analysis and minigene assays nominated PCP4 and TMEM63A as ALS-associated genes. Functional assays demonstrated that PCP4 is critical for maintaining spine density and can rescue spine loss in neurons expressing a disease-causing TDP-43 mutant. In summary, we developed Spliformer and Spliformer-motif that accurately predict and interpret pre-mRNA splicing. Our findings highlight an intronic genetic mechanism driving RNA mis-splicing in ALS and nominate PCP4 as an ALS-associated gene.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1093/brain/awaf025 | DOI Listing |
Viruses
January 2025
Center for Retrovirus Research, Department of Veterinary Biosciences, The Ohio State University, Columbus, OH 43210, USA.
Since the discovery of RNA in the early 1900s, scientific understanding of RNA form and function has evolved beyond protein coding. Viruses, particularly retroviruses like human T-cell leukemia virus type 1 (HTLV-1), rely heavily on RNA and RNA post-transcriptional modifications to regulate the viral lifecycle, pathogenesis, and evasion of host immune responses. With the emergence of new sequencing technologies in the last decade, our ability to dissect the intricacies of RNA has flourished.
View Article and Find Full Text PDFJ Clin Med
January 2025
Department of Systems Medicine, University of Rome Tor Vergata, 00133 Rome, Italy.
: The nuclear factor (NF)-kB essential modulator (NEMO) has a crucial role in the NFκB pathway. Hypomorphic pathogenic variants cause ectodermal dysplasia with immunodeficiency (EDA-ID) in affected males. However, heterozygous amorphic variants could be responsible for Incontinentia Pigmenti (IP) in female carriers.
View Article and Find Full Text PDFGenes (Basel)
December 2024
The School of Genetics and Microbiology, Trinity College Dublin, Dublin 2, D02 VF25 Dublin, Ireland.
Background: An estimated 10-15% of all genetic diseases are attributable to variants in noncanonical splice sites, auxiliary splice sites and deep-intronic variants. Most of these unstudied variants are classified as variants of uncertain significance (VUS), which are not clinically actionable. This study investigated two novel splice-altering variants, NM_000390.
View Article and Find Full Text PDFGenes (Basel)
December 2024
Dmitry Rogachev National Medical Center of Pediatric Hematology, Oncology and Immunology, 117198 Moscow, Russia.
The advent of next-generation sequencing (NGS) has revolutionized the analysis of genetic data, enabling rapid identification of pathogenic variants in patients with inborn errors of immunity (IEI). Sometimes, the use of NGS-based technologies is associated with challenges in the evaluation of the clinical significance of novel genetic variants. In silico prediction tools, such as SpliceAI neural network, are often used as a first-tier approach for the primary examination of genetic variants of uncertain clinical significance.
View Article and Find Full Text PDFGenes (Basel)
December 2024
Institute of Biomedical Chemistry, 119121 Moscow, Russia.
Background: This study aims to analyze the exploration degree of popular model organisms by utilizing annotations from the UniProtKB (Swiss-Prot) knowledge base. The research focuses on understanding the genomic and post-genomic data of various organisms, particularly in relation to aging as an integral model for studying the molecular mechanisms underlying pathological processes and physiological states.
Methods: Having characterized the organisms by selected parameters (numbers of gene splice variants, post-translational modifications, etc.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!