Rare structural variants (SVs) - insertions, deletions, and complex rearrangements - can cause Mendelian disease, yet they remain difficult to accurately detect and interpret. We sequenced and analyzed Oxford Nanopore long-read genomes of 68 individuals from the Undiagnosed Disease Network (UDN) with no previously identified diagnostic mutations from short-read sequencing. Using our optimized SV detection pipelines and 571 control long-read genomes, we detected 716 long-read rare (MAF < 0.01) SV alleles per genome on average, achieving a 2.4x increase from short-reads. To characterize the functional effects of rare SVs, we assessed their relationship with gene expression from blood or fibroblasts from the same individuals, and found that rare SVs overlapping enhancers were enriched (LOR = 0.46) near expression outliers. We also evaluated tandem repeat expansions (TREs) and found 14 rare TREs per genome; notably these TREs were also enriched near overexpression outliers. To prioritize candidate functional SVs, we developed Watershed-SV, a probabilistic model that integrates expression data with SV-specific genomic annotations, which significantly outperforms baseline models that don't incorporate expression data. Watershed-SV identified a median of eight high-confidence functional SVs per UDN genome. Notably, this included compound heterozygous deletions in shared by two siblings, which were likely causal for a rare neurodevelopmental disorder. Our observations demonstrate the promise of integrating long-read sequencing with gene expression towards improving the prioritization of functional SVs and TREs in rare disease patients.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10996727PMC
http://dx.doi.org/10.1101/2024.03.22.24304565DOI Listing

Publication Analysis

Top Keywords

functional svs
12
structural variants
8
rare
8
rare disease
8
long-read genomes
8
rare svs
8
gene expression
8
tres rare
8
genome notably
8
expression data
8

Similar Publications

: As Repeated Low-Level Red Light (RLRL) therapy is becoming increasingly prevalent in clinical practice, mainly in the Far East, largely due to its child-friendly nature and the feasibility of home use, this study aims to conduct a systematic review and meta-analysis to evaluate the efficacy of RLRL therapy in managing childhood myopia, specifically in relation to axial length (AL) and spherical equivalent refraction (SER), across a larger group of children aged from 6 to 16 years. : A systematic literature search was performed using PubMed, Scopus, and Web of Science to access relevant databases and to locate outcome studies. Eligibility criteria included publication type, participant characteristics, and outcomes report.

View Article and Find Full Text PDF

Transposable elements (TEs) are significant drivers of genome evolution, yet their recent dynamics and impacts within and among species, as well as the roles of host genes and non-coding RNAs in the transposition process, remain elusive. With advancements in large-scale pan-genome sequencing and the development of open data sharing, large-scale comparative genomics studies have become feasible. Here, we performed complete de novo TE annotations and identified active TEs in 310 plant genome assemblies across 119 species and seven crop populations.

View Article and Find Full Text PDF

Background And Objectives: Methylenetetrahydrofolate reductase (MTHFR) is a key enzyme that regulates folate and homocysteine metabolism. Genetic variation in has been implicated in cerebrovascular disease risk, although research in diverse populations is lacking. We thus aimed to investigate the effect of genetically predicted MTHFR activity on risk of ischemic stroke (IS) and its main subtypes using a multiancestry Mendelian randomization (MR) approach.

View Article and Find Full Text PDF

Carcinogenesis often involves significant alterations in the cancer genome, marked by large structural variants (SVs) and copy number variations (CNVs) that are difficult to capture with short-read sequencing. Traditionally, cytogenetic techniques are applied to detect such aberrations, but they are limited in resolution and do not cover features smaller than several hundred kilobases. Optical genome mapping (OGM) and nanopore sequencing [Oxford Nanopore Technologies (ONT)] bridge this resolution gap and offer enhanced performance for cytogenetic applications.

View Article and Find Full Text PDF

Given the presence of highly repetitive genomic regions such as subtelomeric regions, understanding human genomic evolution remains challenging. Recently, long-read sequencing technology has facilitated the identification of complex genetic variants, including structural variants (SVs), at the single-nucleotide level. Here, we resolved SVs and their underlying DNA damage-repair mechanisms in subtelomeric regions, which are among the most uncharted genomic regions.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!