A key goal of whole-genome sequencing for studies of human genetics is to interrogate all forms of variation, including single-nucleotide variants, small insertion or deletion (indel) variants and structural variants. However, tools and resources for the study of structural variants have lagged behind those for smaller variants. Here we used a scalable pipeline to map and characterize structural variants in 17,795 deeply sequenced human genomes. We publicly release site-frequency data to create the largest, to our knowledge, whole-genome-sequencing-based structural variant resource so far. On average, individuals carry 2.9 rare structural variants that alter coding regions; these variants affect the dosage or structure of 4.2 genes and account for 4.0-11.2% of rare high-impact coding alleles. Using a computational model, we estimate that structural variants account for 17.2% of rare alleles genome-wide, with predicted deleterious effects that are equivalent to loss-of-function coding alleles; approximately 90% of such structural variants are noncoding deletions (mean 19.1 per genome). We report 158,991 ultra-rare structural variants and show that 2% of individuals carry ultra-rare megabase-scale structural variants, nearly half of which are balanced or complex rearrangements. Finally, we infer the dosage sensitivity of genes and noncoding elements, and reveal trends that relate to element class and conservation. This work will help to guide the analysis and interpretation of structural variants in the era of whole-genome sequencing.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7547914PMC
http://dx.doi.org/10.1038/s41586-020-2371-0DOI Listing

Publication Analysis

Top Keywords

structural variants
36
variants
13
structural
11
human genomes
8
whole-genome sequencing
8
individuals carry
8
coding alleles
8
mapping characterization
4
characterization structural
4
structural variation
4

Similar Publications

variants in children with neurodevelopmental impairment are difficult to assess due to their heterogeneity and unclear pathogenic mechanisms. We describe a child with neonatal-onset epilepsy, developmental impairment of intermediate severity, and G256W heterozygosity. Analyzing prior KCNQ2 channel cryoelectron microscopy models revealed G256 as a node of an arch-shaped non-covalent bond network linking S5, the pore turret, and the ion path.

View Article and Find Full Text PDF

Karst caves, formed from the dissolution of soluble rocks, are characterized by the absence of photosynthetic activity and low levels of organic matter. Organisms evolve under these particular conditions, which causes high levels of endemic biodiversity in both macroorganism and microbes. Recent research has highlighted the presence of testate amoebae (Arcellinida) group in cave environments.

View Article and Find Full Text PDF

This study investigates the effectiveness and efficiency of two topological data analysis (TDA) techniques, the conventional Mapper (CM) and its variant version, the Ball Mapper (BM), in analyzing the behavior of six major air pollutants (NO, PM, PM, O, CO, and SO) across 60 air quality monitoring stations in Malaysia. Topological graphs produced by CM and BM reveal redundant monitoring stations and geographical relationships corresponding to air pollutant behavior, providing better visualization than traditional hierarchical clustering. Additionally, a comparative analysis of topological graph structures was conducted using node degree distribution, topological graph indices, and Dynamic Time Warping (DTW) to evaluate the sensitivity and performance of these TDA techniques.

View Article and Find Full Text PDF

Non-syndromic hearing loss (NSHL) is a genetically heterogeneous disorder accounting for almost 70% of the total congenital hearing loss. The implementation of rapid advanced sequencing methods has significantly contributed to the correct molecular diagnosis for several rare genetic disorders, including NHSL. Features of two probands with NHSL were clinically and genetically evaluated.

View Article and Find Full Text PDF

Sustainable agriculture approaches necessitate a concerted effort from researchers to establish paths that meet global population needs without compromising environmental resources. Goats are unique among ruminants because of their ability to adapt to some of the harshest environments around the world. Growth Hormone (GH) gene is a major regulator of muscle mass growth.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!