Dicentric chromosomes are products of genomic rearrangements that place two centromeres on the same chromosome. Due to the presence of two primary constrictions, they are inherently unstable and overcome their instability by epigenetically inactivating and/or deleting one of the two centromeres, thus resulting in functionally monocentric chromosomes that segregate normally during cell division. Our understanding to date of dicentric chromosome formation, behavior and fate has been largely inferred from observational studies in plants and humans as well as artificially produced de novo dicentrics in yeast and in human cells.
View Article and Find Full Text PDFAttention-Deficit Hyperactivity Disorder (ADHD) has high heritability; however, studies of common variation account for <5% of ADHD variance. Using data from affected participants without a family history of ADHD, we sought to identify de novo variants that could account for sporadic ADHD. Considering a total of 128 families, two analyses were conducted in parallel: first, in 11 unaffected parent/affected proband trios (or quads with the addition of an unaffected sibling) we completed exome sequencing.
View Article and Find Full Text PDFBackground: Gene innovation by duplication is a fundamental evolutionary process but is difficult to study in humans due to the large size, high sequence identity, and mosaic nature of segmental duplication blocks. The human-specific gene hydrocephalus-inducing 2, HYDIN2, was generated by a 364 kbp duplication of 79 internal exons of the large ciliary gene HYDIN from chromosome 16q22.2 to chromosome 1q21.
View Article and Find Full Text PDFGene-disruptive mutations contribute to the biology of neurodevelopmental disorders (NDDs), but most of the related pathogenic genes are not known. We sequenced 208 candidate genes from >11,730 cases and >2,867 controls. We identified 91 genes, including 38 new NDD genes, with an excess of de novo mutations or private disruptive mutations in 5.
View Article and Find Full Text PDFMost evolutionary new centromeres (ENC) are composed of large arrays of satellite DNA and surrounded by segmental duplications. However, the hypothesis is that ENCs are seeded in an anonymous sequence and only over time have acquired the complexity of "normal" centromeres. Up to now evidence to test this hypothesis was lacking.
View Article and Find Full Text PDFDegradation of proteins by the ubiquitin-proteasome system (UPS) is an essential biological process in the development of eukaryotic organisms. Dysregulation of this mechanism leads to numerous human neurodegenerative or neurodevelopmental disorders. Through a multi-center collaboration, we identified six de novo genomic deletions and four de novo point mutations involving PSMD12, encoding the non-ATPase subunit PSMD12 (aka RPN5) of the 19S regulator of 26S proteasome complex, in unrelated individuals with intellectual disability, congenital malformations, ophthalmologic anomalies, feeding difficulties, deafness, and subtle dysmorphic facial features.
View Article and Find Full Text PDFBackground: Although many algorithms are now available that aim to characterize different classes of structural variation, discovery of balanced rearrangements such as inversions remains an open problem. This is mainly due to the fact that breakpoints of such events typically lie within segmental duplications or common repeats, which reduces the mappability of short reads. The algorithms developed within the 1000 Genomes Project to identify inversions are limited to relatively short inversions, and there are currently no available algorithms to discover large inversions using high throughput sequencing technologies.
View Article and Find Full Text PDFWhole-exome and whole-genome sequencing have facilitated the large-scale discovery of de novo variants in human disease. To date, most de novo discovery through next-generation sequencing focused on congenital heart disease and neurodevelopmental disorders (NDDs). Currently, de novo variants are one of the most significant risk factors for NDDs with a substantial overlap of genes involved in more than one NDD.
View Article and Find Full Text PDFThe ubiquitin pathway is an enzymatic cascade including activating E1, conjugating E2, and ligating E3 enzymes, which governs protein degradation and sorting. It is crucial for many physiological processes. Compromised function of members of the ubiquitin pathway leads to a wide range of human diseases, such as cancer, neurodegenerative diseases, and neurodevelopmental disorders.
View Article and Find Full Text PDFRecurrent de novo (DN) and likely gene-disruptive (LGD) mutations contribute significantly to autism spectrum disorders (ASDs) but have been primarily investigated in European cohorts. Here, we sequence 189 risk genes in 1,543 Chinese ASD probands (1,045 from trios). We report an 11-fold increase in the odds of DN LGD mutations compared with expectation under an exome-wide neutral model of mutation.
View Article and Find Full Text PDFMolecular inversion probes (MIPs) in combination with massively parallel DNA sequencing represent a versatile, yet economical tool for targeted sequencing of genomic DNA. Several thousand genomic targets can be selectively captured using long oligonucleotides containing unique targeting arms and universal linkers. The ability to append sequencing adaptors and sample-specific barcodes allows large-scale pooling and subsequent high-throughput sequencing at relatively low cost per sample.
View Article and Find Full Text PDFRecurrent rearrangements of Chromosome 8p23.1 are associated with congenital heart defects and developmental delay. The complexity of this region has led to inconsistencies in the current reference assembly, confounding studies of genetic variation.
View Article and Find Full Text PDFStructural variation (SV) represents a major source of differences between individual human genomes and has been linked to disease phenotypes. However, the majority of studies provide neither a global view of the full spectrum of these variants nor integrate them into reference panels of genetic variation. Here, we analyse whole genome sequencing data of 769 individuals from 250 Dutch families, and provide a haplotype-resolved map of 1.
View Article and Find Full Text PDFDuplications are the primary force by which new gene functions arise and provide a substrate for large-scale structural variation. Analysis of thousands of genomes shows that humans and great apes have more genetic differences in content and structure over recent segmental duplications than any other euchromatic region. Novel human-specific duplicated genes, ARHGAP11B and SRGAP2C, have recently been described with a potential role in neocortical expansion and increased neuronal spine density.
View Article and Find Full Text PDFBackground: ABO is a blood group system of high clinical significance due to the prevalence of ABO variation that can cause major, potentially life-threatening, transfusion reactions.
Study Design And Methods: Using multiple large-scale next-generation sequence data sets, we demonstrate the application of read-depth approaches to discover previously unsuspected structural variation (SV) in the ABO gene in individuals of African ancestry.
Results: Our analysis of SV in the ABO gene across 6432 exomes reveals a partial deletion in the ABO gene in 32 individuals of African ancestry that predicts a novel O allele.
Short-read sequencing has enabled the de novo assembly of several individual human genomes, but with inherent limitations in characterizing repeat elements. Here we sequence a Chinese individual HX1 by single-molecule real-time (SMRT) long-read sequencing, construct a physical map by NanoChannel arrays and generate a de novo assembly of 2.93 Gb (contig N50: 8.
View Article and Find Full Text PDFAdult human brains retain the capacity to undergo tissue reorganization during second-language learning. Brain-imaging studies show a relationship between neuroanatomical properties and learning for adults exposed to a second language. However, the role of genetic factors in this relationship has not been investigated.
View Article and Find Full Text PDFSkeletal atavism in Shetland ponies is a heritable disorder characterized by abnormal growth of the ulna and fibula that extend the carpal and tarsal joints, respectively. This causes abnormal skeletal structure and impaired movements, and affected foals are usually killed. In order to identify the causal mutation we subjected six confirmed Swedish cases and a DNA pool consisting of 21 control individuals to whole genome resequencing.
View Article and Find Full Text PDFCongenital heart disease (CHD) has a complex genetic etiology, and recent studies suggest that high penetrance de novo mutations may account for only a small fraction of disease. In a multi-institutional cohort surveyed by exome sequencing, combining analysis of 987 individuals (discovery cohort of 59 affected trios and 59 control trios, and a replication cohort of 100 affected singletons and 533 unaffected singletons) we observe variation at novel and known loci related to a specific cardiac malformation the atrioventricular septal defect (AVSD). In a primary analysis, by combining developmental coexpression networks with inheritance modeling, we identify a de novo mutation in the DNA binding domain of NR1D2 (p.
View Article and Find Full Text PDFDeciphering the genetic basis of human disease requires a comprehensive knowledge of genetic variants irrespective of their class or frequency. Although an impressive number of human genetic variants have been catalogued, a large fraction of the genetic difference that distinguishes two human genomes is still not understood at the base-pair level. This is because the emphasis has been on single-nucleotide variation as opposed to less tractable and more complex genetic variants, including indels and structural variants.
View Article and Find Full Text PDFAccurate sequence and assembly of genomes is a critical first step for studies of genetic variation. We generated a high-quality assembly of the gorilla genome using single-molecule, real-time sequence technology and a string graph de novo assembly algorithm. The new assembly improves contiguity by two to three orders of magnitude with respect to previously released assemblies, recovering 87% of missing reference exons and incomplete gene models.
View Article and Find Full Text PDFIntellectual disability (ID) and autism spectrum disorders (ASD) are genetically heterogeneous, and a significant number of genes have been associated with both conditions. A few mutations in POGZ have been reported in recent exome studies; however, these studies do not provide detailed clinical information. We collected the clinical and molecular data of 25 individuals with disruptive mutations in POGZ by diagnostic whole-exome, whole-genome, or targeted sequencing of 5,223 individuals with neurodevelopmental disorders (ID primarily) or by targeted resequencing of this locus in 12,041 individuals with ASD and/or ID.
View Article and Find Full Text PDFThe next-generation sequencing revolution has substantially increased our understanding of the mutated genes that underlie complex neurodevelopmental disease. Exome sequencing has enabled us to estimate the number of genes involved in the etiology of neurodevelopmental disease, whereas targeted sequencing approaches have provided the means for quick and cost-effective sequencing of thousands of patient samples to assess the significance of individual genes. By leveraging such technologies and clinical exome sequencing, a genotype-first approach has emerged in which patients with a common genotype are first identified and then clinically reassessed as a group.
View Article and Find Full Text PDF