Establishing the genetic and geographic structure of populations is fundamental, both to understand their evolutionary past and preserve their future. Nevertheless, the patterns of genetic population structure are unknown for most endangered species. This is the case for bonobos (Pan paniscus), which, together with chimpanzees (Pan troglodytes), are humans' closest living relatives.
View Article and Find Full Text PDFTwo independent exome sequencing initiatives aimed to identify new genes involved in the predisposition to nonpolyposis colorectal cancer led to the identification of heterozygous loss-of-function variants in NPAT, a gene that encodes a cyclin E/CDK2 effector required for S phase entry and a coactivator of histone transcription, in two families with multiple members affected with colorectal cancer. Enrichment of loss-of-function and predicted deleterious NPAT variants was identified in familial/early-onset colorectal cancer patients compared to non-cancer gnomAD individuals, further supporting the association with the disease. Previous studies in Drosophila models showed that NPAT abrogation results in chromosomal instability, increase of double strand breaks, and induction of tumour formation.
View Article and Find Full Text PDFUnlabelled: The aim of this study was to determine how mutations impact glioblastoma prognosis.
Materials And Methods: mutations were assessed in a retrospective cohort of 258 uniformly treated glioblastoma patients. RNA-sequencing and whole exome sequencing results were available in a subset of patients.
In the late 19th century, formalin fixation with paraffin-embedding (FFPE) of tissues was developed as a fixation and conservation method and is still used to this day in routine clinical and pathological practice. The implementation of state-of-the-art nucleic acid sequencing technologies has sparked much interest for using historical FFPE samples stored in biobanks as they hold promise in extracting new information from these valuable samples. However, formalin fixation chemically modifies DNA, which potentially leads to incorrect sequences or misinterpretations in downstream processing and data analysis.
View Article and Find Full Text PDFThe exonuclease domain of DNA polymerases epsilon's catalytic subunit (POLE) removes misincorporated nucleotides, called proofreading. POLE-exonuclease mutations cause colorectal- and endometrial cancers with an extreme burden of single nucleotide substitutions. We recently reported that particularly the hereditary POLE exonuclease mutation N363K predisposes in addition to aggressive giant cell glioblastomas.
View Article and Find Full Text PDFThe precisionFDA Truth Challenge V2 aimed to assess the state of the art of variant calling in challenging genomic regions. Starting with FASTQs, 20 challenge participants applied their variant-calling pipelines and submitted 64 variant call sets for one or more sequencing technologies (Illumina, PacBio HiFi, and Oxford Nanopore Technologies). Submissions were evaluated following best practices for benchmarking small variants with updated Genome in a Bottle benchmark sets and genome stratifications.
View Article and Find Full Text PDFEndometrial cancer (EC) is the second most frequent gynecological cancer worldwide. Although improvements in EC classification have enabled an accurate establishment of disease prognosis, women with a high-risk or recurrent EC face a dramatic situation due to limited further treatment options. Therefore, new strategies that closely mimic the disease are required to maximize drug development success.
View Article and Find Full Text PDFMany patients experiencing a rare disease remain undiagnosed even after genomic testing. Reanalysis of existing genomic data has shown to increase diagnostic yield, although there are few systematic and comprehensive reanalysis efforts that enable collaborative interpretation and future reinterpretation. The Undiagnosed Rare Disease Program of Catalonia project collated previously inconclusive good quality genomic data (panels, exomes, and genomes) and standardized phenotypic profiles from 323 families (543 individuals) with a neurologic rare disease.
View Article and Find Full Text PDFBackground: Mechanisms driving the progression of chronic lymphocytic leukemia (CLL) from its early stages are not fully understood. The acquisition of molecular changes at the time of progression has been observed in a small fraction of patients, suggesting that CLL progression is not mainly driven by dynamic clonal evolution. In order to shed light on mechanisms that lead to CLL progression, we investigated longitudinal changes in both the genetic and immunological scenarios.
View Article and Find Full Text PDFBrain metastases are the most common tumor of the brain with a dismal prognosis. A fraction of patients with brain metastasis benefit from treatment with immune checkpoint inhibitors (ICI) and the degree and phenotype of the immune cell infiltration has been used to predict response to ICI. However, the anatomical location of brain lesions limits access to tumor material to characterize the immune phenotype.
View Article and Find Full Text PDFColorectal cancer (CRC) shows aggregation in some families but no alterations in the known hereditary CRC genes. We aimed to identify new candidate genes which are potentially involved in germline predisposition to familial CRC. An integrated analysis of germline and tumor whole-exome sequencing data was performed in 18 unrelated CRC families.
View Article and Find Full Text PDFDetecting the genomic changes underlying phenotypic changes between species is a main goal of evolutionary biology and genomics. Evolutionary theory predicts that changes in cis-regulatory elements are important for morphological changes. We combined genome sequencing, functional genomics and genome-wide comparative analyses to investigate regulatory elements in lineages that lost morphological traits.
View Article and Find Full Text PDFThis unit describes the usage of geneid, an efficient gene-finding program that allows for the analysis of large genomic sequences, including whole mammalian chromosomes. These sequences can be partially annotated, and geneid can be used to refine this initial annotation. Training geneid is relatively easy, and parameter configurations exist for a number of eukaryotic species.
View Article and Find Full Text PDFThe genome comprises 263 Mb and 34,240 gene models organized in 20 different chromosomes. To improve our understanding of gene function we have generated an EMS mutant platform, consisting of 3,751 independent M2 families. The quality of the collection has been evaluated based on phenotyping and whole-genome re-sequencing (WGS) results.
View Article and Find Full Text PDFThe growing number of sequenced genomes allows us now to address a key question in genetics and evolutionary biology: which genomic changes underlie particular phenotypic changes between species? Previously, we developed a computational framework called Forward Genomics that associates phenotypic to genomic differences by focusing on phenotypes that are independently lost in different lineages. However, our previous implementation had three main limitations. Here, we present two new Forward Genomics methods that overcome these limitations by (1) directly controlling for phylogenetic relatedness, (2) controlling for differences in evolutionary rates, and (3) computing a statistical significance.
View Article and Find Full Text PDFBalancing selection is an important evolutionary force that maintains genetic and phenotypic diversity in populations. Most studies in humans have focused on long-standing balancing selection, which persists over long periods of time and is generally shared across populations. But balanced polymorphisms can also promote fast adaptation, especially when the environment changes.
View Article and Find Full Text PDFAs humans migrated around the world, they came to inhabit environments that differ widely in the soil levels of certain micronutrients, including selenium (Se). Coupled with cultural variation in dietary practices, these migrations have led to a wide range of Se intake levels in populations around the world. Both excess and deficiency of Se in the diet can have adverse health consequences in humans, with severe Se deficiency resulting in diseases of the bone and heart.
View Article and Find Full Text PDFBalancing selection maintains advantageous genetic and phenotypic diversity in populations. When selection acts for long evolutionary periods selected polymorphisms may survive species splits and segregate in present-day populations of different species. Here, we investigate the role of long-term balancing selection in the evolution of protein-coding sequences in the Homo-Pan clade.
View Article and Find Full Text PDFClassically, gene prediction programs are based on detecting signals such as boundary sites (splice sites, starts, and stops) and coding regions in the DNA sequence in order to build potential exons and join them into a gene structure. Although nowadays it is possible to improve their performance with additional information from related species or/and cDNA databases, further improvement at any step could help to obtain better predictions. Here, we present WISCOD, a web-enabled tool for the identification of significant protein coding regions, a novel software tool that tackles the exon prediction problem in eukaryotic genomes.
View Article and Find Full Text PDFWe present the DNA sequence of 17,367 protein-coding genes in two Neandertals from Spain and Croatia and analyze them together with the genome sequence recently determined from a Neandertal from southern Siberia. Comparisons with present-day humans from Africa, Europe, and Asia reveal that genetic diversity among Neandertals was remarkably low, and that they carried a higher proportion of amino acid-changing (nonsynonymous) alleles inferred to alter protein structure or function than present-day humans. Thus, Neandertals across Eurasia had a smaller long-term effective population than present-day humans.
View Article and Find Full Text PDFSelenoDB (http://www.selenodb.org) aims to provide high-quality annotations of selenoprotein genes, proteins and SECIS elements.
View Article and Find Full Text PDFBackground: In today's age of genomic discovery, no attempt has been made to comprehensively sequence a gymnosperm genome. The largest genus in the coniferous family Pinaceae is Pinus, whose 110-120 species have extremely large genomes (c. 20-40 Gb, 2N = 24).
View Article and Find Full Text PDFGenome sequencing projects have been initiated for a wide range of eukaryotes. A few projects have reached completion, but most exist as draft assemblies. As one of the main reasons to sequence a genome is to obtain its catalog of genes, an important question is how complete or completable the catalog is in unfinished genomes.
View Article and Find Full Text PDF