Functional annotation and interpretation of genetic variants are a critical step in genetic diagnosis, as it may lead to personalized therapeutic options and genetic counseling. While the number of confirmed pathogenic genetic variants in an individual is relatively low, the number of variants of uncertain significance (VOUS) can be considerably higher, increasing the number of potential carriers of genetic disorders. Thus, reducing uncertainty and assessing the real effect of VOUS are crucial for clinical and medical genetics.
View Article and Find Full Text PDFRNA-binding proteins are emerging as critical modulators of oncogenic cell transformation, malignancy and therapy resistance. We have previously found that the RNA-binding protein Cold Shock Domain containing protein E1 (CSDE1) promotes invasion and metastasis of melanoma, the deadliest form of skin cancer and also a highly heterogeneous disease in need of predictive biomarkers and druggable targets. Here, we design a monoclonal antibody useful for IHC in the clinical setting and use it to evaluate the prognosis potential of CSDE1 in an exploratory cohort of 149 whole tissue sections including benign nevi and primary tumors and metastasis from melanoma patients.
View Article and Find Full Text PDFAn important fraction of patients with rare disorders remains with no clear genetic diagnostic, even after whole-exome or whole-genome sequencing, posing a difficulty in giving adequate treatment and genetic counseling. The analysis of genomic data in rare disorders mostly considers the presence of single gene variants in coding regions that follow a concrete monogenic mode of inheritance. A digenic inheritance, with variants in two functionally-related genes in the same individual, is a plausible alternative that might explain the genetic basis of the disease in some cases.
View Article and Find Full Text PDFThe occurrence of natural variation in human microRNAs has been the focus of numerous studies during the last 20 years. Most of them have been focused on the role of specific mutations in disease, while a minor proportion seek to analyse microRNA diversity in the genomes of human populations. We analyse the latest human microRNA annotations in the light of the most updated catalogue of genetic variation provided by the 1000 Genomes Project.
View Article and Find Full Text PDFThe ability of detecting adaptive (positive) selection in the genome has opened the possibility of understanding the genetic basis of population-specific adaptations genome-wide. Here, we present the analysis of recent selective sweeps, specifically in the X chromosome, in human populations from the third phase of the 1,000 Genomes Project using three different haplotype-based statistics. We describe instances of recent positive selection that fit the criteria of hard or soft sweeps, and detect a higher number of events among sub-Saharan Africans than non-Africans (Europe and East Asia).
View Article and Find Full Text PDFTissue function and homeostasis reflect the gene expression signature by which the combination of ubiquitous and tissue-specific genes contribute to the tissue maintenance and stimuli-responsive function. Enhancers are central to control this tissue-specific gene expression pattern. Here, we explore the correlation between the genomic location of enhancers and their role in tissue-specific gene expression.
View Article and Find Full Text PDFNAR Genom Bioinform
September 2020
After diverging, each chimpanzee subspecies has been the target of unique selective pressures. Here, we employ a machine learning approach to classify regions as under positive selection or neutrality genome-wide. The regions determined to be under selection reflect the unique demographic and adaptive history of each subspecies.
View Article and Find Full Text PDFBackground: In the process of adaptation of humans to their environment, positive or adaptive selection has played a main role. Positive selection has, however, been under-studied in African populations, despite their diversity and importance for understanding human history.
Results: Here, we have used 119 available whole-genome sequences from five Ethiopian populations (Amhara, Oromo, Somali, Wolayta and Gumuz) to investigate the modes and targets of positive selection in this part of the world.
The Roma people are the largest transnational ethnic minority in Europe and can be considered the last human migration of South Asian origin into the continent. They left Northwest India approximately 1,000 years ago, reaching the Balkan Peninsula around the twelfth century and Romania in the fourteenth century. Here, we analyze whole-genome sequencing data of 40 Roma and 40 non-Roma individuals from Romania.
View Article and Find Full Text PDFMultiple sequence alignments (MSAs) are used for structural and evolutionary predictions, but the complexity of aligning large datasets requires the use of approximate solutions, including the progressive algorithm. Progressive MSA methods start by aligning the most similar sequences and subsequently incorporate the remaining sequences, from leaf to root, based on a guide tree. Their accuracy declines substantially as the number of sequences is scaled up.
View Article and Find Full Text PDFBackground: Determining the factors involved in the likelihood of a gene being under adaptive selection is still a challenging goal in Evolutionary Biology. Here, we perform an evolutionary analysis of the human metabolic genes to explore the associations between network structure and the presence and strength of natural selection in the genes whose products are involved in metabolism. Purifying and positive selection are estimated at interspecific (among mammals) and intraspecific (among human populations) levels, and the connections between enzymatic reactions are differentiated between incoming (in-degree) and outgoing (out-degree) links.
View Article and Find Full Text PDFMetabolic networks comprise thousands of enzymatic reactions functioning in a controlled manner and have been shaped by natural selection. Thanks to the genome data, the footprints of adaptive (positive) selection are detectable, and the strength of purifying selection can be measured. This has made possible to know where, in the metabolic network, adaptive selection has acted and where purifying selection is more or less strong and efficient.
View Article and Find Full Text PDFDuring the demographic history of the Pan clade, there has been gene-flow between species, likely >200,000 years ago. Bonobo haplotypes in three subspecies of chimpanzee have been identified to be segregating in modern-day chimpanzee populations, suggesting that these haplotypes, with increased differentiation, may be a target of natural selection. Here, we investigate signatures of adaptive introgression within the bonobo-like haplotypes in chimpanzees using site frequency spectrum-based tests.
View Article and Find Full Text PDFThe chimpanzee is arguably the most important species for the study of human origins. A key resource for these studies is a high-quality reference genome assembly; however, as with most mammalian genomes, the current iteration of the chimpanzee reference genome assembly is highly fragmented. In the current iteration of the chimpanzee reference genome assembly (Pan_tro_2.
View Article and Find Full Text PDFThe 1000 Genomes Project (1000GP) represents the most comprehensive world-wide nucleotide variation data set so far in humans, providing the sequencing and analysis of 2504 genomes from 26 populations and reporting >84 million variants. The availability of this sequence data provides the human lineage with an invaluable resource for population genomics studies, allowing the testing of molecular population genetics hypotheses and eventually the understanding of the evolutionary dynamics of genetic variation in human populations. Here we present PopHuman, a new population genomics-oriented genome browser based on JBrowse that allows the interactive visualization and retrieval of an extensive inventory of population genetics metrics.
View Article and Find Full Text PDFWe present 42 new Y-chromosomal sequences from diverse Indian tribal and non-tribal populations, including the Jarawa and Onge from the Andaman Islands, which are analysed within a calibrated Y-chromosomal phylogeny incorporating South Asian (in total 305 individuals) and worldwide (in total 1286 individuals) data from the 1000 Genomes Project. In contrast to the more ancient ancestry in the South than in the North that has been claimed, we detected very similar coalescence times within Northern and Southern non-tribal Indian populations. A closest neighbour analysis in the phylogeny showed that Indian populations have an affinity towards Southern European populations and that the time of divergence from these populations substantially predated the Indo-European migration into India, probably reflecting ancient shared ancestry rather than the Indo-European migration, which had little effect on Indian male lineages.
View Article and Find Full Text PDFNatural selection is crucial for the adaptation of populations to their environments. Here, we present the first global study of natural selection in the Hominidae (humans and great apes) based on genome-wide information from population samples representing all extant species (including most subspecies). Combining several neutrality tests we create a multi-species map of signatures of natural selection covering all major types of natural selection.
View Article and Find Full Text PDFTo shed light on the peopling of South Asia and the origins of the morphological adaptations found there, we analyzed whole-genome sequences from 10 Andamanese individuals and compared them with sequences for 60 individuals from mainland Indian populations with different ethnic histories and with publicly available data from other populations. We show that all Asian and Pacific populations share a single origin and expansion out of Africa, contradicting an earlier proposal of two independent waves of migration. We also show that populations from South and Southeast Asia harbor a small proportion of ancestry from an unknown extinct hominin, and this ancestry is absent from Europeans and East Asians.
View Article and Find Full Text PDFNucleotide variants in microRNA regions have been associated with disease; nevertheless, few studies still have addressed the allele-dependent effect of these changes. We studied microRNA genetic variation in human populations and found that while low-frequency variants accumulate indistinctly in microRNA regions, the mature and seed regions tend to be depleted of high-frequency variants, probably as a result of purifying selection. Comparison of pairwise population fixation indexes among regions showed that the seed had higher population fixation indexes than the other regions, suggesting the existence of local adaptation in the seed region.
View Article and Find Full Text PDFMotivation: Detecting positive selection in genomic regions is a recurrent topic in natural population genetic studies. However, there is little consistency among the regions detected in several genome-wide scans using different tests and/or populations. Furthermore, few methods address the challenge of classifying selective events according to specific features such as age, intensity or state (completeness).
View Article and Find Full Text PDFEast Africa is a strategic region to study human genetic diversity due to the presence of ethnically, linguistically, and geographically diverse populations. Here, we provide new insight into the genetic history of populations living in the Sudanese region of East Africa by analysing nine ethnic groups belonging to three African linguistic families: Niger-Kordofanian, Nilo-Saharan and Afro-Asiatic. A total of 500 individuals were genotyped for 200,000 single-nucleotide polymorphisms.
View Article and Find Full Text PDFGenes vary in their likelihood to undergo adaptive evolution. The genomic factors that determine adaptability, however, remain poorly understood. Genes function in the context of molecular networks, with some occupying more important positions than others and thus being likely to be under stronger selective pressures.
View Article and Find Full Text PDFSummary: A wealth of large-scale genome sequencing projects opens the doors to new approaches to study the relationship between genotype and phenotype. One such opportunity is the possibility to apply genotype networks analysis to population genetics data. Genotype networks are a representation of the set of genotypes associated with a single phenotype, and they allow one to estimate properties such as the robustness of the phenotype to mutations, and the ability of its associated genotypes to evolve new adaptations.
View Article and Find Full Text PDF