Sex-differential selection (SDS), which occurs when the fitness effects of alleles differ between males and females, can have profound impacts on the maintenance of genetic variation, disease risk, and other key aspects of natural populations. Because the sexes mix their autosomal genomes each generation, quantifying SDS is not possible using conventional population genetic approaches. Here, we introduce a method that exploits subtle sex differences in haplotype frequencies resulting from SDS acting in the current generation.
View Article and Find Full Text PDFA major obstacle hindering the broad adoption of polygenic scores (PGS) is their lack of "portability" to people that differ-in genetic ancestry or other characteristics-from the GWAS samples in which genetic effects were estimated. Here, we use the UK Biobank to measure the change in PGS prediction accuracy as a continuous function of individuals' genome-wide genetic dissimilarity to the GWAS sample ("genetic distance"). Our results highlight three gaps in our understanding of PGS portability.
View Article and Find Full Text PDFLD score regression (LDSC) is a method to estimate narrow-sense heritability from genome-wide association study (GWAS) summary statistics alone, making it a fast and popular approach. In this work, we present interaction-LD score (i-LDSC) regression: an extension of the original LDSC framework that accounts for interactions between genetic variants. By studying a wide range of generative models in simulations, and by re-analyzing 25 well-studied quantitative phenotypes from 349,468 individuals in the UK Biobank and up to 159,095 individuals in BioBank Japan, we show that the inclusion of a -interaction score (i.
View Article and Find Full Text PDFPolygenic scores (PGS) have emerged as the tool of choice for genomic prediction in a wide range of fields. We show that PGS performance varies broadly across contexts and biobanks. Contexts such as age, sex and income can impact PGS accuracy with similar magnitudes as genetic ancestry.
View Article and Find Full Text PDFGenetic effects on complex traits may depend on context, such as age, sex, environmental exposures or social settings. However, it is often unclear if the extent of context dependency, or Gene-by-Environment interaction (GxE), merits more involved models than the additive model typically used to analyze data from genome-wide association studies (GWAS). Here, we suggest considering the utility of GxE models in GWAS as a tradeoff between bias and variance parameters.
View Article and Find Full Text PDFThe scientific literature has seen a resurgence of interest in genetic influences on human behavior and socioeconomic outcomes. Such studies face the central difficulty of distinguishing possible causal influences, in particular genetic and non-genetic ones. When confounding between possible influences is not rigorously addressed, it invites over- and misinterpretation of data.
View Article and Find Full Text PDFPolygenic scores (PGS) have emerged as the tool of choice for genomic prediction in a wide range of fields from agriculture to personalized medicine. We analyze data from two large biobanks in the US (All of Us) and the UK (UK Biobank) to find widespread variability in PGS performance across contexts. Many contexts, including age, sex, and income, impact PGS accuracies with similar magnitudes as genetic ancestry.
View Article and Find Full Text PDFSex differences in complex traits are suspected to be in part due to widespread gene-by-sex interactions (GxSex), but empirical evidence has been elusive. Here, we infer the mixture of ways in which polygenic effects on physiological traits covary between males and females. We find that GxSex is pervasive but acts primarily through systematic sex differences in the magnitude of many genetic effects ("amplification") rather than in the identity of causal variants.
View Article and Find Full Text PDFAncient DNA has revolutionized our understanding of human population history. However, its potential to examine how rapid cultural evolution to new lifestyles may have driven biological adaptation has not been met, largely due to limited sample sizes. We assembled genome-wide data from 1,291 individuals from Europe over 10,000 years, providing a dataset that is large enough to resolve the timing of selection into the Neolithic, Bronze Age, and Historical periods.
View Article and Find Full Text PDFAlthough gene duplication is an important source of evolutionary innovation, the functional divergence of duplicates can be opposed by ongoing gene conversion between them. Here, we report on the evolution of a tandem duplication of Na,K-ATPase subunit α1 (ATP1A1) shared by frogs in the genus Leptodactylus, a group of species that feeds on toxic toads. One ATP1A1 paralog evolved resistance to toad toxins although the other retained ancestral susceptibility.
View Article and Find Full Text PDFDemographic noise, the change in the composition of a population due to random birth and death events, is an important driving force in evolution because it reduces the efficacy of natural selection. Demographic noise is typically thought to be set by the population size and the environment, but recent experiments with microbial range expansions have revealed substantial strain-level differences in demographic noise under the same growth conditions. Many genetic and phenotypic differences exist between strains; to what extent do single mutations change the strength of demographic noise? To investigate this question, we developed a high-throughput method for measuring demographic noise in colonies without the need for genetic manipulation.
View Article and Find Full Text PDFThe selection pressures that have shaped the evolution of complex traits in humans remain largely unknown, and in some contexts highly contentious, perhaps above all where they concern mean trait differences among groups. To date, the discussion has focused on whether such group differences have any genetic basis, and if so, whether they are without fitness consequences and arose via random genetic drift, or whether they were driven by selection for different trait optima in different environments. Here, we highlight a plausible alternative: that many complex traits evolve under stabilizing selection in the face of shifting environmental effects.
View Article and Find Full Text PDFBrown rats (Rattus norvegicus) thrive in urban environments by navigating the anthropocentric environment and taking advantage of human resources and by-products. From the human perspective, rats are a chronic problem that causes billions of dollars in damage to agriculture, health, and infrastructure. Did genetic adaptation play a role in the spread of rats in cities? To approach this question, we collected whole-genome sequences from 29 brown rats from New York City (NYC) and scanned for genetic signatures of adaptation.
View Article and Find Full Text PDFFields as diverse as human genetics and sociology are increasingly using polygenic scores based on genome-wide association studies (GWAS) for phenotypic prediction. However, recent work has shown that polygenic scores have limited portability across groups of different genetic ancestries, restricting the contexts in which they can be used reliably and potentially creating serious inequities in future clinical applications. Using the UK Biobank data, we demonstrate that even within a single ancestry group (i.
View Article and Find Full Text PDFUnlabelled: Several recent papers have reported strong signals of selection on European polygenic height scores. These analyses used height effect estimates from the GIANT consortium and replication studies. Here, we describe a new analysis based on the the UK Biobank (UKB), a large, independent dataset.
View Article and Find Full Text PDFGene expression variation is a major contributor to phenotypic variation in human complex traits. Selection on complex traits may therefore be reflected in constraint on gene expression. Here, we explore the effects of stabilizing selection on -regulatory genetic variation in humans.
View Article and Find Full Text PDFWithin-host adaptation of pathogens such as human immunodeficiency virus (HIV) often occurs at more than two loci. Multiple beneficial mutations may arise simultaneously on different genetic backgrounds and interfere, affecting each other's fixation trajectories. Here, we explore how these evolutionary dynamics are mirrored in multilocus linkage disequilibrium (MLD), a measure of multi-way associations between alleles.
View Article and Find Full Text PDFProc Natl Acad Sci U S A
November 2017
Gene conversion is the copying of a genetic sequence from a "donor" region to an "acceptor." In nonallelic gene conversion (NAGC), the donor and the acceptor are at distinct genetic loci. Despite the role NAGC plays in various genetic diseases and the concerted evolution of gene families, the parameters that govern NAGC are not well characterized.
View Article and Find Full Text PDFGiven the implications of tumor dynamics for precision medicine, there is a need to systematically characterize the mode of evolution across diverse solid tumor types. In particular, methods to infer the role of natural selection within established human tumors are lacking. By simulating spatial tumor growth under different evolutionary modes and examining patterns of between-region subclonal genetic divergence from multiregion sequencing (MRS) data, we demonstrate that it is feasible to distinguish tumors driven by strong positive subclonal selection from those evolving neutrally or under weak selection, as the latter fail to dramatically alter subclonal composition.
View Article and Find Full Text PDFThe site frequency spectrum (SFS) has long been used to study demographic history and natural selection. Here, we extend this summary by examining the SFS conditional on the alleles found at the same site in other species. We refer to this extension as the "phylogenetically-conditioned SFS" or cSFS.
View Article and Find Full Text PDFAccurate annotation of protein coding regions is essential for understanding how genetic information is translated into function. We describe riboHMM, a new method that uses ribosome footprint data to accurately infer translated sequences. Applying riboHMM to human lymphoblastoid cell lines, we identified 7273 novel coding sequences, including 2442 translated upstream open reading frames.
View Article and Find Full Text PDFEvolution
September 2014
Evolution experiments with microorganisms coupled with genome-wide sequencing now allow for the systematic study of population genetic processes under a wide range of conditions. In learning about these processes in natural, sexual populations, neutral models that describe the behavior of diversity and divergence summaries have played a pivotal role. It is therefore natural to ask whether neutral models, suitably modified, could be useful in the context of evolution experiments.
View Article and Find Full Text PDF