Publications by Ryan N Gutenkunst | LitMetric

Publications by authors named "Ryan N Gutenkunst"

Modeling biases from low-pass genome sequencing to enable accurate population genetic inferences.

Emanuel M Fonseca Linh N Tran Hannah Mendoza Ryan N Gutenkunst

bioRxiv

July 2024

Low-pass genome sequencing is cost-effective and enables analysis of large cohorts. However, it introduces biases by reducing heterozygous genotypes and low-frequency alleles, impacting subsequent analyses such as demographic history inference. We developed a probabilistic model of low-pass biases from the Genome Analysis Toolkit (GATK) multi-sample calling pipeline, and we implemented it in the population genomic inference software dadi.

Non-hypermutator cancers access driver mutations through reversals in germline mutational bias.

Marwa Z Tuffaha David Castellano Claudia Serrano Colome Ryan N Gutenkunst Lindi M Wahl

bioRxiv

November 2024

Cancer is an evolutionary disease driven by mutations in asexually-reproducing somatic cells. In asexual microbes, bias reversals in the mutation spectrum can speed adaptation by increasing access to previously undersampled beneficial mutations. By analyzing tumors from 20 tissues, along with normal tissue and the germline, we demonstrate this effect in cancer.

Computationally Efficient Demographic History Inference from Allele Frequencies with Supervised Machine Learning.

Linh N Tran Connie K Sun Travis J Struck Mathews Sajan Ryan N Gutenkunst

Mol Biol Evol

May 2024

Inferring past demographic history of natural populations from genomic data is of central concern in many studies across research fields. Previously, our group had developed dadi, a widely used demographic history inference method based on the allele frequency spectrum (AFS) and maximum composite-likelihood optimization. However, dadi's optimization procedure can be computationally expensive.

Computationally efficient demographic history inference from allele frequencies with supervised machine learning.

Linh N Tran Connie K Sun Travis J Struck Mathews Sajan Ryan N Gutenkunst

bioRxiv

February 2024

Inferring past demographic history of natural populations from genomic data is of central concern in many studies across research fields. Previously, our group had developed dadi, a widely used demographic history inference method based on the allele frequency spectrum (AFS) and maximum composite likelihood optimization. However, dadi's optimization procedure can be computationally expensive.

Shifts in Mutation Bias Promote Mutators by Altering the Distribution of Fitness Effects.

Marwa Z Tuffaha Saranya Varakunan David Castellano Ryan N Gutenkunst Lindi M Wahl

Am Nat

October 2023

Recent experimental evidence demonstrates that shifts in mutational biases (for example, increases in transversion frequency) can change the distribution of fitness effects of mutations (DFE). In particular, reducing or reversing a prevailing bias can increase the probability that a de novo mutation is beneficial. It has also been shown that mutator bacteria are more likely to emerge if the beneficial mutations they generate have a larger effect size than observed in the wild type.

dadi-cli: Automated and distributed population genetic model inference from allele frequency spectra.

Xin Huang Travis J Struck Sean W Davey Ryan N Gutenkunst

bioRxiv

June 2023

Summary: dadi is a popular software package for inferring models of demographic history and natural selection from population genomic data. But using dadi requires Python scripting and manual parallelization of optimization jobs. We developed dadi-cli to simplify dadi usage and also enable straightforward distributed computing.

Expanding the stdpopsim species catalog, and lessons learned for realistic genome simulations.

M Elise Lauterbur Maria Izabel A Cavassim Ariella L Gladstein Graham Gower Nathaniel S Pope Ryan N Gutenkunst

Elife

June 2023

Simulation is a key tool in population genetics for both methods development and empirical research, but producing simulations that recapitulate the main features of genomic datasets remains a major obstacle. Today, more realistic simulations are possible thanks to large increases in the quantity and quality of available genetic data, and the sophistication of inference and simulation software. However, implementing these simulations still requires substantial time and specialized knowledge.

Demographic history inference and the polyploid continuum.

Paul D Blischak Mathews Sajan Michael S Barker Ryan N Gutenkunst

Genetics

August 2023

Polyploidy is an important generator of evolutionary novelty across diverse groups in the Tree of Life, including many crops. However, the impact of whole-genome duplication depends on the mode of formation: doubling within a single lineage (autopolyploidy) versus doubling after hybridization between two different lineages (allopolyploidy). Researchers have historically treated these two scenarios as completely separate cases based on patterns of chromosome pairing, but these cases represent ideals on a continuum of chromosomal interactions among duplicated genomes.

Demes: a standard format for demographic models.

Graham Gower Aaron P Ragsdale Gertjan Bisschop Ryan N Gutenkunst Matthew Hartfield

Genetics

November 2022

Understanding the demographic history of populations is a key goal in population genetics, and with improving methods and data, ever more complex models are being proposed and tested. Demographic models of current interest typically consist of a set of discrete populations, their sizes and growth rates, and continuous and pulse migrations between those populations over a number of epochs, which can require dozens of parameters to fully describe. There is currently no standard format to define such models, significantly hampering progress in the field.

Author Correction: Comparative and demographic analysis of orang-utan genomes.

Devin P Locke LaDeana W Hillier Wesley C Warren Kim C Worley Lynne V Nazareth Ryan N Gutenkunst

Nature

August 2022

Genomic landscape of lymphatic malformations: a case series and response to the PI3Kα inhibitor alpelisib in an N-of-1 clinical trial.

Montaser F Shaheen Julie Y Tse Ethan S Sokol Margaret Masterson Pranshu Bansal Ryan N Gutenkunst

Elife

July 2022

Background: Lymphatic malformations (LMs) often pose treatment challenges due to a large size or a critical location that could lead to disfigurement, and there are no standardized treatment approaches for either refractory or unresectable cases.

Methods: We examined the genomic landscape of a patient cohort of LMs (n = 30 cases) that underwent comprehensive genomic profiling using a large-panel next-generation sequencing assay. Immunohistochemical analyses were completed in parallel.

On the prospect of achieving accurate joint estimation of selection with population history.

Parul Johri Adam Eyre-Walker Ryan N Gutenkunst Kirk E Lohmueller Jeffrey D Jensen

Genome Biol Evol

July 2022

As both natural selection and population history can affect genome-wide patterns of variation, disentangling the contributions of each has remained a major challenge in population genetics. We here discuss historical and recent progress towards this goal, highlighting theoretical and computational challenges that remain to be addressed as well as inherent difficulties in dealing with model complexity and model violations, and offer thoughts on potentially fruitful next steps.

The genomic origins of the world's first farmers.

Nina Marchi Laura Winkelbach Ilektra Schulz Maxime Brami Zuzana Hofmanová Ryan N Gutenkunst

Cell

May 2022

The precise genetic origins of the first Neolithic farming populations in Europe and Southwest Asia, as well as the processes and the timing of their differentiation, remain largely unknown. Demogenomic modeling of high-quality ancient genomes reveals that the early farmers of Anatolia and Europe emerged from a multiphase mixing of a Southwest Asian population with a strongly bottlenecked western hunter-gatherer population after the last glacial maximum. Moreover, the ancestors of the first farmers of Europe and Anatolia went through a period of extreme genetic drift during their westward range expansion, contributing highly to their genetic distinctiveness.

Deep connections: Divergence histories with gene flow in mesophotic Agaricia corals.

Katharine E Prata Cynthia Riginos Ryan N Gutenkunst Kelly R W Latijnhouwers Juan A Sánchez

Mol Ecol

May 2022

Largely understudied, mesophotic coral ecosystems lie below shallow reefs (at >30 m depth) and comprise ecologically distinct communities. Brooding reproductive modes appear to predominate among mesophotic-specialist corals and may limit genetic connectivity among populations. Using reduced representation genomic sequencing, we assessed spatial population genetic structure at 50 m depth in an ecologically important mesophotic-specialist species Agaricia grahamae, among locations in the Southern Caribbean.

Inferring Genome-Wide Correlations of Mutation Fitness Effects between Populations.

Xin Huang Alyssa Lyn Fortier Alec J Coffman Travis J Struck Megan N Irby Ryan N Gutenkunst

Mol Biol Evol

September 2021

The effect of a mutation on fitness may differ between populations depending on environmental and genetic context, but little is known about the factors that underlie such differences. To quantify genome-wide correlations in mutation fitness effects, we developed a novel concept called a joint distribution of fitness effects (DFE) between populations. We then proposed a new statistic w to measure the DFE correlation between populations.

Chromosome-scale inference of hybrid speciation and admixture with convolutional neural networks.

Paul D Blischak Michael S Barker Ryan N Gutenkunst

Mol Ecol Resour

November 2021

Inferring the frequency and mode of hybridization among closely related organisms is an important step for understanding the process of speciation and can help to uncover reticulated patterns of phylogeny more generally. Phylogenomic methods to test for the presence of hybridization come in many varieties and typically operate by leveraging expected patterns of genealogical discordance in the absence of hybridization. An important assumption made by these tests is that the data (genes or SNPs) are independent given the species tree.

dadi.CUDA: Accelerating Population Genetics Inference with Graphics Processing Units.

Ryan N Gutenkunst

Mol Biol Evol

May 2021

dadi is a popular but computationally intensive program for inferring models of demographic history and natural selection from population genetic data. I show that running dadi on a Graphics Processing Unit can dramatically speed computation compared with the CPU implementation, with minimal user burden. Motivated by this speed increase, I also extended dadi to four- and five-population models.

A community-maintained standard library of population genetic models.

Jeffrey R Adrion Christopher B Cole Noah Dukler Jared G Galloway Ariella L Gladstein Ryan N Gutenkunst

Elife

June 2020

The explosion in population genomic data demands ever more complex modes of analysis, and increasingly, these analyses depend on sophisticated simulations. Recent advances in population genetic simulation have made it possible to simulate large and complex models, but specifying such models for a particular simulation engine remains a difficult and error-prone task. Computational genetics researchers currently re-implement simulation models independently, leading to inconsistency and duplication of effort.

Inferring the Demographic History of Inbred Species from Genome-Wide SNP Frequency Data.

Paul D Blischak Michael S Barker Ryan N Gutenkunst

Mol Biol Evol

July 2020

Demographic inference using the site frequency spectrum (SFS) is a common way to understand historical events affecting genetic variation. However, most methods for estimating demography from the SFS assume random mating within populations, precluding these types of analyses in inbred populations. To address this issue, we developed a model for the expected SFS that includes inbreeding by parameterizing individual genotypes using beta-binomial distributions.

BATCAVE: calling somatic mutations with a tumor- and site-specific prior.

Brian K Mannakee Ryan N Gutenkunst

NAR Genom Bioinform

March 2020

Detecting somatic mutations within tumors is key to understanding treatment resistance, patient prognosis and tumor evolution. Mutations at low allelic frequency, those present in only a small portion of tumor cells, are particularly difficult to detect. Many algorithms have been developed to detect such mutations, but none models a key aspect of tumor biology.

The impact of genome-wide association studies on biomedical research publications.

Travis J Struck Brian K Mannakee Ryan N Gutenkunst

Hum Genomics

August 2018

The past decade has seen major investment in genome-wide association studies (GWAS). Among the many goals of GWAS, a major one is to identify and motivate research on novel genes involved in complex human disease. To assess whether this goal is being met, we quantified the effect of GWAS on the overall distribution of biomedical research publications and on the subsequent publication history of genes newly associated with complex disease.

Sensitive and specific post-call filtering of genetic variants in xenograft and primary tumors.

Brian K Mannakee Uthra Balaji Agnieszka K Witkiewicz Ryan N Gutenkunst Erik S Knudsen

Bioinformatics

May 2018

Motivation: Tumor genome sequencing offers great promise for guiding research and therapy, but spurious variant calls can arise from multiple sources. Mouse contamination can generate many spurious calls when sequencing patient-derived xenografts. Paralogous genome sequences can also generate spurious calls when sequencing any tumor.

Exome Sequencing Provides Evidence of Polygenic Adaptation to a Fat-Rich Animal Diet in Indigenous Siberian Populations.

PingHsun Hsieh Brian Hallmark Joseph Watkins Tatiana M Karafet Ludmila P Osipova Ryan N Gutenkunst

Mol Biol Evol

November 2017

Siberia is one of the coldest environments on Earth and has great seasonal temperature variation. Long-term settlement in northern Siberia undoubtedly required biological adaptation to severe cold stress, dramatic variation in photoperiod, and limited food resources. In addition, recent archeological studies show that humans first occupied Siberia at least 45,000 years ago; yet our understanding of the demographic history of modern indigenous Siberians remains incomplete.

Inferring Demographic History Using Two-Locus Statistics.

Aaron P Ragsdale Ryan N Gutenkunst

Genetics

June 2017

Population demographic history may be learned from contemporary genetic variation data. Methods based on aggregating the statistics of many single loci into an allele frequency spectrum (AFS) have proven powerful, but such methods ignore potentially informative patterns of linkage disequilibrium (LD) between neighboring loci. To leverage such patterns, we developed a composite-likelihood framework for inferring demographic history from aggregated statistics of pairs of loci.

Genomic inferences of domestication events are corroborated by written records in Brassica rapa.

Xinshuai Qi Hong An Aaron P Ragsdale Tara E Hall Ryan N Gutenkunst

Mol Ecol

July 2017

Demographic modelling is often used with population genomic data to infer the relationships and ages among populations. However, relatively few analyses are able to validate these inferences with independent data. Here, we leverage written records that describe distinct Brassica rapa crops to corroborate demographic models of domestication.
