Sampling schemes and drift can bias admixture proportions inferred by structure.

Mol Ecol Resour

CBGP, INRAE, CIRAD, IRD, Montpellier SupAgro, University of Montpellier, Montpellier, France.

Published: November 2020

The interbreeding of individuals coming from genetically differentiated but incompletely isolated populations can lead to the formation of admixed populations, having important implications in ecology and evolution. In this simulation study, we evaluate how individual admixture proportions estimated by the software structure are quantitatively affected by different factors. Using various scenarios of admixture between two diverging populations, we found that unbalanced sampling from parental populations may seriously bias the inferred admixture proportions; moreover, proportionally large samples from the admixed population can also decrease the accuracy and precision of the inferences. As expected, weak differentiation between parental populations and drift after the admixture event strongly increase the biases caused by uneven sampling. We also show that admixture proportions are generally more biased when parental populations unequally contributed to the admixed population. Finally, with few exceptions, using a large number of markers reduces those biases, but using alternative priors for individual ancestry or the uncorrelated allele model only marginally affect the inference of admixture in most situations. We conclude that unbalanced sampling may cause important biases in the admixture proportions estimated by structure, especially when a small number of markers are used, and those biases can be worsened by the effect of drift and unequal genetic contribution of parental populations. Empirical studies should thus be careful with their sampling design and consider historical characteristics when using this software to estimate the ancestry of individuals from admixed populations.

Download full-text PDF

Source
http://dx.doi.org/10.1111/1755-0998.13234DOI Listing

Publication Analysis

Top Keywords

admixture proportions
20
parental populations
16
admixture
8
populations
8
admixed populations
8
proportions estimated
8
unbalanced sampling
8
admixed population
8
number markers
8
sampling
5

Similar Publications

Objectives: Past studies have shown the efficacy of spinal targeted drug delivery (TDD) in pain relief, reduction in opioid use, and cost-effectiveness in long-term management of complex chronic pain. We conducted a survey to determine treatment variables associated with patient satisfaction.

Materials And Methods: Patients in a single pain clinic who were implanted with Medtronic pain pumps to relieve intractable pain were identified from our electronic health record.

View Article and Find Full Text PDF

Many known and unknown historical events have remained below detection thresholds of genetic studies because subtle ancestry changes are challenging to reconstruct. Methods based on shared haplotypes and rare variants improve power but are not explicitly temporal and have not been possible to adopt in unbiased ancestry models. Here we develop Twigstats, an approach of time-stratified ancestry analysis that can improve statistical power by an order of magnitude by focusing on coalescences in recent times, while remaining unbiased by population-specific drift.

View Article and Find Full Text PDF

Estimation of genetic admixture proportions via haplotypes.

Comput Struct Biotechnol J

December 2024

Department of Computational Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA.

Estimation of ancestral admixture is essential for creating personal genealogies, studying human history, and conducting genome-wide association studies (GWAS). The following three primary methods exist for estimating admixture coefficients. The frequentist approach directly maximizes the binomial loglikelihood.

View Article and Find Full Text PDF

In this computer note I introduce software, PopCluster, that implements a new likelihood method for unsupervised population structure analysis from marker data. To infer a coarse population structure, it assumes the mixture model and adopts a simulated annealing algorithm to make a maximum likelihood clustering analysis, partitioning the sampled individuals into a predefined number of clusters. To deduce a fine population structure, it further assumes the admixture model and employs an expectation maximisation algorithm to estimate individual admixture proportions.

View Article and Find Full Text PDF
Article Synopsis
  • The study investigates the genetic variance (variome) of the Turkish population, focusing on how admixture and consanguinity influence their genomic makeup related to diseases.
  • Exome sequencing data from 773 affected individuals and 643 unaffected relatives revealed that Turkish genomes are closely related to Europeans, with two main subpopulations showing different levels of genetic mixing.
  • The analysis identified that higher inbreeding coefficients and longer runs of homozygosity in affected individuals are linked to rare harmful genetic variants, emphasizing their role in hereditary diseases within this population.
View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!