BMC Bioinformatics
September 2021
Background: We present ARCHes, a fast and accurate haplotype-based approach for inferring an individual's ancestry composition. Our approach works by modeling haplotype diversity from a large, admixed cohort of hundreds of thousands, then annotating those models with population information from reference panels of known ancestry.
Results: The running time of ARCHes does not depend on the size of a reference panel because training and testing are separate processes, and the inferred population-annotated haplotype models can be written to disk and reused to label large test sets in parallel (in our experiments, it averages less than one minute to assign ancestry from 32 populations using 10 CPU).
Drosophila teissieri and D. yakuba diverged approximately 3 mya and are thought to share a large, ancestral, African range [1-3]. These species now co-occur in parts of continental Africa and in west Africa on the island of Bioko [1, 4].
View Article and Find Full Text PDFHybrid zones formed between recently diverged populations offer an opportunity to study the mechanisms underlying reproductive isolation and the process of speciation. Here, we use a combination of analytical theory and explicit forward simulations to describe how selection against hybrid genotypes impacts patterns of introgression across genomic and geographic space. By describing how lineages move across the hybrid zone, in a model without coalescence, we add to modern understanding of how clines form and how parental haplotypes are broken up during introgression.
View Article and Find Full Text PDFGeographic patterns of genetic differentiation have long been used to understand population history and to learn about the biological mechanisms of adaptation. Here we present an examination of genomic patterns of differentiation between northern and southern populations of Australian and North American Drosophila simulans, with an emphasis on characterizing signals of parallel differentiation. We report on the genomic scale of differentiation and functional enrichment of outlier SNPs.
View Article and Find Full Text PDFRecent genomic studies have highlighted the important role of admixture in shaping genome-wide patterns of diversity. Past admixture leaves a population genomic signature of linkage disequilibrium (LD), reflecting the mixing of parental chromosomes by segregation and recombination. These patterns of LD can be used to infer the timing of admixture, but the results of inference can depend strongly on the assumed demographic model.
View Article and Find Full Text PDF