Correcting for ascertainment bias in the inference of population structure.

Bioinformatics

Centre for Ecological and Evolutionary Synthesis, Department of Biology, University of Oslo, P.O. Box 1066, Blindern 0316, Oslo, Norway.

Published: February 2009

Background: The ascertainment process of molecular markers amounts to disregard loci carrying alleles with low frequencies. This can result in strong biases in inferences under population genetics models if not properly taken into account by the inference algorithm. Attempting to model this censoring process in view of making inference of population structure (i.e.identifying clusters of individuals) brings up challenging numerical difficulties.

Method: These difficulties are related to the presence of intractable normalizing constants in Metropolis-Hastings acceptance ratios. This can be solved via an Markov chain Monte Carlo (MCMC) algorithm known as single variable exchange algorithm (SVEA).

Result: We show how this general solution can be implemented for a class of clustering models of broad interest in population genetics that includes the models underlying the computer programs STRUCTURE, GENELAND and GESTE. We also implement the method proposed for a simple example and show that it allows us to reduce the bias substantially.

Availability: Further details and a computer program implementing the method are available from http://folk.uio.no/gillesg/AscB/.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btn665DOI Listing

Publication Analysis

Top Keywords

inference population
8
population structure
8
population genetics
8
correcting ascertainment
4
ascertainment bias
4
bias inference
4
population
4
structure background
4
background ascertainment
4
ascertainment process
4

Similar Publications

Prader-Willi syndrome (PWS) is a genetic disorder associated with baseline respiratory impairment caused by multiple contributing etiologies. While this may be expected to increase the risk of severe COVID-19 infections in PWS patients, survey studies have suggested paradoxically low disease severity. To better characterize the course of COVID-19 infection in patients with PWS, this study analyses the outcomes of hospitalizations for COVID-19 among patients with and without PWS.

View Article and Find Full Text PDF

Unveiling the Genetic Diversity and Demographic History of in Sierra Leone Using Genotyping-By-Sequencing.

Plants (Basel)

December 2024

Sustainable Perennial Crops Laboratory, United States Department of Agriculture, Agriculture Research Service, Beltsville, MD 2005, USA.

is a rare Coffea species boasting a flavor profile comparable to Arabica coffee () and has a good adaptability to lowland tropical climates. This species faces increasing threats from climate change, deforestation, and habitat fragmentation in its West African homeland. Using 1037 novel SNP markers derived from Genotyping-by-Sequencing (GBS), we revealed the presence of three distinct natural populations (mean Fst = 0.

View Article and Find Full Text PDF

Background: It remains uncertain whether the utilization of methylprednisolone during surgery effectively mitigates the occurrence of adverse outcomes. To examine the association between perioperative methylprednisolone administration and postoperative pleural effusion and pneumonia in older patients with non-small cell lung cancer.

Methods: A retrospective cohort study included non-small cell lung cancer patients aged 65 years or older undergoing thoracic surgery between January 2012 and December 2019 in China.

View Article and Find Full Text PDF

High-recombining genomic regions affect demography inference based on ancestral recombination graphs.

Genetics

January 2025

Max Planck Research Group Behavioural Genomics, Max Planck Institute for Evolutionary Biology, August-Thienemann-Straße 2, 24306 Plön, Germany.

Multiple methods of demography inference are based on the ancestral recombination graph. This powerful approach uses observed mutations to model local genealogies changing along chromosomes by historical recombination events. However, inference of underlying genealogies is difficult in regions with high recombination rate relative to mutation rate due to the lack of mutations representing genealogies.

View Article and Find Full Text PDF

Objective: To evaluate the relationship between infarct pattern, inferred stroke mechanism and risk of recurrence in patients with ischaemic stroke. The question is clinically relevant to optimise secondary stroke prevention investigations and treatment.

Design: We conducted a retrospective analysis of the dabigatran treatment of acute stroke II (DATAS II) trial (ClinicalTrials.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!