Background: The recent advent of high-throughput SNP genotyping technologies has opened new avenues of research for population genetics. In particular, a growing interest in the identification of footprints of selection, based on genome scans for adaptive differentiation, has emerged.

Methodology/principal Findings: The purpose of this study is to develop an efficient model-based approach to perform bayesian exploratory analyses for adaptive differentiation in very large SNP data sets. The basic idea is to start with a very simple model for neutral loci that is easy to implement under a bayesian framework and to identify selected loci as outliers via Posterior Predictive P-values (PPP-values). Applications of this strategy are considered using two different statistical models. The first one was initially interpreted in the context of populations evolving respectively under pure genetic drift from a common ancestral population while the second one relies on populations under migration-drift equilibrium. Robustness and power of the two resulting bayesian model-based approaches to detect SNP under selection are further evaluated through extensive simulations. An application to a cattle data set is also provided.

Conclusions/significance: The procedure described turns out to be much faster than former bayesian approaches and also reasonably efficient especially to detect loci under positive selection.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2914027PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0011913PLOS

Publication Analysis

Top Keywords

data sets
8
adaptive differentiation
8
bayesian
5
bayesian outlier
4
outlier criterion
4
criterion detect
4
detect snps
4
selection
4
snps selection
4
selection large
4

Similar Publications

Parkinson's disease (PD), a degenerative disorder of the central nervous system, is commonly diagnosed using functional medical imaging techniques such as single-photon emission computed tomography (SPECT). In this study, we utilized two SPECT data sets (n = 634 and n = 202) from different hospitals to develop a model capable of accurately predicting PD stages, a multiclass classification task. We used the entire three-dimensional (3D) brain images as input and experimented with various model architectures.

View Article and Find Full Text PDF

The need for effective early detection and optimal therapy monitoring of cardiovascular diseases as the leading cause of death has led to an adaptation of the guidelines with a focus on cardiac computed tomography (CCTA) in patients with a low to intermediate risk of coronary heart disease (CHD). In particular, the introduction of photon-counting computed tomography (PCCT) in CT diagnostics promises significant advances through higher temporal and spatial resolution, and also enables advanced texture analysis, known as radiomics analysis. Originally developed in oncological imaging, radiomics analysis is increasingly being used in cardiac imaging and research.

View Article and Find Full Text PDF

Identification of high-risk oral leukoplakia (OLK) using combined Raman spectroscopic analysis of brush biopsy and saliva samples: A proof of concept study.

Spectrochim Acta A Mol Biomol Spectrosc

January 2025

Centre for Radiation and Environmental Science, FOCAS Research Institute, Technological University Dublin, City Campus, Dublin, Ireland; School of Physics, Clinical and Optometric Sciences, Technological University Dublin, City Campus, Dublin, Ireland.

The gold standard method of diagnosis of oral leukoplakia (OLK) is a tissue biopsy followed by histological examination. Raman spectroscopic studies of cytological brush biopsy and saliva samples have previously been shown to differentiate low (no and mild dysplasia) and high risk (moderate and severe dysplasia) OLKs, discriminant models of cellular samples achieving higher specificity, whereas those based on saliva samples achieved higher sensitivity. The current study combines the spectral data sets of cell and saliva samples in an attempt to improve the overall efficiency of the discriminating models.

View Article and Find Full Text PDF

Efficient and accurate determination of the degree of substitution of cellulose acetate using ATR-FTIR spectroscopy and machine learning.

Sci Rep

January 2025

Institute of Biological and Chemical Systems - Functional Molecular Systems (IBCS-FMS), Karlsruhe Institute of Technology (KIT), Karlsruhe, 76344, Germany.

Multiple linear regression models were trained to predict the degree of substitution (DS) of cellulose acetate based on raw infrared (IR) spectroscopic data. A repeated k-fold cross validation ensured unbiased assessment of model accuracy. Using the DS obtained from H NMR data as reference, the machine learning model achieved a mean absolute error (MAE) of 0.

View Article and Find Full Text PDF

The application of supervised models to clinical screening tasks is challenging due to the need for annotated data for each considered pathology. Unsupervised Anomaly Detection (UAD) is an alternative approach that aims to identify any anomaly as an outlier from a healthy training distribution. A prevalent strategy for UAD in brain MRI involves using generative models to learn the reconstruction of healthy brain anatomy for a given input image.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!