Analysis and application of European genetic substructure using 300 K SNP information.

Chao Tian Robert M Plenge Michael Ransom Annette Lee Pablo Villoslada Carlo Selmi Lars Klareskog Ann E Pulver Lihong Qi Peter K Gregersen Michael F Seldin

PLoS Genet

Rowe Program in Human Genetics, University of California Davis, Davis, California, United States of America.

Published: January 2008

European population genetic substructure was examined in a diverse set of >1,000 individuals of European descent, each genotyped with >300 K SNPs. Both STRUCTURE and principal component analyses (PCA) showed the largest division/principal component (PC) differentiated northern from southern European ancestry. A second PC further separated Italian, Spanish, and Greek individuals from those of Ashkenazi Jewish ancestry as well as distinguishing among northern European populations. In separate analyses of northern European participants other substructure relationships were discerned showing a west to east gradient. Application of this substructure information was critical in examining a real dataset in whole genome association (WGA) analyses for rheumatoid arthritis in European Americans to reduce false positive signals. In addition, two sets of European substructure ancestry informative markers (ESAIMs) were identified that provide substantial substructure information. The results provide further insight into European population genetic substructure and show that this information can be used for improving error rates in association testing of candidate genes and in replication studies of WGA scans.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2211544	PMC
http://dx.doi.org/10.1371/journal.pgen.0040004	DOI Listing

Publication Analysis

Top Keywords

genetic substructure

european

european population

population genetic

northern european

substructure

analysis application

application european

european genetic

substructure 300

Similar Publications

Characterizing substructure via mixture modeling in large-scale genetic summary statistics.

Am J Hum Genet

January 2025

Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA; Human Medical Genetics and Genomics Program, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA; Mathematical and Statistical Sciences, University of Colorado Denver, Denver, CO 80204, USA; Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA. Electronic address:

Hayley R Stoneman Adelle M Price Nikole Scribner Trout Riley Lamont Souha Tifour

Genetic summary data are broadly accessible and highly useful, including for risk prediction, causal inference, fine mapping, and incorporation of external controls. However, collapsing individual-level data into summary data, such as allele frequencies, masks intra- and inter-sample heterogeneity, leading to confounding, reduced power, and bias. Ultimately, unaccounted-for substructure limits summary data usability, especially for understudied or admixed populations.

View Article and Find Full Text PDF

Similar Publications

Control of striatal circuit development by the chromatin regulator .

Sci Adv

January 2025

Department of Neuroscience, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.

Kyuhyun Choi Nathan T Henderson Emily R Feierman Sean Louzon Jamie Galanaugh

The pathophysiology of neurodevelopmental disorders involves vulnerable neural populations, including striatal circuitry, and convergent molecular nodes, including chromatin regulation and synapse function. Despite this, how epigenetic regulation regulates striatal development is understudied. Recurrent de novo mutations in are associated with intellectual disability and autism.

View Article and Find Full Text PDF

Similar Publications

Fragmenstein: predicting protein-ligand structures of compounds derived from known crystallographic fragment hits using a strict conserved-binding-based methodology.

J Cheminform

January 2025

Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford, UK.

Matteo P Ferla Rubén Sánchez-García Rachael E Skyner Stefan Gahbauer Jenny C Taylor

Current strategies centred on either merging or linking initial hits from fragment-based drug design (FBDD) crystallographic screens generally do not fully leaverage 3D structural information. We show that an algorithmic approach (Fragmenstein) that 'stitches' the ligand atoms from this structural information together can provide more accurate and reliable predictions for protein-ligand complex conformation than general methods such as pharmacophore-constrained docking. This approach works under the assumption of conserved binding: when a larger molecule is designed containing the initial fragment hit, the common substructure between the two will adopt the same binding mode.

View Article and Find Full Text PDF

Similar Publications

generation of dual-target compounds using artificial intelligence.

iScience

January 2025

Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan.

Kasumi Yasuda Francois Berenger Kazuma Amaike Ayaka Ueda Tomoya Nakagomi

Drugs that interact with multiple therapeutic targets are potential high-value products in polypharmacology-based drug discovery, but the rational design remains a formidable challenge. Here, we present artificial intelligence (AI)-based methods to design the chemical structures of compounds that interact with multiple therapeutic target proteins. The molecular structure generation is performed by a fragment-based approach using a genetic algorithm with chemical substructures and a deep learning approach using reinforcement learning with stochastic policy gradients in the framework of generative adversarial networks.

View Article and Find Full Text PDF

Similar Publications

A machine learning approach for estimating Eastern Asian origins from massive screening of Y chromosomal short tandem repeats polymorphisms.

Int J Legal Med

January 2025

Institute of Forensic and Anthropological Science, Seoul National University Medical Research Center, 103 Daehak-ro, Jongno-gu, Seoul, 03080, Republic of Korea.

Haeun You Soong Deok Lee Sohee Cho

Inferring the ancestral origin of DNA evidence recovered from crime scenes is crucial in forensic investigations, especially in the absence of a direct suspect match. Ancestry informative markers (AIMs) have been widely researched and commercially developed into panels targeting multiple continental regions. However, existing forensic ancestry inference panels typically group East Asian individuals into a homogenous category without further differentiation.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!