Integrating EMR-linked and in vivo functional genetic data to identify new genotype-phenotype associations.

PLoS One

Department of Medicine, Vanderbilt University, Nashville, Tennessee, United States of America; Biomedical Informatics, Vanderbilt University, Nashville, Tennessee, United States of America.

Published: February 2015

The coupling of electronic medical records (EMR) with genetic data has created the potential for implementing reverse genetic approaches in humans, whereby the function of a gene is inferred from the shared pattern of morbidity among homozygotes of a genetic variant. We explored the feasibility of this approach to identify phenotypes associated with low frequency variants using Vanderbilt's EMR-based BioVU resource. We analyzed 1,658 low frequency non-synonymous SNPs (nsSNPs) with a minor allele frequency (MAF)<10% collected on 8,546 subjects. For each nsSNP, we identified diagnoses shared by at least 2 minor allele homozygotes and with an association p<0.05. The diagnoses were reviewed by a clinician to ascertain whether they may share a common mechanistic basis. While a number of biologically compelling clinical patterns of association were observed, the frequency of these associations was identical to that observed using genotype-permuted data sets, indicating that the associations were likely due to chance. To refine our analysis associations, we then restricted the analysis to 711 nsSNPs in genes with phenotypes in the On-line Mendelian Inheritance in Man (OMIM) or knock-out mouse phenotype databases. An initial comparison of the EMR diagnoses to the known in vivo functions of the gene identified 25 candidate nsSNPs, 19 of which had significant genotype-phenotype associations when tested using matched controls. Twleve of the 19 nsSNPs associations were confirmed by a detailed record review. Four of 12 nsSNP-phenotype associations were successfully replicated in an independent data set: thrombosis (F5,rs6031), seizures/convulsions (GPR98,rs13157270), macular degeneration (CNGB3,rs3735972), and GI bleeding (HGFAC,rs16844401). These analyses demonstrate the feasibility and challenges of using reverse genetics approaches to identify novel gene-phenotype associations in human subjects using low frequency variants. As increasing amounts of rare variant data are generated from modern genotyping and sequence platforms, model organism data may be an important tool to enable discovery.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4065041PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0100322PLOS

Publication Analysis

Top Keywords

genetic data
8
low frequency
8
integrating emr-linked
4
emr-linked vivo
4
vivo functional
4
genetic
4
functional genetic
4
data identify
4
identify genotype-phenotype
4
genotype-phenotype associations
4

Similar Publications

[PLSR model based on near-infrared spectroscopy for the detection of wood fiber anatomy of ].

Ying Yong Sheng Tai Xue Bao

October 2024

Research Institute of Subtropical Forestry, Chinese Academy of Forestry/Zhejiang Key Laboratory of Forest Genetics and Bree-ding, Hangzhou 311400, China.

To rapidly acquire fiber phenotypic data for wood quality assessment, we used a portable NIR spectro-meter to collect spectral data in 100 individuals of at 18-year-old of 20 different provenances, and simultaneously collected wood cores. Wood basic density and the anatomical structure of wood fiber were measured. The standard normal variate (SNV), orthogonal signal correction (OSC), and multiplicative scatter correction (MSC) methods were used for spectral preprocessing, the competitive adaptive reweighted sampling (CARS) method were used for wavelength selection, and the partial least squares regression (PLSR) model were established.

View Article and Find Full Text PDF

Objective: To explore the association between smoking, genetic susceptibility and early menopause (EM) and clarify the potential mechanisms underlying this relationship.

Design: An observational and Transcriptome-wide association analysis (TWAS) study.

Setting: UK Biobank and public summary statistics.

View Article and Find Full Text PDF

Ankylosing Spondylitis (AS) and Systemic Sclerosis (SSc) are both autoimmune diseases, albeit with distinct anatomical targets. AS primarily affects the spine and sacroiliac joints, triggering inflammation and eventual fusion of the vertebrae. SSc predominantly impacts the skin and connective tissues, leading to skin fibrosis, thickening, and potential damage to vital organs such as the lungs, heart, and kidneys.

View Article and Find Full Text PDF

Objective: Inflammatory factors play a crucial role in the onset and progression of heart failure. To further explore the causal relationship between inflammatory factors and heart failure, we employed bidirectional Mendelian randomization analysis to investigate the causal links between 91 inflammatory cytokines and heart failure.

Methods: We conducted our study using the bidirectional Mendelian randomization approach.

View Article and Find Full Text PDF

Introduction: Back pain (BP) is a complex heritable trait with an estimated heritability of 40% to 60%. Less than half of this can be explained by known genetic variants identified in genome-wide association studies.

Objectives: We applied a powerful multi-trait and gene-based approach to association analysis of BP to identify novel genes associated with BP.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!