Publications by Sheila Gaynor | LitMetric

Publications by authors named "Sheila Gaynor"

Page 1 of 2

Joint testing of rare variant burden scores using non-negative least squares.

Andrey Ziyatdinov Joelle Mbatchou Anthony Marcketta Joshua Backman Sheila Gaynor

Am J Hum Genet

October 2024

Gene-based burden tests are a popular and powerful approach for analysis of exome-wide association studies. These approaches combine sets of variants within a gene into a single burden score that is then tested for association. Typically, a range of burden scores are calculated and tested across a range of annotation classes and frequency bins.

View Article and Find Full Text PDF

Yield of genetic association signals from genomes, exomes and imputation in the UK Biobank.

Sheila M Gaynor Tyler Joseph Xiaodong Bai Yuxin Zou Boris Boutkov

Nat Genet

November 2024

Whole-genome sequencing (WGS), whole-exome sequencing (WES) and array genotyping with imputation (IMP) are common strategies for assessing genetic variation and its association with medically relevant phenotypes. To date, there has been no systematic empirical assessment of the yield of these approaches when applied to hundreds of thousands of samples to enable the discovery of complex trait genetic signals. Using data for 100 complex traits from 149,195 individuals in the UK Biobank, we systematically compare the relative yield of these strategies in genetic association studies.

View Article and Find Full Text PDF

Whole genome sequencing based analysis of inflammation biomarkers in the Trans-Omics for Precision Medicine (TOPMed) consortium.

Min-Zhi Jiang Sheila M Gaynor Xihao Li Eric Van Buren Adrienne Stilp

Hum Mol Genet

August 2024

Article Synopsis

Inflammation biomarkers offer crucial insights into the inflammatory processes linked to various diseases, and their sequencing can help reveal the genetic makeup of these traits.
A study analyzed 21 inflammation biomarkers from around 38,465 individuals, discovering 22 significant associations across 6 inflammatory traits after considering existing findings.
The research combined single-variant and rare variant analyses, identifying additional significant associations and highlighting the complexity and diversity of genetic influences on inflammation traits across different ancestries.

View Article and Find Full Text PDF

Author Correction: Genotyping, sequencing and analysis of 140,000 adults from Mexico City.

Andrey Ziyatdinov Jason Torres Jesús Alegre-Díaz Joshua Backman Joelle Mbatchou Sheila M Gaynor

Nature

February 2024

View Article and Find Full Text PDF

Genotyping, sequencing and analysis of 140,000 adults from Mexico City.

Andrey Ziyatdinov Jason Torres Jesús Alegre-Díaz Joshua Backman Joelle Mbatchou Sheila M Gaynor

Nature

October 2023

Article Synopsis

The Mexico City Prospective Study is a large-scale research initiative involving over 150,000 adults from urban areas in Mexico City, aimed at understanding genetic diversity and ancestry.
The study reveals a mix of Indigenous American, European, and African ancestries among participants, highlighting significant genetic differences and a unique genetic landscape within the Indigenous Mexican population.
Researchers created a valuable reference panel for genetic research, improving the accuracy of studying genetic variants in populations with high Indigenous ancestry, and providing essential resources for future genetic studies in both Mexico and the US.

View Article and Find Full Text PDF

Whole Genome Sequencing Based Analysis of Inflammation Biomarkers in the Trans-Omics for Precision Medicine (TOPMed) Consortium.

Min-Zhi Jiang Sheila M Gaynor Xihao Li Eric Van Buren Adrienne Stilp

bioRxiv

September 2023

Article Synopsis

Inflammation biomarkers play a crucial role in understanding diseases and can reveal insights into genetic traits through whole-genome sequencing studies.
A comprehensive analysis of 21 inflammation biomarkers in over 38,000 individuals found 22 significant single-variant associations across six different inflammatory traits, indicating the complexity and diversity of these biomarkers.
The study also included rare variant analyses, identifying 19 additional significant associations, which highlights the importance of using multiple analytical approaches to enhance the understanding of inflammation-related traits across different ancestries.

View Article and Find Full Text PDF

Investigating Gene-Diet Interactions Impacting the Association Between Macronutrient Intake and Glycemic Traits.

Kenneth E Westerman Maura E Walker Sheila M Gaynor Jennifer Wessel Daniel DiCorpo

Diabetes

May 2023

Unlabelled: Few studies have demonstrated reproducible gene-diet interactions (GDIs) impacting metabolic disease risk factors, likely due in part to measurement error in dietary intake estimation and insufficient capture of rare genetic variation. We aimed to identify GDIs across the genetic frequency spectrum impacting the macronutrient-glycemia relationship in genetically and culturally diverse cohorts. We analyzed 33,187 participants free of diabetes from 10 National Heart, Lung, and Blood Institute Trans-Omics for Precision Medicine program cohorts with whole-genome sequencing, self-reported diet, and glycemic trait data.

View Article and Find Full Text PDF

Powerful, scalable and resource-efficient meta-analysis of rare variant associations in large whole genome sequencing studies.

Xihao Li Corbin Quick Hufeng Zhou Sheila M Gaynor Yaowu Liu

Nat Genet

January 2023

Article Synopsis

MetaSTAAR is a new framework designed for analyzing rare genetic variants in large studies, specifically whole genome and whole exome sequencing (WGS/WES).
It effectively manages relatedness and population differences while analyzing various traits, enhancing the ability to detect significant rare variant associations by utilizing functional annotations.
In tests with over 30,000 diverse samples, MetaSTAAR yielded results similar to pooled data analysis and successfully identified significant rare variant associations related to lipid traits.

View Article and Find Full Text PDF

A framework for detecting noncoding rare-variant associations of large-scale whole-genome sequencing studies.

Zilin Li Xihao Li Hufeng Zhou Sheila M Gaynor Margaret Sunitha Selvaraj

Nat Methods

December 2022

Article Synopsis

Large-scale whole-genome sequencing studies allow researchers to examine associations between rare noncoding variants and complex diseases, although current methods struggle with the noncoding genome analysis.
The STAARpipeline framework offers a comprehensive solution for detecting noncoding rare variant associations through various analytical approaches, including gene-centric and non-gene-centric analyses that utilize functional annotations.
The effectiveness of STAARpipeline is demonstrated through its application in identifying significant noncoding RV sets linked to lipid traits in over 21,000 samples, with successful replication in an additional group, and further analysis of other traits.

View Article and Find Full Text PDF

Whole genome sequence association analysis of fasting glucose and fasting insulin levels in diverse cohorts from the NHLBI TOPMed program.

Daniel DiCorpo Sheila M Gaynor Emily M Russell Kenneth E Westerman Laura M Raffield

Commun Biol

July 2022

Article Synopsis

- The study analyzed genetic factors affecting fasting glucose (FG) and fasting insulin (FI) using high-coverage whole genome sequencing from over 23,000 non-diabetic individuals across five different racial and ethnic groups.
- Researchers identified eight significant genetic variants linked to FG or FI in known gene regions, while also suggesting associations with additional regions related to metabolic processes.
- The project compiled functional annotation resources to help understand the implications of these genetic variations and laid the groundwork for future research on glycemic traits.

View Article and Find Full Text PDF

Connectivity in eQTL networks dictates reproducibility and genomic properties.

Sheila M Gaynor Maud Fagny Xihong Lin John Platig John Quackenbush

Cell Rep Methods

May 2022

Expression quantitative trait locus (eQTL) analysis associates SNPs with gene expression; these relationships can be represented as a bipartite network with association strength as "edge weights" between SNPs and genes. However, most eQTL networks use binary edge weights based on thresholded FDR estimates: definitions that influence reproducibility and downstream analyses. We constructed twenty-nine tissue-specific eQTL networks using GTEx data and evaluated a comprehensive set of network specifications based on false discovery rates, test statistics, and p values, focusing on the degree centrality-a metric of an SNP or gene node's potential network influence.

View Article and Find Full Text PDF

STAAR workflow: a cloud-based workflow for scalable and reproducible rare variant analysis.

Sheila M Gaynor Kenneth E Westerman Lea L Ackovic Xihao Li Zilin Li

Bioinformatics

May 2022

Summary: We developed the variant-Set Test for Association using Annotation infoRmation (STAAR) workflow description language (WDL) workflow to facilitate the analysis of rare variants in whole genome sequencing association studies. The open-access STAAR workflow written in the WDL allows a user to perform rare variant testing for both gene-centric and genetic region approaches, enabling genome-wide, candidate and conditional analyses. It incorporates functional annotations into the workflow as introduced in the STAAR method in order to boost the rare variant analysis power.

View Article and Find Full Text PDF

COVID-19 Spread Mapper: a multi-resolution, unified framework and open-source tool.

Andy Shi Sheila M Gaynor Rounak Dey Haoyu Zhang Corbin Quick

Bioinformatics

April 2022

Summary: Amidst the continuing spread of coronavirus disease-19 (COVID-19), real-time data analysis and visualization remain critical the general public to track the pandemic's impact and to inform policy making by officials. Multiple metrics permit the evaluation of the spread, infection and mortality of infectious diseases. For example, numbers of new cases and deaths provide easily interpretable measures of absolute impact within a given population and time frame, while the effective reproduction rate provides an epidemiological measure of the rate of spread.

View Article and Find Full Text PDF

Leveraging a surrogate outcome to improve inference on a partially missing target outcome.

Zachary R McCaw Sheila M Gaynor Ryan Sun Xihong Lin

Biometrics

June 2023

Sample sizes vary substantially across tissues in the Genotype-Tissue Expression (GTEx) project, where considerably fewer samples are available from certain inaccessible tissues, such as the substantia nigra (SSN), than from accessible tissues, such as blood. This severely limits power for identifying tissue-specific expression quantitative trait loci (eQTL) in undersampled tissues. Here we propose Surrogate Phenotype Regression Analysis (Spray) for leveraging information from a correlated surrogate outcome (eg, expression in blood) to improve inference on a partially missing target outcome (eg, expression in SSN).

View Article and Find Full Text PDF

Association of Hormonal Contraceptive Use with Headache and Temporomandibular Pain: The OPPERA Study.

Sheila M Gaynor Roger B Fillingim Denniz A Zolnoun Joel D Greenspan William Maixner

J Oral Facial Pain Headache

June 2021

Aims: To determine the relationship between hormonal contraceptive (HC) use and painful symptoms, particularly those associated with headache and painful temporomandibular disorders (TMD).

Methods: Data from the Orofacial Pain: Prospective Evaluation and Risk Assessment (OPPERA) prospective cohort study were used. During the 2.

View Article and Find Full Text PDF

Identifying US County-level characteristics associated with high COVID-19 burden.

Daniel Li Sheila M Gaynor Corbin Quick Jarvis T Chen Briana J K Stephenson

BMC Public Health

May 2021

Background: Identifying county-level characteristics associated with high coronavirus 2019 (COVID-19) burden can help allow for data-driven, equitable allocation of public health intervention resources and reduce burdens on health care systems.

Methods: Synthesizing data from various government and nonprofit institutions for all 3142 United States (US) counties, we studied county-level characteristics that were associated with cumulative and weekly case and death rates through 12/21/2020. We used generalized linear mixed models to model cumulative and weekly (40 repeated measures per county) cases and deaths.

View Article and Find Full Text PDF

Identifying US Counties with High Cumulative COVID-19 Burden and Their Characteristics.

Daniel Li Sheila M Gaynor Corbin Quick Jarvis T Chen Briana J K Stephenson

medRxiv

January 2021

Unlabelled: Identifying areas with high COVID-19 burden and their characteristics can help improve vaccine distribution and uptake, reduce burdens on health care systems, and allow for better allocation of public health intervention resources. Synthesizing data from various government and nonprofit institutions of 3,142 United States (US) counties as of 12/21/2020, we studied county-level characteristics that are associated with cumulative case and death rates using regression analyses. Our results showed counties that are more rural, counties with more White/non-White segregation, and counties with higher percentages of people of color, in poverty, with no high school diploma, and with medical comorbidities such as diabetes and hypertension are associated with higher cumulative COVID-19 case and death rates.

View Article and Find Full Text PDF

Phenotypic profile clustering pragmatically identifies diagnostically and mechanistically informative subgroups of chronic pain patients.

Sheila M Gaynor Andrey Bortsov Eric Bair Roger B Fillingim Joel D Greenspan

Pain

May 2021

Traditional classification and prognostic approaches for chronic pain conditions focus primarily on anatomically based clinical characteristics not based on underlying biopsychosocial factors contributing to perception of clinical pain and future pain trajectories. Using a supervised clustering approach in a cohort of temporomandibular disorder cases and controls from the Orofacial Pain: Prospective Evaluation and Risk Assessment study, we recently developed and validated a rapid algorithm (ROPA) to pragmatically classify chronic pain patients into 3 groups that differed in clinical pain report, biopsychosocial profiles, functional limitations, and comorbid conditions. The present aim was to examine the generalizability of this clustering procedure in 2 additional cohorts: a cohort of patients with chronic overlapping pain conditions (Complex Persistent Pain Conditions study) and a real-world clinical population of patients seeking treatment at duke innovative pain therapies.

View Article and Find Full Text PDF

Integration of multiomic annotation data to prioritize and characterize inflammation and immune-related risk variants in squamous cell lung cancer.

Ryan Sun Miao Xu Xihao Li Sheila Gaynor Hufeng Zhou

Genet Epidemiol

February 2021

Clinical trial results have recently demonstrated that inhibiting inflammation by targeting the interleukin-1β pathway can offer a significant reduction in lung cancer incidence and mortality, highlighting a pressing and unmet need to understand the benefits of inflammation-focused lung cancer therapies at the genetic level. While numerous genome-wide association studies (GWAS) have explored the genetic etiology of lung cancer, there remains a large gap between the type of information that may be gleaned from an association study and the depth of understanding necessary to explain and drive translational findings. Thus, in this study we jointly model and integrate extensive multiomics data sources, utilizing a total of 40 genome-wide functional annotations that augment previously published results from the International Lung Cancer Consortium (ILCCO) GWAS, to prioritize and characterize single nucleotide polymorphisms (SNPs) that increase risk of squamous cell lung cancer through the inflammatory and immune responses.

View Article and Find Full Text PDF

Dynamic incorporation of multiple in silico functional annotations empowers rare variant association analysis of large whole-genome sequencing studies at scale.

Xihao Li Zilin Li Hufeng Zhou Sheila M Gaynor Yaowu Liu

Nat Genet

September 2020

Large-scale whole-genome sequencing studies have enabled the analysis of rare variants (RVs) associated with complex phenotypes. Commonly used RV association tests have limited scope to leverage variant functions. We propose STAAR (variant-set test for association using annotation information), a scalable and powerful RV association test method that effectively incorporates both variant categories and multiple complementary annotations using a dynamic weighting scheme.

View Article and Find Full Text PDF

Allelic Heterogeneity at the CRP Locus Identified by Whole-Genome Sequencing in Multi-ancestry Cohorts.

Laura M Raffield Apoorva K Iyengar Biqi Wang Sheila M Gaynor Cassandra N Spracklen

Am J Hum Genet

January 2020

Whole-genome sequencing (WGS) can improve assessment of low-frequency and rare variants, particularly in non-European populations that have been underrepresented in existing genomic studies. The genetic determinants of C-reactive protein (CRP), a biomarker of chronic inflammation, have been extensively studied, with existing genome-wide association studies (GWASs) conducted in >200,000 individuals of European ancestry. In order to discover novel loci associated with CRP levels, we examined a multi-ancestry population (n = 23,279) with WGS (∼38× coverage) from the Trans-Omics for Precision Medicine (TOPMed) program.

View Article and Find Full Text PDF

Identification of differentially expressed gene sets using the Generalized Berk-Jones statistic.

Sheila M Gaynor Ryan Sun Xihong Lin John Quackenbush

Bioinformatics

November 2019

Motivation: Cancer genomics studies frequently aim to identify genes that are differentially expressed between clinically distinct patient subgroups, generally by testing single genes one at a time. However, the results of any individual transcriptomic study are often not fully reproducible. A particular challenge impeding statistical analysis is the difficulty of distinguishing between differential expression comprising part of the genomic disease etiology and that induced by downstream effects.

View Article and Find Full Text PDF

Mediation analysis for common binary outcomes.

Sheila M Gaynor Joel Schwartz Xihong Lin

Stat Med

February 2019

Mediation analysis provides an attractive causal inference framework to decompose the total effect of an exposure on an outcome into natural direct effects and natural indirect effects acting through a mediator. For binary outcomes, mediation analysis methods have been developed using logistic regression when the binary outcome is rare. These methods will not hold in practice when a disease is common.

View Article and Find Full Text PDF

Identification of relevant subtypes via preweighted sparse clustering.

Sheila Gaynor Eric Bair

Comput Stat Data Anal

December 2017

Cluster analysis methods are used to identify homogeneous subgroups in a data set. In biomedical applications, one frequently applies cluster analysis in order to identify biologically interesting subgroups. In particular, one may wish to identify subgroups that are associated with a particular outcome of interest.

View Article and Find Full Text PDF

Identification of three different types of smokers who are not motivated to quit: Results from a latent class analysis.

Belinda Borrelli Sheila Gaynor Erin Tooley Christopher J Armitage Alison Wearden

Health Psychol

February 2018

Objective: The majority of smokers are not motivated to quit within 30 days. We examined whether these smokers are a homogeneous group, hypothesizing that subtypes of unmotivated smokers could be identified.

Method: Included were 500 smokers not ready to quit within 30 days who completed an online survey assessing variables known to be associated with quitting.

View Article and Find Full Text PDF