Characterizing substructure via mixture modeling in large-scale genetic summary statistics.

Hayley R Stoneman Adelle M Price Nikole Scribner Trout Riley Lamont Souha Tifour Nikita Pozdeyev Kristy Crooks Meng Lin Nicholas Rafaels Christopher R Gignoux Katie M Marker Audrey E Hendricks

Am J Hum Genet

Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA; Human Medical Genetics and Genomics Program, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA; Mathematical and Statistical Sciences, University of Colorado Denver, Denver, CO 80204, USA; Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA. Electronic address:

Published: January 2025

Genetic summary data are broadly accessible and highly useful, including for risk prediction, causal inference, fine mapping, and incorporation of external controls. However, collapsing individual-level data into summary data, such as allele frequencies, masks intra- and inter-sample heterogeneity, leading to confounding, reduced power, and bias. Ultimately, unaccounted-for substructure limits summary data usability, especially for understudied or admixed populations. There is a need for methods to enable the harmonization of summary data where the underlying substructure is matched between datasets. Here, we present Summix2, a comprehensive set of methods and software based on a computationally efficient mixture model to enable the harmonization of genetic summary data by estimating and adjusting for substructure. In extensive simulations and application to public data, we show that Summix2 characterizes finer-scale population structure, identifies ascertainment bias, and scans for potential regions of selection due to local substructure deviation. Summix2 increases the robust use of diverse, publicly available summary data, resulting in improved and more equitable research.

Download full-text PDF	Source
http://dx.doi.org/10.1016/j.ajhg.2024.12.007	DOI Listing

Publication Analysis

Top Keywords

summary data

genetic summary

data

enable harmonization

summary

characterizing substructure

substructure mixture

mixture modeling

modeling large-scale

large-scale genetic

Similar Publications

State-of-the-Art Liver Cancer Organoids: Modeling Cancer Stem Cell Heterogeneity for Personalized Treatment.

BioDrugs

January 2025

Orsay-Vallée Campus, Paris-Saclay University, Gif-sur-Yvette, France.

Julien Giron-Michel Maël Padelli Estelle Oberlin Hind Guenou Jean-Charles Duclos-Vallée

Liver cancer poses a global health challenge with limited therapeutic options. Notably, the limited success of current therapies in patients with primary liver cancers (PLCs) may be attributed to the high heterogeneity of both hepatocellular carcinoma (HCCs) and intrahepatic cholangiocarcinoma (iCCAs). This heterogeneity evolves over time as tumor-initiating stem cells, or cancer stem cells (CSCs), undergo (epi)genetic alterations or encounter microenvironmental changes within the tumor microenvironment.

View Article and Find Full Text PDF

Similar Publications

Methodological Insights on Biomarker-Based Patient Selection: A Review of Scientific Advice Procedures at the European Medicines Agency.

Clin Pharmacol Ther

January 2025

Data Analytics and Methods Task Force, European Medicines Agency, Amsterdam, The Netherlands.

Cynthia Huber Joerg Zinserling Norbert Benda Thorsten Vetter Marcia Rueckbeil

Biomarkers play a pivotal role in the selection and enrollment of trial participants. Particularly, predictive biomarkers help tailor medical care to individual patients; however, also prognostic biomarkers require consideration at the design stage. At the time of initiating a clinical trial, there may be uncertainty about whether a biomarker is predictive or prognostic, and the trial design may need to account for this.

View Article and Find Full Text PDF

Similar Publications

Identification of association between mitochondrial dysfunction and sarcopenia using summary-data-based Mendelian randomization and colocalization analyses.

J Gerontol A Biol Sci Med Sci

January 2025

Department of Joint Surgery, HongHui Hospital, Xi'an Jiaotong University, Xi'an, Shaanxi, China.

Jiale Xie Jinrong Hao Xin Xu Jiachen Wang Dinglong Yang

Background: Mitochondrial dysfunction has been demonstrated to be an important hallmark of sarcopenia, yet its specific mechanism remains obscure. In this study, mitochondrial-related genes were used as instrumental variables to proxy for mitochondrial dysfunction, and summary data for sarcopenia-related traits were used as outcomes to examine their genetic association.

Methods: A total of 1,136 mitochondrial-related genes from the human MitoCarta3.

View Article and Find Full Text PDF

Similar Publications

Pattern of Referral of Patients with COVID-19 for Psychiatric Consultation: Experience at a Multidisciplinary Hospital in Karachi, Pakistan.

J Coll Physicians Surg Pak

January 2025

Department of Psychiatry, The Aga Khan University Hospital, Karachi, Pakistan.

Aisha Noorullah Bilal Ahmed Muhammad Yusuf Hafiz Shahina Pirani Murad Khan

Objective: To determine referral patterns for psychiatric consultations among COVID-19 patients encompassing both the in-patient and Emergency Department of a multidisciplinary hospital in Karachi, Pakistan.

Study Design: A retrospective chart review. Place and Duration of the Study: The Aga Khan University Hospital, Karachi, Pakistan, from March 2020 to December 2021.

View Article and Find Full Text PDF

Similar Publications

Association between weight-adjusted-waist index and long-term prognostic outcomes in cardiovascular disease patients: results from the NHANES 1999-2018 study.

Diabetol Metab Syndr

January 2025

Rehabilitation Medicine Center, Institute of Rehabilitation Medicine, West China Hospital, Sichuan University, Chengdu, 610041, China.

Hanbin Li Wen Zhong Hongxin Cheng Shiqi Wang Ran Li

Background: As cardiovascular disease (CVD) morbidity and mortality increase yearly, this study aimed to explore the potential of the weight-adjusted-waist index (WWI) and its relation to long-term mortality in patients with CVD.

Methods: The diagnosis of CVD was based on standardized medical condition questionnaires that incorporated participants' self-reported physician diagnoses. WWI (cm/√kg) is a continuous variable and calculated as waist circumference (WC, cm) divided by square root of body weight (kg).

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!