Characterizing substructure via mixture modeling in large-scale genetic summary statistics.

Am J Hum Genet

Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA; Human Medical Genetics and Genomics Program, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA; Mathematical and Statistical Sciences, University of Colorado Denver, Denver, CO 80204, USA; Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA. Electronic address:

Published: January 2025

Genetic summary data are broadly accessible and highly useful, including for risk prediction, causal inference, fine mapping, and incorporation of external controls. However, collapsing individual-level data into summary data, such as allele frequencies, masks intra- and inter-sample heterogeneity, leading to confounding, reduced power, and bias. Ultimately, unaccounted-for substructure limits summary data usability, especially for understudied or admixed populations. There is a need for methods to enable the harmonization of summary data where the underlying substructure is matched between datasets. Here, we present Summix2, a comprehensive set of methods and software based on a computationally efficient mixture model to enable the harmonization of genetic summary data by estimating and adjusting for substructure. In extensive simulations and application to public data, we show that Summix2 characterizes finer-scale population structure, identifies ascertainment bias, and scans for potential regions of selection due to local substructure deviation. Summix2 increases the robust use of diverse, publicly available summary data, resulting in improved and more equitable research.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.ajhg.2024.12.007DOI Listing

Publication Analysis

Top Keywords

summary data
24
genetic summary
12
data
8
enable harmonization
8
summary
7
characterizing substructure
4
substructure mixture
4
mixture modeling
4
modeling large-scale
4
large-scale genetic
4

Similar Publications

Liver cancer poses a global health challenge with limited therapeutic options. Notably, the limited success of current therapies in patients with primary liver cancers (PLCs) may be attributed to the high heterogeneity of both hepatocellular carcinoma (HCCs) and intrahepatic cholangiocarcinoma (iCCAs). This heterogeneity evolves over time as tumor-initiating stem cells, or cancer stem cells (CSCs), undergo (epi)genetic alterations or encounter microenvironmental changes within the tumor microenvironment.

View Article and Find Full Text PDF

Biomarkers play a pivotal role in the selection and enrollment of trial participants. Particularly, predictive biomarkers help tailor medical care to individual patients; however, also prognostic biomarkers require consideration at the design stage. At the time of initiating a clinical trial, there may be uncertainty about whether a biomarker is predictive or prognostic, and the trial design may need to account for this.

View Article and Find Full Text PDF

Background: Mitochondrial dysfunction has been demonstrated to be an important hallmark of sarcopenia, yet its specific mechanism remains obscure. In this study, mitochondrial-related genes were used as instrumental variables to proxy for mitochondrial dysfunction, and summary data for sarcopenia-related traits were used as outcomes to examine their genetic association.

Methods: A total of 1,136 mitochondrial-related genes from the human MitoCarta3.

View Article and Find Full Text PDF

Objective: To determine referral patterns for psychiatric consultations among COVID-19 patients encompassing both the in-patient and Emergency Department of a multidisciplinary hospital in Karachi, Pakistan.

Study Design: A retrospective chart review. Place and Duration of the Study: The Aga Khan University Hospital, Karachi, Pakistan, from March 2020 to December 2021.

View Article and Find Full Text PDF

Association between weight-adjusted-waist index and long-term prognostic outcomes in cardiovascular disease patients: results from the NHANES 1999-2018 study.

Diabetol Metab Syndr

January 2025

Rehabilitation Medicine Center, Institute of Rehabilitation Medicine, West China Hospital, Sichuan University, Chengdu, 610041, China.

Background: As cardiovascular disease (CVD) morbidity and mortality increase yearly, this study aimed to explore the potential of the weight-adjusted-waist index (WWI) and its relation to long-term mortality in patients with CVD.

Methods: The diagnosis of CVD was based on standardized medical condition questionnaires that incorporated participants' self-reported physician diagnoses. WWI (cm/√kg) is a continuous variable and calculated as waist circumference (WC, cm) divided by square root of body weight (kg).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!