Publications by Satten G | LitMetric

Publications by authors named "Satten G"

Identifying unmeasured heterogeneity in microbiome data via quantile thresholding (QuanT).

Jiuyao Lu Glen A Satten Katie A Meyer Lenore J Launer Wodan Ling

bioRxiv

March 2025

Microbiome data, like other high-throughput data, suffer from technical heterogeneity stemming from differential experimental designs and processing. In addition to measured artifacts such as batch effects, there is heterogeneity due to unknown or unmeasured factors, which can lead to spurious conclusions if unaccounted for. With the advent of large-scale multi-center microbiome studies and the increasing availability of public datasets, this issue becomes more pronounced.

MIDASim: a fast and simple simulator for realistic microbiome data.

Mengyu He Ni Zhao Glen A Satten

Microbiome

July 2024

Background: Advances in sequencing technology have led to the discovery of associations between the human microbiota and many diseases, conditions, and traits. With the increasing availability of microbiome data, many statistical methods have been developed for studying these associations. The growing number of newly developed methods highlights the need for simple, rapid, and reliable methods to simulate realistic microbiome data, which is essential for validating and evaluating the performance of these methods.

MERIT: Controlling Monte-Carlo error rate in large-scale Monte-Carlo hypothesis testing.

Yunxiao Li Yi-Juan Hu Glen A Satten

Stat Med

January 2024

The use of Monte-Carlo (MC) p-values when testing the significance of a large number of hypotheses is now commonplace. In large-scale hypothesis testing, we will typically encounter at least some p-values near the threshold of significance, which require a larger number of MC replicates than p-values that are far from the threshold. As a result, some incorrect conclusions can be reached due to MC error alone; for hypotheses near the threshold, even a very large number (eg, ) of MC replicates may not be enough to guarantee conclusions reached using MC p-values.
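
To make the idea of a Monte-Carlo p-value concrete, here is a minimal sketch of a generic MC p-value computation. It is not the MERIT procedure from this paper. The function name `mc_p_value`, the callback `simulate_null_stat`, and the add-one estimator are assumptions chosen for illustration.

```python
import random


def mc_p_value(observed_stat, simulate_null_stat, n_replicates=1000, seed=0):
    """Estimate a one-sided Monte-Carlo p-value (illustrative sketch only).

    simulate_null_stat is a hypothetical callback: it takes a random.Random
    instance and returns one test statistic drawn under the null hypothesis.
    """
    rng = random.Random(seed)
    # Count null replicates at least as extreme as the observed statistic.
    exceed = sum(
        1 for _ in range(n_replicates) if simulate_null_stat(rng) >= observed_stat
    )
    # Add-one estimator (a common convention, assumed here): the estimate
    # can never be smaller than 1 / (n_replicates + 1).
    return (exceed + 1) / (n_replicates + 1)


# Usage: observed statistic 2.0 tested against a standard normal null.
p = mc_p_value(2.0, lambda rng: rng.gauss(0.0, 1.0), n_replicates=9999)
print(p)
```

Because the estimate is a random quantity based on a finite number of replicates, a true p-value close to the significance threshold can fall on either side of it from one run to the next, which is the MC error the abstract describes.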

Ranked severe maternal morbidity index for population-level surveillance at delivery hospitalization based on hospital discharge data.

Elena V Kuklina Alexander C Ewing Glen A Satten William M Callaghan David A Goodman

PLoS One

November 2023

Background: Severe maternal morbidity (SMM) is broadly defined as an unexpected and potentially life-threatening event associated with labor and delivery. The Centers for Disease Control and Prevention (CDC) produced 21 different indicators based on International Classification of Diseases, 9th Revision, Clinical Modification (ICD-9-CM) hospital diagnostic and procedure codes to identify cases of SMM.

Objectives: To examine existing SMM indicators and determine which indicators identified the most in-hospital mortality at delivery hospitalization.

Compositional analysis of microbiome data using the linear decomposition model (LDM).

Yi-Juan Hu Glen A Satten

Bioinformatics

November 2023

Summary: There are compelling reasons to test compositional hypotheses about microbiome data. We present here linear decomposition model-centered log ratio (LDM-clr), an extension of our LDM approach to allow fitting linear models to centered-log-ratio-transformed taxa count data. As LDM-clr is implemented within the existing LDM program, this extension enjoys all the features supported by LDM, including a compositional analysis of differential abundance at both the taxon and community levels, while allowing for a wide range of covariates and study designs for either association or mediation analysis.
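
For readers unfamiliar with the centered-log-ratio transformation mentioned above, the following is a minimal sketch of the transform applied to one sample's taxa counts. It does not reproduce the LDM program. The function name `clr` and the pseudocount used to handle zero counts are assumptions for illustration; the abstract does not say how LDM-clr treats zeros.

```python
import math


def clr(counts, pseudocount=0.5):
    """Centered-log-ratio transform of one sample's taxa counts (sketch).

    The pseudocount is an assumed way to avoid log(0); it is not taken
    from the paper.
    """
    logs = [math.log(c + pseudocount) for c in counts]
    mean_log = sum(logs) / len(logs)
    # Subtracting the mean log equals taking the log of each value
    # divided by the geometric mean of the sample.
    return [v - mean_log for v in logs]


# Usage: a sample with one zero count.
print(clr([0, 3, 7, 120]))
```

By construction the transformed values of a sample sum to zero, and without a pseudocount the result is unchanged when all counts are multiplied by a constant. That is why the transform suits data that carry only relative information.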

Integrative analysis of microbial 16S gene and shotgun metagenomic sequencing data improves statistical efficiency.

Ye Yue Timothy D Read Veronika Fedirko Glen A Satten Yi-Juan Hu

Res Sq

October 2023

Background: The most widely used technologies for profiling microbial communities are 16S marker-gene sequencing and shotgun metagenomic sequencing. Interestingly, many microbiome studies have performed both sequencing experiments on the same cohort of samples. The two sequencing datasets often reveal consistent patterns of microbial signatures, highlighting the potential for an integrative analysis to improve power of testing these signatures.

Impact of Experimental Bias on Compositional Analysis of Microbiome Data.

Yingtian Hu Glen A Satten Yi-Juan Hu

Genes (Basel)

September 2023

Microbiome data are subject to experimental bias that is caused by DNA extraction and PCR amplification, among other sources, but this important feature is often ignored when developing statistical methods for analyzing microbiome data. McLaren, Willis, and Callahan (2019) proposed a model for how such biases affect the observed taxonomic profiles; this model assumes the main effects of bias without taxon-taxon interactions. Our newly developed method for testing the differential abundance of taxa, LOCOM, is the first method to account for experimental bias and is robust to the main effect biases.

Integrative analysis of microbial 16S gene and shotgun metagenomic sequencing data improves statistical efficiency.

Ye Yue Timothy D Read Veronika Fedirko Glen A Satten Yi-Juan Hu

bioRxiv

June 2023

The most widely used technologies for profiling microbial communities are 16S marker-gene sequencing and shotgun metagenomic sequencing. Interestingly, many microbiome studies have performed both sequencing experiments on the same cohort of samples. The two sequencing datasets often reveal consistent patterns of microbial signatures, highlighting the potential for an integrative analysis to improve power of testing these signatures.

Compositional analysis of microbiome data using the linear decomposition model (LDM).

Yi-Juan Hu Glen A Satten

bioRxiv

May 2023

Summary: There are compelling reasons to test compositional hypotheses about microbiome data. We present here LDM-clr, an extension of our linear decomposition model (LDM) approach to allow fitting linear models to centered-log-ratio-transformed taxa count data. As LDM-clr is implemented within the existing LDM program, it enjoys all the features supported by LDM, including a compositional analysis of differential abundance at both the taxon and community levels, while allowing for a wide range of covariates and study designs for either association or mediation analysis.

Reply of the Authors: Fresh vs. frozen embryo transfer: new approach to minimize the limitations of using national surveillance data for clinical research.

Marissa Steinberg Weiss Chongliang Luo Yong Chen Glen A Satten Kurt T Barnhart

Fertil Steril

June 2023

MIDASim: a fast and simple simulator for realistic microbiome data.

Mengyu He Ni Zhao Glen A Satten

bioRxiv

March 2024

Background: Advances in sequencing technology have led to the discovery of associations between the human microbiota and many diseases, conditions, and traits. With the increasing availability of microbiome data, many statistical methods have been developed for studying these associations. The growing number of newly developed methods highlights the need for simple, rapid, and reliable methods to simulate realistic microbiome data, which is essential for validating and evaluating the performance of these methods.

Impact of experimental bias on compositional analysis of microbiome data.

Yingtian Hu Glen A Satten Yi-Juan Hu

bioRxiv

February 2023

Microbiome data are subject to experimental bias that is caused by DNA extraction and PCR amplification, among other sources, but this important feature is often ignored when developing statistical methods for analyzing microbiome data. McLaren, Willis and Callahan (2019) proposed a model for how such bias affects the observed taxonomic profiles, which assumes main effects of bias without taxon-taxon interactions. Our newly developed method for testing differential abundance of taxa, LOCOM (logistic regression for compositional analysis), is the first method that accounts for experimental bias and is robust to the main effect biases.

Fresh vs. frozen embryo transfer: new approach to minimize the limitations of using national surveillance data for clinical research.

Marissa Steinberg Weiss Chongliang Luo Yujia Zhang Yong Chen Dmitry M Kissin

Fertil Steril

February 2023

Objective: To assess the benefit of frozen vs. fresh elective single embryo transfer using traditional and novel methods of controlling for confounding.

Design: Retrospective cohort study using data from the National Assisted Reproductive Technology Surveillance System.

What Can We Learn about the Bias of Microbiome Studies from Analyzing Data from Mock Communities?

Mo Li Robert E Tyx Angel J Rivera Ni Zhao Glen A Satten

Genes (Basel)

September 2022

It is known that data from both 16S and shotgun metagenomics studies are subject to biases that cause the observed relative abundances of taxa to differ from their true values. Model community analyses, in which the relative abundances of all taxa in the sample are known by construction, seem to offer the hope that these biases can be measured. However, it is unclear whether the bias we measure in a mock community analysis is the same as the bias we measure in a sample in which taxa are spiked in at known relative abundance, or if the biases we measure in spike-in samples are the same as the bias we would measure in a real (e.

Testing microbiome associations with survival times at both the community and individual taxon levels.

Yingtian Hu Yunxiao Li Glen A Satten Yi-Juan Hu

PLoS Comput Biol

September 2022

Background: Finding microbiome associations with possibly censored survival times is an important problem, especially as specific taxa could serve as biomarkers for disease prognosis or as targets for therapeutic interventions. The two existing methods for survival outcomes, MiRKAT-S and OMiSA, are restricted to testing associations at the community level and do not provide results at the individual taxon level. An ad hoc approach testing each taxon with a survival outcome using the Cox proportional hazard model may not perform well in the microbiome setting with sparse count data and small sample sizes.

LOCOM: A logistic regression model for testing differential abundance in compositional microbiome data with false discovery rate control.

Yingtian Hu Glen A Satten Yi-Juan Hu

Proc Natl Acad Sci U S A

July 2022

Compositional analysis is based on the premise that a relatively small proportion of taxa are differentially abundant, while the ratios of the relative abundances of the remaining taxa remain unchanged. Most existing methods use log-transformed data, but log-transformation of data with pervasive zero counts is problematic, and these methods cannot always control the false discovery rate (FDR). Further, high-throughput microbiome data such as 16S amplicon or metagenomic sequencing are subject to experimental biases that are introduced in every step of the experimental workflow.

A Bottom-up Approach to Testing Hypotheses That Have a Branching Tree Dependence Structure, with Error Rate Control.

Yunxiao Li Yi-Juan Hu Glen A Satten

J Am Stat Assoc

September 2020

Modern statistical analyses often involve testing large numbers of hypotheses. In many situations, these hypotheses may have an underlying tree structure that both helps determine the order in which tests should be conducted and imposes a dependency between tests that must be accounted for. Our motivating example comes from testing the association between a trait of interest and groups of microbes that have been organized into operational taxonomic units (OTUs) or amplicon sequence variants (ASVs).

A rarefaction-without-resampling extension of PERMANOVA for testing presence-absence associations in the microbiome.

Yi-Juan Hu Glen A Satten

Bioinformatics

August 2022

Motivation: PERMANOVA is currently the most commonly used method for testing community-level hypotheses about microbiome associations with covariates of interest. PERMANOVA can test for associations that result from changes in which taxa are present or absent by using the Jaccard or unweighted UniFrac distance. However, such presence-absence analyses face a unique challenge: confounding by library size (total sample read count), which occurs when library size is associated with covariates in the analysis.

Integrative analysis of relative abundance data and presence-absence data of the microbiome using the LDM.

Zhengyi Zhu Glen A Satten Yi-Juan Hu

Bioinformatics

May 2022

Summary: We previously developed the LDM for testing hypotheses about the microbiome that performs the test at both the community level and the individual taxon level. The LDM can be applied to relative abundance data and presence-absence data separately, which work well when associated taxa are abundant and rare, respectively. Here, we propose LDM-omni3 that combines LDM analyses at the relative abundance and presence-absence data scales, thereby offering optimal power across scenarios with different association mechanisms.

Associations between microbial communities and key chemical constituents in U.S. domestic moist snuff.

Robert E Tyx Angel J Rivera Glen A Satten Lisa M Keong Peter Kuklenyik

PLoS One

May 2022

Background: Smokeless tobacco (ST) products are widely used throughout the world and contribute to morbidity and mortality in users through an increased risk of cancers and oral diseases. Bacterial populations in ST contribute to taste, but their presence can also create carcinogenic, Tobacco-Specific N-nitrosamines (TSNAs). Previous studies of microbial communities in tobacco products lacked chemistry data (e.

Efficient estimation of indirect effects in case-control studies using a unified likelihood framework.

Glen A Satten Sarah W Curtis Claudia Solis-Lemus Elizabeth J Leslie Michael P Epstein

Stat Med

July 2022

Mediation models are a set of statistical techniques that investigate the mechanisms that produce an observed relationship between an exposure variable and an outcome variable in order to deduce the extent to which the relationship is influenced by intermediate mediator variables. For a case-control study, the most common mediation analysis strategy employs a counterfactual framework that permits estimation of indirect and direct effects on the odds ratio scale for dichotomous outcomes, assuming either binary or continuous mediators. While this framework has become an important tool for mediation analysis, we demonstrate that we can embed this approach in a unified likelihood framework for mediation analysis in case-control studies that leverages more features of the data (in particular, the relationship between exposure and mediator) to improve efficiency of indirect effect estimates.

The Effect of Antiretroviral Therapy for the Treatment of Human Immunodeficiency Virus (HIV)-1 in Pregnancy on Gestational Weight Gain.

Naima T Joseph Glen A Satten Rachel E Williams Lisa B Haddad Denise J Jamieson

Clin Infect Dis

September 2022

Background: Gestational weight gain above Institute of Medicine recommendations is associated with increased risk of pregnancy complications. The goal was to analyze the association between newer HIV antiretroviral regimens (ART) and gestational weight gain.

Methods: A retrospective cohort study of pregnant women with HIV-1 on ART.

Constraining PERMANOVA and LDM to within-set comparisons by projection improves the efficiency of analyses of matched sets of microbiome data.

Zhengyi Zhu Glen A Satten Caroline Mitchell Yi-Juan Hu

Microbiome

June 2021

Background: Matched-set data arise frequently in microbiome studies. For example, we may collect pre- and post-treatment samples from a set of individuals, or use important confounding variables to match data from case participants to one or more control participants. Thus, there is a need for statistical methods for data comprised of matched sets, to test hypotheses against traits of interest (e.

Vaginal Microbiome Composition in Early Pregnancy and Risk of Spontaneous Preterm and Early Term Birth Among African American Women.

Anne L Dunlop Glen A Satten Yi-Juan Hu Anna K Knight Cherie C Hill

Front Cell Infect Microbiol

July 2021

Objective: To evaluate the association between the early pregnancy vaginal microbiome and spontaneous preterm birth (sPTB) and early term birth (sETB) among African American women.

Methods: Vaginal samples collected in early pregnancy (8-14 weeks' gestation) from 436 women enrolled in the Emory University African American Vaginal, Oral, and Gut Microbiome in Pregnancy Study underwent 16S rRNA gene sequencing of the V3-V4 region, taxonomic classification, and community state type (CST) assignment. We compared vaginal CST and abundance of taxa for women whose pregnancy ended in sPTB (N = 44) or sETB (N = 84) to those who delivered full term (N = 231).

A rarefaction-based extension of the LDM for testing presence-absence associations in the microbiome.

Yi-Juan Hu Andrea Lane Glen A Satten

Bioinformatics

July 2021

Motivation: Many methods for testing association between the microbiome and covariates of interest (e.g. clinical outcomes, environmental factors) assume that these associations are driven by changes in the relative abundance of taxa.
