Observational studies of recurrent event rates are common in biomedical statistics. Broadly, the goal is to estimate differences in event rates under 2 treatments within a defined target population over a specified follow-up window. Estimation with observational data is challenging because, while membership in the target population is defined in terms of eligibility criteria, treatment is rarely observed exactly at the time of eligibility.
View Article and Find Full Text PDFWe develop a Bayesian semiparametric model for the impact of dynamic treatment rules on survival among patients diagnosed with pediatric acute myeloid leukemia (AML). The data consist of a subset of patients enrolled in a phase III clinical trial in which patients move through a sequence of four treatment courses. At each course, they undergo treatment that may or may not include anthracyclines (ACT).
View Article and Find Full Text PDFBackground: Cytomegalovirus (CMV) commonly reactivates after allogeneic hematopoietic cell transplant (HCT), potentially leading to CMV disease and significant morbidity and mortality. To reduce morbidity and mortality, many centers conduct weekly CMV blood polymerase chain reaction (PCR) surveillance testing with subsequent initiation of antiviral therapy upon CMV DNAemia detection. However, the impact of CMV DNAemia on subsequent hospitalization risk has not been assessed using models accounting for the time-varying nature of the exposure, outcome, and confounders.
View Article and Find Full Text PDFA major focus of causal inference is the estimation of heterogeneous average treatment effects (HTE) - average treatment effects within strata of another variable of interest such as levels of a biomarker, education, or age strata. Inference involves estimating a stratum-specific regression and integrating it over the distribution of confounders in that stratum - which itself must be estimated. Standard practice involves estimating these stratum-specific confounder distributions independently (e.
View Article and Find Full Text PDFBackground: With rising cost pressures on health care systems, machine-learning (ML)-based algorithms are increasingly used to predict health care costs. Despite their potential advantages, the successful implementation of these methods could be undermined by biases introduced in the design, conduct, or analysis of studies seeking to develop and/or validate ML models. The utility of such models may also be negatively affected by poor reporting of these studies.
View Article and Find Full Text PDFBackground: Ventilator-associated lower respiratory tract infection (VA-LRTI) is common among critically ill patients and has been associated with increased morbidity and mortality. In acute critical illness, respiratory microbiome disruption indices (MDIs) have been shown to predict risk for VA-LRTI, but their utility beyond the first days of critical illness is unknown. We sought to characterize how MDIs previously shown to predict VA-LRTI at initiation of mechanical ventilation change with prolonged mechanical ventilation, and if they remain associated with VA-LRTI risk.
View Article and Find Full Text PDFSubstantial advances in Bayesian methods for causal inference have been made in recent years. We provide an introduction to Bayesian inference for causal effects for practicing statisticians who have some familiarity with Bayesian models and would like an overview of what it can add to causal estimation in practical settings. In the paper, we demonstrate how priors can induce shrinkage and sparsity in parametric models and be used to perform probabilistic sensitivity analyses around causal assumptions.
View Article and Find Full Text PDFObjectives: To evaluate the impact of the Community-Based Care Management (CBCM) program on total costs of care and utilization among adult high-need, high-cost patients enrolled in a Medicaid managed care organization (MCO). CBCM was a Medicaid insurer-led care coordination and disease management program staffed by nurse care managers paired with community health workers.
Study Design: Retrospective cohort analysis.
Researchers are often interested in predicting outcomes, detecting distinct subgroups of their data, or estimating causal treatment effects. Pathological data distributions that exhibit skewness and zero-inflation complicate these tasks-requiring highly flexible, data-adaptive modeling. In this paper, we present a multipurpose Bayesian nonparametric model for continuous, zero-inflated outcomes that simultaneously predicts structural zeros, captures skewness, and clusters patients with similar joint data distributions.
View Article and Find Full Text PDFImportance: The effect of the Patient Protection and Affordable Care Act's Medicaid expansion on cancer care delivery and outcomes is unknown. Patients with cancer are a high-risk group for whom treatment delays are particularly detrimental.
Objective: To examine the association between Medicaid expansion and changes in insurance status, stage at diagnosis, and timely treatment among patients with incident breast, colon, and non-small cell lung cancer.
Previous studies indicate racial/ethnic differences in health care utilization for pediatric atopic dermatitis (AD), but do not account for disease severity impact. We sought to examine the relationship between race/ethnicity and health care utilization, both overall and by specific visit type, while accounting for AD control. A longitudinal cohort study of children with AD in the United States was performed to evaluate the association between race/ethnicity and health care utilization for AD.
View Article and Find Full Text PDFDescribe the development of a claims-based classifier utilizing machine learning to identify patients with probable Lennox-Gastaut syndrome (LGS) from six state Medicaid programs. Patients were included if they had ≥2 medical claims ≥30 days apart for specified or unspecified epilepsy, excluding those with ≥1 claim for petit mal status. The LGS classifier utilized a random forest algorithm, a compilation of thousands of binary decision trees in which machine-generated predictor variables split the data set into branches that predict the presence or absence of LGS.
View Article and Find Full Text PDFPhenotyping, ie, identification of patients possessing a characteristic of interest, is a fundamental task for research conducted using electronic health records. However, challenges to this task include imperfect sensitivity and specificity of clinical codes and inconsistent availability of more detailed data such as laboratory test results. Despite these challenges, most existing electronic health records-derived phenotypes are rule-based, consisting of a series of Boolean arguments informed by expert knowledge of the disease of interest and its coding.
View Article and Find Full Text PDF