We recently developed a machine-learning subgrouping algorithm, iterative causal forest (iCF), to identify subgroups with heterogeneous treatment effects (HTEs) using predefined covariates. However, such predefined covariates may miss or poorly define important features leading to inaccurate subgrouping. To address such limitations, we developed a new semi-automatic subgrouping algorithm, hdiCF, which adapts methodology from high-dimensional propensity score for feature recognition in claims data. The hdiCF algorithm has 3 steps: 1) high-dimensional feature identification by International Classification of Diseases, Current Procedural Terminology, and Anatomical Therapeutic Chemical codes (in/outpatient diagnoses, procedures, prescriptions) and creation of ordinal variables by frequency of occurrence; 2) propensity score trimming and high-dimensional feature preparation; 3) iCF implementation to identify subgroups. We applied hdiCF in a 20% random sample of fee-for-service Medicare beneficiaries who initiated sodium-glucose cotransporter-2 inhibitors (SGLT2i) or glucagon-like peptide-1 receptor agonists to identify subgroups with HTEs for incidence of hospitalized heart failure. HdiCF findings were consistent with studies suggesting SGLT2i to be more beneficial for patients with pre-existing heart failure or chronic kidney disease. HdiCF is not dependent on prior hypotheses about HTEs and identifies subgroups with markers for potential HTEs in real-world evidence studies where active-comparator, new-user study designs limit the potential for unmeasured confounding.

Download full-text PDF

Source
http://dx.doi.org/10.1093/aje/kwae322DOI Listing

Publication Analysis

Top Keywords

identify subgroups
12
iterative causal
8
causal forest
8
claims data
8
subgrouping algorithm
8
predefined covariates
8
propensity score
8
high-dimensional feature
8
heart failure
8
hdicf
6

Similar Publications

Background: Takayasu arteritis (TAK) and giant cell arteritis (GCA), the most common forms of large-vessel vasculitis (LVV), can result in serious morbidity. Understanding the molecular basis of LVV should aid in developing better biomarkers and treatments.

Methods: Plasma proteomic profiling of 184 proteins was performed in two cohorts.

View Article and Find Full Text PDF

Background: Red blood cell distribution width (RDW) is associated with the development and progression of various diseases.

Aim: To explore the association between pretreatment RDW and short-term outcomes after laparoscopic pancreatoduodenectomy (LPD).

Methods: A total of 804 consecutive patients who underwent LPD at our hospital between March 2017 and November 2021 were retrospectively analyzed.

View Article and Find Full Text PDF

Background And Objective: Scabies is the second most common cause of disability due to skin disease in the Philippines. However, there were no cited studies in Global Burden of Disease 2019 and the disability-adjusted life years (DALY) computations were most likely based on statistical modelling. The Philippine Department of Health has embarked on a program to estimate the disease burden of priority diseases in the country, which include scabies.

View Article and Find Full Text PDF

Background: Valvular heart disease (VHD) management has evolved rapidly in recent decades, but disparities in health care access persist among countries with varying socioeconomic backgrounds.

Objectives: The purpose of this study was to investigate global mortality trends from VHD and assess the difference between middle- and high-income countries.

Methods: We obtained mortality data from the World Health Organization Mortality Database for VHD and its subgroups (rheumatic valvular disease [RVD], infective endocarditis [IE], aortic stenosis [AS], and mitral regurgitation [MR]) from 2000 to 2019.

View Article and Find Full Text PDF

Background: Coronary artery disease (CAD) and acute myocardial infarction (AMI) still pose a significant burden to the health care system, affecting population subgroups differently.

Objectives: The purpose of the study was to describe age, sex, and racial disparities in mortality rates for CAD and AMI in the United States between 2000 and 2020.

Methods: This was an ecological study with trend analysis of mortality rates using data from the National Centers for Disease Control and Prevention surveillance databases.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!