Introduction: Body mass index (BMI) is inadequately recorded in US administrative claims databases. We aimed to validate the sensitivity and positive predictive value (PPV) of BMI-related diagnosis codes using an electronic medical records (EMR) claims-linked database. Additionally, we applied machine learning (ML) to identify features in US claims databases to predict obesity status.

Research Design And Methods: This observational, retrospective analysis included 692 119 people ≥18 years of age, with ≥1 BMI reading in MarketScan Explorys Claims-EMR data (January 2013-December 2019). Claims-based obesity status was compared with EMR-based BMI (gold standard) to assess BMI-related diagnosis code sensitivity and PPV. Logistic regression (LR), penalized LR with L1 penalty (Least Absolute Shrinkage and Selection Operator), extreme gradient boosting (XGBoost) and random forest, with features drawn from insurance claims, were trained to predict obesity status (BMI≥30 kg/m) from EMR as the gold standard. Model performance was compared using several metrics, including the area under the receiver operating characteristic curve. The best-performing model was applied to assess feature importance. Obesity risk scores were computed from the best model generated from the claims database and compared against the BMI recorded in the EMR.

Results: The PPV of diagnosis codes from claims alone remained high over the study period (85.4-89.2%); sensitivity was low (16.8-44.8%). XGBoost performed the best at predicting obesity with the highest area under the curve (AUC; 79.4%) and the lowest Brier score. The number of obesity diagnoses and obesity diagnoses from inpatient settings were the most important predictors of obesity. XGBoost showed an AUC of 74.1% when trained without an obesity diagnosis.

Conclusions: Obesity prevalence is under-reported in claims databases. ML models, with or without explicit obesity, show promise in improving obesity prediction accuracy compared with obesity codes alone. Improved obesity status prediction may assist practitioners and payors to estimate the burden of obesity and investigate the potential unmet needs of current treatments.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11429277PMC
http://dx.doi.org/10.1136/bmjdrc-2024-004193DOI Listing

Publication Analysis

Top Keywords

obesity
16
claims databases
12
obesity status
12
machine learning
8
predicting obesity
8
obesity risk
8
administrative claims
8
claims database
8
bmi-related diagnosis
8
diagnosis codes
8

Similar Publications

Background: The pathogenesis of non-alcoholic fatty liver disease (NAFLD) with a global prevalence of 30% is multifactorial and the involvement of gut bacteria has been recently proposed. However, finding robust bacterial signatures of NAFLD has been a great challenge, mainly due to its co-occurrence with other metabolic diseases.

Results: Here, we collected public metagenomic data and integrated the taxonomy profiles with in silico generated community metabolic outputs, and detailed clinical data, of 1206 Chinese subjects w/wo metabolic diseases, including NAFLD (obese and lean), obesity, T2D, hypertension, and atherosclerosis.

View Article and Find Full Text PDF

Background: Hepatocellular carcinoma (HCC) genetic/transcriptomic signatures have been widely described. However, its proteomic characterization is incomplete. We performed non-targeted quantitative proteomics of HCC samples and explored its clinical, functional, and molecular consequences.

View Article and Find Full Text PDF

Association between Mediterranean diet and metabolic health status among adults was not mediated through serum adropin levels.

BMC Public Health

January 2025

Department of Community Nutrition, School of Nutrition and Food Science, Nutrition and Food Security Research Center, Isfahan University of Medical Sciences, PO Box 81745-151, Isfahan, Iran.

Background: Prevalence of metabolic disorders has been increased in recent years around the world. The relationship between Mediterranean diet (MD) with metabolic health status and serum adropin levels has been less examined in Iranian adults. We investigated the association between MD compliance with metabolic health status and adropin hormone in Iranian adults.

View Article and Find Full Text PDF

Background: Overweight and obesity are global issues, especially among women of childbearing age, linked to adverse maternal and neonatal outcomes. These risks vary by age, race, and ethnicity, with increasing rates among immigrant and minority women. This study compares overweight and obesity rates, pregnancy weight gain, and neonatal outcomes in Turkish and Syrian immigrant/refugee women.

View Article and Find Full Text PDF

An introduction to MyBFF@school, a school-based childhood obesity intervention program: a cluster randomized controlled trial.

BMC Public Health

January 2025

Department of Pediatrics, Faculty of Medicine, Universiti Malaya, Kuala Lumpur, Wilayah Persekutuan Kuala Lumpur, 50603, Malaysia.

Obesity trend among Malaysian children is on the rise. Noting that the tendency for them to grow into obese adults and the relationship of obesity to many non-communicable diseases, the My Body is Fit and Fabulous at School (MyBFF@school program) was designed to combat obesity among the schoolchildren. The program was piloted in 2014 in Putrajaya, Malaysia.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!