Background: Sepsis is a life-threatening disease associated with a high mortality rate, emphasizing the need for the exploration of novel models to predict the prognosis of this patient population. This study compared the performance of traditional logistic regression and machine learning models in predicting adult sepsis mortality.

Objective: To develop an optimum model for predicting the mortality of adult sepsis patients based on comparing traditional logistic regression and machine learning methodology.

Methods: Retrospective analysis was conducted on 606 adult sepsis inpatients at our medical center between January 2020 and December 2022, who were randomly divided into training and validation sets in a 7:3 ratio. Traditional logistic regression and machine learning methods were employed to assess the predictive ability of mortality in adult sepsis. Univariate analysis identified independent risk factors for the logistic regression model, while Least Absolute Shrinkage and Selection Operator (LASSO) regression facilitated variable shrinkage and selection for the machine learning model. Among various machine learning models, which included Bagged Tree, , , , , , and , the one with the maximum area under the curve (AUC) was chosen for model construction. Model validation and comparison with the Sequential Organ Failure Assessment (SOFA) and the Acute Physiology and Chronic Health Evaluation (APACHE) scores were performed using receiver operating characteristic (ROC) curves, calibration curves, and decision curve analysis (DCA) curves in the validation set.

Results: Univariate analysis was employed to assess 17 variables, namely gender, history of coronary heart disease (CHD), systolic pressure, white blood cell (WBC), neutrophil count (NEUT), lymphocyte count (LYMP), lactic acid, neutrophil-to-lymphocyte ratio (NLR), red blood cell distribution width (RDW), interleukin-6 (IL-6), prothrombin time (PT), international normalized ratio (INR), fibrinogen (FBI), D-dimer, aspartate aminotransferase (AST), total bilirubin (Tbil), and lung infection. Significant differences ( < 0.05) between the survival and non-survival groups were observed for these variables. Utilizing stepwise regression with the "backward" method, independent risk factors, including systolic pressure, lactic acid, NLR, RDW, IL-6, PT, and Tbil, were identified. These factors were then incorporated into a logistic regression model, chosen based on the minimum Akaike Information Criterion (AIC) value (98.65). Machine learning techniques were also applied, and the RF model, demonstrating the maximum Area Under the Curve (AUC) of 0.999, was selected. LASSO regression, employing the lambda.1SE criteria, identified systolic pressure, lactic acid, NEUT, RDW, IL6, INR, and Tbil as variables for constructing the RF model, validated through ten-fold cross-validation. For model validation and comparison with traditional logistic models, SOFA, and APACHE scoring.

Conclusion: Based on deep machine learning principles, the RF model demonstrates advantages over traditional logistic regression models in predicting adult sepsis prognosis. The RF model holds significant potential for clinical surveillance and interventions to enhance outcomes for sepsis patients.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11743956PMC
http://dx.doi.org/10.3389/fmed.2024.1496869DOI Listing

Publication Analysis

Top Keywords

machine learning
24
logistic regression
20
adult sepsis
20
traditional logistic
16
regression machine
16
mortality adult
12
predicting mortality
8
sepsis patients
8
learning models
8
employed assess
8

Similar Publications

Identification of circadian rhythm-related biomarkers and development of diagnostic models for Crohn's disease using machine learning algorithms.

Comput Methods Biomech Biomed Engin

January 2025

Department of Gastroenterolgy, The Second Affiliated Hospital of Chengdu Medical College, China National Nuclear Corporation 416 Hospital, Chengdu, China.

The global rise in Crohn's Disease (CD) incidence has intensified diagnostic challenges. This study identified circadian rhythm-related biomarkers for CD using datasets from the GEO database. Differentially expressed genes underwent Weighted Gene Co-Expression Network Analysis, with 49 hub genes intersected from GeneCards data.

View Article and Find Full Text PDF

Context.—: Generative artificial intelligence (AI) has emerged as a transformative force in various fields, including anatomic pathology, where it offers the potential to significantly enhance diagnostic accuracy, workflow efficiency, and research capabilities.

Objective.

View Article and Find Full Text PDF

In this research, a green approach utilizing deep eutectic solvent liquid-liquid microextraction is combined with smartphone digital image colorimetry for the determination of boron in nut samples. A smartphone camera was used to capture the image of the analyte extract located in a custom-made colorimetric box. Using ImageJ software, the images were split into RGB channels, with the green channel identified as the optimum.

View Article and Find Full Text PDF

Assessing water quality restoration measures in Lake Pampulha (Brazil) through remote sensing imagery.

Environ Sci Pollut Res Int

January 2025

LEESU, Ecole des Ponts Paris Tech, UPEC, AgroParisTech, F-77455 Marne-la-Vallée, Paris, France.

Urban reservoirs are frequently exposed to impacts from high population density, polluting activities, and the absence of environmental control measures and monitoring. In this study, we investigated the use of satellite imagery to assess restoration measures and support decision-making in a hypereutrophic urban reservoir. Since 2016, Lake Pampulha (Brazil) has undergone restoration measures, including the application of Phoslock®, to mitigate its poor water quality conditions.

View Article and Find Full Text PDF

Objective: Despite the identification of various prognostic factors for anaplastic thyroid carcinoma (ATC) patients over the years, a precise prognostic tool for these patients is still lacking. This study aimed to develop and validate a prognostic model for predicting survival outcomes for ATC patients using random survival forests (RSF), a machine learning algorithm.

Methods: A total of 1222 ATC patients were extracted from the Surveillance, Epidemiology, and End Results (SEER) database and randomly divided into a training set of 855 patients and a validation set of 367 patients.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!