Prediction of Postoperative Lung Function in Lung Cancer Patients Using Machine Learning Models.

Tuberc Respir Dis (Seoul)

Division of Pulmonary, Critical Care and Sleep Medicine, Department of Internal Medicine, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea.

Published: July 2023

Background: Surgical resection is the standard treatment for early-stage lung cancer. Since postoperative lung function is related to mortality, predicted postoperative lung function is used to determine the treatment modality. The aim of this study was to evaluate the predictive performance of linear regression and machine learning models.

Methods: We extracted data from the Clinical Data Warehouse and developed three sets: set I, the linear regression model; set II, machine learning models omitting the missing data: and set III, machine learning models imputing the missing data. Six machine learning models, the least absolute shrinkage and selection operator (LASSO), Ridge regression, ElasticNet, Random Forest, eXtreme gradient boosting (XGBoost), and the light gradient boosting machine (LightGBM) were implemented. The forced expiratory volume in 1 second measured 6 months after surgery was defined as the outcome. Five-fold cross-validation was performed for hyperparameter tuning of the machine learning models. The dataset was split into training and test datasets at a 70:30 ratio. Implementation was done after dataset splitting in set III. Predictive performance was evaluated by R2 and mean squared error (MSE) in the three sets.

Results: A total of 1,487 patients were included in sets I and III and 896 patients were included in set II. In set I, the R2 value was 0.27 and in set II, LightGBM was the best model with the highest R2 value of 0.5 and the lowest MSE of 154.95. In set III, LightGBM was the best model with the highest R2 value of 0.56 and the lowest MSE of 174.07.

Conclusion: The LightGBM model showed the best performance in predicting postoperative lung function.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10323210PMC
http://dx.doi.org/10.4046/trd.2022.0048DOI Listing

Publication Analysis

Top Keywords

machine learning
24
learning models
20
postoperative lung
16
lung function
16
set iii
12
lung cancer
8
predictive performance
8
linear regression
8
set
8
missing data
8

Similar Publications

Who is coming in? Evaluation of physician performance within multi-physician emergency departments.

Am J Emerg Med

January 2025

Department of Emergency Medicine, Yale University School of Medicine, New Haven, CT, USA; Center for Outcomes Research and Evaluation, Yale University, New Haven, CT, USA.

Background: This study aimed to examine how physician performance metrics are affected by the speed of other attendings (co-attendings) concurrently staffing the ED.

Methods: A retrospective study was conducted using patient data from two EDs between January-2018 and February-2020. Machine learning was used to predict patient length of stay (LOS) conditional on being assigned a physician of average speed, using patient- and departmental-level variables.

View Article and Find Full Text PDF

Background: Large language models (LLMs) have been proposed as valuable tools in medical education and practice. The Chinese National Nursing Licensing Examination (CNNLE) presents unique challenges for LLMs due to its requirement for both deep domain-specific nursing knowledge and the ability to make complex clinical decisions, which differentiates it from more general medical examinations. However, their potential application in the CNNLE remains unexplored.

View Article and Find Full Text PDF

Prediction of Thermodynamic Properties of C-Based Fullerenols Using Machine Learning.

J Chem Theory Comput

January 2025

Guizhou Provincial Engineering Technology Research Center for Chemical Drug R&D, School of Pharmacy, Guizhou Medical University, Guiyang, Guizhou 550025, P. R. China.

Traditional machine learning methods face significant challenges in predicting the properties of highly symmetric molecules. In this study, we developed a machine learning model based on graph neural networks (GNNs) to accurately and swiftly predict the thermodynamic and photochemical properties of fullerenols, such as C(OH) ( = 1 to 30). First, we established a global method for generating fullerenol isomers through isomer fingerprinting, which can generate all possible isomers or produce diverse structural types on demand.

View Article and Find Full Text PDF

This study investigates the geochemical characteristics of rare earth elements (REEs) in highland karstic bauxite deposits located in the Sierra de Bahoruco, Pedernales Province, Dominican Republic. These deposits, formed through intense weathering of volcanic material, represent a potentially valuable REE resource for the nation. Surface and subsurface soil samples were analyzed using portable X-ray fluorescence (pXRF) and a NixPro 2 color sensor validated with inductively coupled plasma optical emission spectrometry (ICP-OES).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!