Objective: Machine learning (ML) has enabled healthcare discoveries by facilitating efficient modeling, such as for cancer screening. Unlike clinical trial data, real-world data used in ML are often gathered for multiple purposes, leading to bias and missing information for a specific classification task. This challenge is especially pronounced in healthcare because of stringent ethical considerations and resource constraints. This study proposed an integrated approach to enhance the quality of health evidence from a classification task for predicting Medicare's Diagnosis-Related Groups of ischemic heart disease (IHD) patients.

Methods: Eligible participants were identified from the Medical Information Mart for Intensive Care IV (MIMIC IV), a publicly available hospital database. Six ML models were selected for model triangulation. Sequential triangulation was employed via Local Process Mining (LPM) and Qualitative Comparative Analysis (QCA).
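
As a rough illustration of how model triangulation can be read at the case level, the sketch below flags every case that at least one model misclassifies. It is a minimal sketch, not the study's implementation: the function name `any_model_error_rate`, the model names, and the toy labels are all hypothetical, and only three models are shown instead of six to keep the example short.

```python
from typing import Dict, Sequence


def any_model_error_rate(
    y_true: Sequence[str], predictions: Dict[str, Sequence[str]]
) -> float:
    """Return the share of cases misclassified by at least one model.

    `predictions` maps a (hypothetical) model name to its predicted labels,
    given in the same case order as `y_true`.
    """
    n = len(y_true)
    if n == 0:
        raise ValueError("y_true must contain at least one case")
    for name, y_pred in predictions.items():
        if len(y_pred) != n:
            raise ValueError(f"{name} has {len(y_pred)} predictions, expected {n}")

    # A case is flagged as soon as any single model disagrees with the label.
    flagged = sum(
        1
        for i in range(n)
        if any(y_pred[i] != y_true[i] for y_pred in predictions.values())
    )
    return flagged / n


# Toy labels standing in for DRG classes (assumed values, not study data).
y_true = ["A", "B", "A", "C"]
preds = {
    "model_1": ["A", "B", "A", "C"],
    "model_2": ["A", "B", "B", "C"],  # wrong on the third case
    "model_3": ["A", "B", "A", "C"],
}
rate = any_model_error_rate(y_true, preds)
print(rate)  # 0.25
```

Counting a case as an error when any model misses it is a stricter criterion than a per-model error rate, which is why it suits a triangulation setting where agreement across models is the point.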

Results: A total of 1545 IHD hospitalizations from 916 patients were identified from MIMIC IV. Eight health process features were identified through LPM and aligned with clinical knowledge. The correlation coefficients for process features, ranging from 0.24 to 0.42, were higher than those for non-process features, which ranged from 0.02 to 0.36. A total of 56 unique combinations were identified from the QCA, with 28 configurations having raw coverage lower than 1.0%. The overall model performance (i.e., weighted F1 and area under the curve scores) increased after adopting this integrated approach. The proportion of cases misclassified by any of the six models decreased by 47% after incorporating process features (from 5.29% to 2.91%) and further decreased to 0.0% after applying the QCA solutions.
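
For readers unfamiliar with the raw coverage figure reported above, the following sketch computes raw coverage and consistency for a single configuration using the standard set-theoretic definitions from the QCA literature. It assumes crisp-set (0/1) membership scores; the abstract does not state which QCA variant or software was used, and the function names and toy data are hypothetical.

```python
from typing import Sequence


def raw_coverage(membership: Sequence[float], outcome: Sequence[float]) -> float:
    """Share of outcome membership that the configuration accounts for.

    raw coverage = sum(min(x_i, y_i)) / sum(y_i)
    """
    total_outcome = sum(outcome)
    if total_outcome == 0:
        raise ValueError("no case shows the outcome")
    overlap = sum(min(x, y) for x, y in zip(membership, outcome))
    return overlap / total_outcome


def consistency(membership: Sequence[float], outcome: Sequence[float]) -> float:
    """Share of configuration membership that also shows the outcome.

    consistency = sum(min(x_i, y_i)) / sum(x_i)
    """
    total_membership = sum(membership)
    if total_membership == 0:
        raise ValueError("no case belongs to the configuration")
    overlap = sum(min(x, y) for x, y in zip(membership, outcome))
    return overlap / total_membership


# Toy data (assumed): 1 means the case belongs to the set, 0 means it does not.
in_configuration = [1, 1, 0, 0, 1]
has_outcome = [1, 1, 1, 0, 0]

cov = raw_coverage(in_configuration, has_outcome)
con = consistency(in_configuration, has_outcome)
print(round(cov, 3), round(con, 3))  # 0.667 0.667
```

Under this definition, a configuration with raw coverage below 1.0% accounts for fewer than one in a hundred of the cases that show the outcome.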

Conclusion: The integrated approach demonstrated its ability to enhance the quality of a classification task through its clinical relevance, improved model performance, and reduced case-level error rates. However, more scalable QCA methods are needed for larger datasets. Developing health process feature engineering for broader applications is a possible future direction.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11748077
DOI: http://dx.doi.org/10.1177/20552076251314097
