Missing-modality enabled multi-modal fusion architecture for medical data.

J Biomed Inform

School of Biomedical Engineering, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China; Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China. Electronic address:

Published: February 2025

Background: Fusion of multi-modal data can improve the performance of deep learning models. However, missing modalities are common in medical data due to patient specificity, which is detrimental to the performance of multi-modal models in applications. Therefore, it is critical to adapt the models to missing modalities.

Objective: This study aimed to develop an effective multi-modal fusion architecture for medical data that was robust to missing modalities and further improved the performance for clinical tasks.

Methods: X-ray chest radiographs for the image modality, radiology reports for the text modality, and structured value data for the tabular data modality were fused in this study. Each modality pair was fused with a Transformer-based bi-modal fusion module, and the three bi-modal fusion modules were then combined into a tri-modal fusion framework. Additionally, multivariate loss functions were introduced into the training process to improve models' robustness to missing modalities during the inference process. Finally, we designed comparison and ablation experiments to validate the effectiveness of the fusion, the robustness to missing modalities, and the enhancements from each key component. Experiments were conducted on MIMIC-IV and MIMIC-CXR datasets with the 14-label disease diagnosis and patient in-hospital mortality prediction task The area under the receiver operating characteristic curve (AUROC) and the area under the precision-recall curve (AUPRC) were used to evaluate models' performance.

Results: Our proposed architecture showed superior predictive performance, achieving the average AUROC and AUPRC of 0.916 and 0.551 in the 14-label classification task, 0.816 and 0.392 in the mortality prediction task. while the best average AUROC and AUPRC among the comparison methods were 0.876, 0.492 in the 14-label classification task and 0.806, 0.366 in the mortality prediction task. Both metrics decreased only slightly when tested with modal-incomplete data. Different levels of enhancements were achieved through three key components.

Conclusions: The proposed multi-modal fusion architecture effectively fused three modalities and showed strong robustness to missing modalities. This architecture holds promise for scaling up to more modalities to enhance the clinical practicality of the model.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jbi.2025.104796DOI Listing

Publication Analysis

Top Keywords

missing modalities
20
multi-modal fusion
12
fusion architecture
12
medical data
12
robustness missing
12
mortality prediction
12
prediction task
12
fusion
8
architecture medical
8
models missing
8

Similar Publications

Persistence and Recurrence of Primary Hyperparathyroidism.

Best Pract Res Clin Endocrinol Metab

March 2025

Division of Endocrinology and Metabolism, McMaster University, Hamilton, ON, Canada. Electronic address:

Persistent and recurrent primary hyperparathyroidism (PHPT) represent significant challenges in the management of PHPT. Persistent PHPT is defined as persistence of hypercalcemia following parathyroidectomy (PTX) or the recurrence of hypercalcemia within the first 6 months following surgery. Recurrent PHPT is defined as recurrence of hypercalcemia after 6 months following PTX and requires normalization of serum calcium prior to the recurrence.

View Article and Find Full Text PDF

Purpose: Despite the growing adoption of HIFU treatment for localized prostate cancer (PC), standardized criteria for evaluating success and predicting recurrence remain undefined. Herein, we analyze the predictive value of noninvasive tools such as PSA dynamics and MRI to determine recurrence.

Methods: We identified from our HIFU therapy prospective registry patients who developed biopsy-proven recurrence, between 2016 and 2023.

View Article and Find Full Text PDF

Background And Objectives: It is unknown whether acute CT perfusion (CTP) core imaging may underestimate the follow-up infarct. We hypothesize that infarct underestimation occurs especially in late-presenting patients and that underestimated infarct can partially be detected on baseline noncontrast CT (NCCT).

Methods: We included patients with acute anterior circulation ischemic stroke who underwent baseline NCCT and CTP imaging, complete endovascular reperfusion, and follow-up MRI from the Endovascular Therapy Following Imaging Evaluation for Ischemic Stroke (DEFUSE 3) trial and a consecutive, monocenter cohort.

View Article and Find Full Text PDF

Aim: The purpose of this study was to evaluate the diagnostic value of ultrasound compared to magnetic resonance imaging (MRI) as a reference in detecting peroneus brevis split ruptures.

Material And Methods: We re-reviewed 112 ultrasound examinations performed between 2020 and 2021 by three musculoskeletal radiologists with 8-10 years of experience. Patients were referred due to pain lasting at least 8 months in the posterolateral ankle.

View Article and Find Full Text PDF

Polysomnography (PSG) is a type of sleep study that records multimodal physiological signals and is widely used for purposes such as sleep staging and respiratory event detection. Conventional machine learning methods assume that each sleep study is associated with a fixed set of observed modalities and that all modalities are available for each sample. However, noisy and missing modalities are a common issue in real-world clinical settings.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!