Multi-modal medical Transformers: A meta-analysis for medical image segmentation in oncology.

Comput Med Imaging Graph

LaTIM UMR 1101, Inserm, Brest, France; IMT Atlantique, Brest, France. Electronic address:

Published: December 2023

Multi-modal medical image segmentation is a crucial task in oncology that enables the precise localization and quantification of tumors. The aim of this work is to present a meta-analysis of the use of multi-modal medical Transformers for medical image segmentation in oncology, specifically focusing on multi-parametric MR brain tumor segmentation (BraTS2021), and head and neck tumor segmentation using PET-CT images (HECKTOR2021). The multi-modal medical Transformer architectures presented in this work exploit the idea of modality interaction schemes based on visio-linguistic representations: (i) single-stream, where modalities are jointly processed by one Transformer encoder, and (ii) multiple-stream, where the inputs are encoded separately before being jointly modeled. A total of fourteen multi-modal architectures are evaluated using different ranking strategies based on dice similarity coefficient (DSC) and average symmetric surface distance (ASSD) metrics. In addition, cost indicators such as the number of trainable parameters and the number of multiply-accumulate operations (MACs) are reported. The results demonstrate that multi-path hybrid CNN-Transformer-based models improve segmentation accuracy when compared to traditional methods, but come at the cost of increased computation time and potentially larger model size.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compmedimag.2023.102308DOI Listing

Publication Analysis

Top Keywords

multi-modal medical
16
medical image
12
image segmentation
12
medical transformers
8
segmentation oncology
8
tumor segmentation
8
segmentation
6
multi-modal
5
medical
5
transformers meta-analysis
4

Similar Publications

Background: People living with HIV (PLWH), especially immunological non-responders (INRs), may experience adverse neurologic events. However, the extent of neurological impairment in INRs remains uncertain. This study evaluates brain structure and function, immune dysregulation, and peripheral immunomarkers in INRs and immunological responders (IRs) among PLWH, classified according to immunological response criteria, within a clinical research setting.

View Article and Find Full Text PDF

Multi-modal cross-domain self-supervised pre-training for fMRI and EEG fusion.

Neural Netw

December 2024

Department of Electrical and Computer Engineering, Lehigh University, Bethlehem, PA 18015, USA; Department of Bioengineering, Lehigh University, Bethlehem, PA 18015, USA. Electronic address:

Neuroimaging techniques including functional magnetic resonance imaging (fMRI) and electroencephalogram (EEG) have shown promise in detecting functional abnormalities in various brain disorders. However, existing studies often focus on a single domain or modality, neglecting the valuable complementary information offered by multiple domains from both fMRI and EEG, which is crucial for a comprehensive representation of disorder pathology. This limitation poses a challenge in effectively leveraging the synergistic information derived from these modalities.

View Article and Find Full Text PDF

Multi-modal medical images are important in tumor lesion detection. However, the existing detection models only use single-modal to detect lesions, a multi-modal semantic correlation is not enough to consider and lacks ability to express the shape, size, and contrast degree features of lesions. A Cross Modal YOLOv5 model (CMYOLOv5) is proposed.

View Article and Find Full Text PDF

Health event prediction is empowered by the rapid and wide application of electronic health records (EHR). In the Intensive Care Unit (ICU), precisely predicting the health related events in advance is essential for providing treatment and intervention to improve the patients outcomes. EHR is a kind of multi-modal data containing clinical text, time series, structured data, etc.

View Article and Find Full Text PDF

Background And Objective: Cerebral aneurysms occur as balloon-like outpouchings in an artery, which commonly develop at the weak curved regions and bifurcations. When aneurysms are detected, understanding the risk of rupture is of immense clinical value for better patient management. Towards this, Fluid-Structure Interaction (FSI) studies can improve our understanding of the mechanics behind aneurysm initiation, progression, and rupture.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!