CFATransUnet: Channel-wise cross fusion attention and transformer for 2D medical image segmentation.

Comput Biol Med

Department of Optical Science and Engineering, Fudan University, Shanghai 200433, China; Academy for Engineering and Technology, Fudan University, Shanghai 200433, China; Zhuhai Fudan Innovation Institute, Zhuhai 519031, China.

Published: January 2024

Medical image segmentation still faces two challenges: effectively extracting and fusing long-range and local semantic information, and mitigating or eliminating semantic gaps during encoding and decoding. To alleviate both problems, we propose CFATransUnet, a new U-shaped network that uses Transformer and CNN blocks as its backbone and is equipped with a Channel-wise Cross Fusion Attention and Transformer (CCFAT) module, which contains a Channel-wise Cross Fusion Transformer (CCFT) and a Channel-wise Cross Fusion Attention (CCFA) component. Specifically, we build both the encoder and the decoder from Transformer and CNN blocks so that long-range and local semantic features are adequately extracted and fused. The CCFT module uses self-attention to reintegrate semantic information from different stages into cross-level global features, reducing the semantic asymmetry between features at different levels. The CCFA module learns the importance of each feature channel from a global perspective, emphasizing informative channels and suppressing unimportant ones to mitigate semantic gaps. Together, CCFT and CCFA guide the fusion of features from different levels more effectively with a global perspective. The consistent architecture of the encoder and decoder also alleviates the semantic gap. Experimental results suggest that the proposed CFATransUnet achieves state-of-the-art performance on four datasets. The code is available at https://github.com/CPU0808066/CFATransUnet.
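
The channel-wise reweighting idea attributed to CCFA above can be illustrated with a minimal sketch. This is not the authors' implementation (their code is in the linked repository); it is a generic squeeze-and-excitation-style gate written in plain Python, and the function names, the single linear layer, and the example weights are all assumptions made for illustration.

```python
import math

def global_average_pool(feature_map):
    """Summarize each channel (an H x W grid) by its mean value."""
    return [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_map
    ]

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def channel_gates(descriptor, weights, bias):
    """One linear layer plus sigmoid: maps C channel means to C gates in (0, 1).

    `weights` (C x C) and `bias` (length C) stand in for learned parameters.
    """
    return [
        sigmoid(sum(w * d for w, d in zip(row, descriptor)) + b)
        for row, b in zip(weights, bias)
    ]

def channel_attention(feature_map, weights, bias):
    """Rescale every channel by its gate; returns (reweighted map, gates)."""
    gates = channel_gates(global_average_pool(feature_map), weights, bias)
    reweighted = [
        [[g * value for value in row] for row in channel]
        for g, channel in zip(gates, feature_map)
    ]
    return reweighted, gates

if __name__ == "__main__":
    # Two 2x2 channels; hypothetical weights that favor channel 0.
    features = [[[1.0, 1.0], [1.0, 1.0]], [[4.0, 0.0], [0.0, 0.0]]]
    weights = [[2.0, 0.0], [0.0, -2.0]]
    _, gates = channel_attention(features, weights, [0.0, 0.0])
    print(gates)  # first gate is closer to 1, second closer to 0
```

Each gate depends on a global summary of the whole feature map, which is what lets the network emphasize informative channels and suppress the rest; in a trained model the weights would be learned rather than set by hand.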

Source
http://dx.doi.org/10.1016/j.compbiomed.2023.107803

Publication Analysis

Top Keywords

channel-wise cross: 16
cross fusion: 16
fusion attention: 12
attention transformer: 8
medical image: 8
image segmentation: 8
local semantic: 8
semantic gaps: 8
transformer cnn: 8
cnn blocks: 8

Similar Publications

A conflict-free multi-modal fusion network with spatial reinforcement transformers for brain tumor segmentation.

Comput Biol Med

December 2024

Shanghai Health Commission Key Lab of Artificial Intelligence (AI)-Based Management of Inflammation and Chronic Diseases, Sino-French Cooperative Central Lab, Gongli Hospital of Shanghai Pudong New Area, Shanghai 200135, China.

Brain gliomas are a leading cause of cancer mortality worldwide. Existing glioma segmentation approaches using multi-modal inputs often rely on a simplistic approach of stacking images from all modalities, disregarding modality-specific features that could optimize diagnostic outcomes. This paper introduces STE-Net, a spatial reinforcement hybrid Transformer-based tri-branch multi-modal evidential fusion network designed for conflict-free brain tumor segmentation.

Deep learning-based whole-brain B1+-mapping at 7T.

Magn Reson Med

October 2024

Physikalisch-Technische Bundesanstalt, Berlin, Germany.

Purpose: This study investigates the feasibility of using complex-valued neural networks (NNs) to estimate quantitative transmit magnetic RF field (B1+) maps from multi-slice localizer scans with different slice orientations in the human head at 7T, aiming to accelerate subject-specific B1+-calibration using parallel transmission (pTx).

Methods: Datasets containing channel-wise B1+-maps and corresponding multi-slice localizers were acquired in axial, sagittal, and coronal orientation in 15 healthy subjects utilizing an eight-channel pTx transceiver head coil. Training included five-fold cross-validation for four network configurations: three used transversal, sagittal, and coronal data, respectively, and a fourth was trained on all slice orientations.

MulCPred: Learning Multi-Modal Concepts for Explainable Pedestrian Action Prediction.

Sensors (Basel)

October 2024

Graduate School of Informatics, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8601, Japan.

Article Synopsis
  • The paper introduces MulCPred, a new framework designed to provide explainable predictions for pedestrian action, which is essential for applications like autonomous driving.
  • It addresses limitations of existing methods by using a linear aggregator for multi-modal concept integration, a channel-wise recalibration module for focusing on detailed input areas, and a regularization loss to capture diverse patterns.
  • Evaluation on various datasets shows that MulCPred enhances the explainability of predictions without significantly harming accuracy, and by filtering out unrecognizable concepts, it improves performance across different datasets.

Assessing the impact of ultrasound image standardization in deep learning-based segmentation of carotid plaque types.

Comput Methods Programs Biomed

December 2024

Department of Electrical Engineering, Computer Engineering and Informatics, Cyprus University of Technology, Limassol, Cyprus.

Background And Objective: Carotid B-mode ultrasound (CBUS) imaging is often used to detect and assess atherosclerotic plaques. Doctors often need to segment plaques in the CBUS images to further examine them. Multiple studies have proposed two-dimensional CBUS plaque segmentation deep learning (DL)-based solutions, achieving promising results.

Alpha (8-12 Hz) frequency band oscillations are among the most informative features in electroencephalographic (EEG) assessment of patients with disorders of consciousness (DoC). Because interareal alpha synchrony is thought to facilitate long-range communication in healthy brains, coherence measures of resting-state alpha oscillations may provide insights into a patient's capacity for higher-order cognition beyond channel-wise estimates of alpha power. In multi-channel EEG, global coherence methods may be used to augment standard spectral analysis methods by both estimating the strength and identifying the structure of coherent oscillatory networks.
