In this article, we propose a Dual Relation-aware Attention Network (DRANet) to handle the task of scene segmentation. How to efficiently exploit context is essential for pixel-level recognition. To address the issue, we adaptively capture contextual information based on the relation-aware attention mechanism. Especially, we append two types of attention modules on the top of the dilated fully convolutional network (FCN), which model the contextual dependencies in spatial and channel dimensions, respectively. In the attention modules, we adopt a self-attention mechanism to model semantic associations between any two pixels or channels. Each pixel or channel can adaptively aggregate context from all pixels or channels according to their correlations. To reduce the high cost of computation and memory caused by the abovementioned pairwise association computation, we further design two types of compact attention modules. In the compact attention modules, each pixel or channel is built into association only with a few numbers of gathering centers and obtains corresponding context aggregation over these gathering centers. Meanwhile, we add a cross-level gating decoder to selectively enhance spatial details that boost the performance of the network. We conduct extensive experiments to validate the effectiveness of our network and achieve new state-of-the-art segmentation performance on four challenging scene segmentation data sets, i.e., Cityscapes, ADE20K, PASCAL Context, and COCO Stuff data sets. In particular, a Mean IoU score of 82.9% on the Cityscapes test set is achieved without using extra coarse annotated data.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2020.3006524DOI Listing

Publication Analysis

Top Keywords

attention modules
16
scene segmentation
12
relation-aware attention
12
dual relation-aware
8
attention network
8
pixels channels
8
pixel channel
8
compact attention
8
gathering centers
8
data sets
8

Similar Publications

This study utilizes the Breast Ultrasound Image (BUSI) dataset to present a deep learning technique for breast tumor segmentation based on a modified UNet architecture. To improve segmentation accuracy, the model integrates attention mechanisms, such as the Convolutional Block Attention Module (CBAM) and Non-Local Attention, with advanced encoder architectures, including ResNet, DenseNet, and EfficientNet. These attention mechanisms enable the model to focus more effectively on relevant tumor areas, resulting in significant performance improvements.

View Article and Find Full Text PDF

An effective vessel segmentation method using SLOA-HGC.

Sci Rep

January 2025

Faculty of Electronic Information and Physics, Central South University of Forestry and Technology, Changsha, 410004, Hunan, China.

Accurate segmentation of retinal blood vessels from retinal images is crucial for detecting and diagnosing a wide range of ophthalmic diseases. Our retinal blood vessel segmentation algorithm enhances microfine vessel extraction, improves edge texture clarity, and normalizes vessel distribution. It stabilizes neural network training for complex retinal vascular features.

View Article and Find Full Text PDF

Bearings are critical in mechanical systems, as their health impacts system reliability. Proactive monitoring and diagnosing of bearing faults can prevent significant safety issues. Among various diagnostic methods that analyze bearing vibration signals, deep learning is notably effective.

View Article and Find Full Text PDF

Background: The pandemic emergent disease multisystem inflammatory syndrome in children (MIS-C) following coronavirus disease-19 infection can mimic endemic typhus. We aimed to use artificial intelligence (AI) to develop a clinical decision support system that accurately distinguishes MIS-C versus Endemic Typhus (MET).

Methods: Demographic, clinical, and laboratory features rapidly available following presentation were extracted for 133 patients with MIS-C and 87 patients hospitalized due to typhus.

View Article and Find Full Text PDF

Dual-domain Wasserstein generative adversarial network with hybrid loss for low-dose CT imaging.

Phys Med Biol

January 2025

Capital Normal University, 105, North West Sanhuan Road, Haidian District, Beijing, Beijing, None Selected, 100048, CHINA.

Objective: Low-dose computed tomography (LDCT) has gained significant attention in hospitals and clinics as a popular imaging modality for reducing the risk of X-ray radiation. However, reconstructed LDCT images often suffer from undesired noise and artifacts, which can negatively impact diagnostic accuracy. This study aims to develop a novel approach to improve LDCT imaging performance.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!