Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation.

IEEE Trans Pattern Anal Mach Intell

Published: May 2023

We propose a novel model named Multi-Channel Attention Selection Generative Adversarial Network (SelectionGAN) for guided image-to-image translation, where we translate an input image into another while respecting an external semantic guidance. The proposed SelectionGAN explicitly utilizes the semantic guidance information and consists of two stages. In the first stage, the input image and the conditional semantic guidance are fed into a cycled semantic-guided generation network to produce initial coarse results. In the second stage, we refine the initial results by using the proposed multi-scale spatial pooling & channel selection module and the multi-channel attention selection module. Moreover, uncertainty maps automatically learned from attention maps are used to guide the pixel loss for better network optimization. Exhaustive experiments on four challenging guided image-to-image translation tasks (face, hand, body, and street view) demonstrate that our SelectionGAN is able to generate significantly better results than the state-of-the-art methods. Meanwhile, the proposed framework and modules are unified solutions and can be applied to solve other generation tasks such as semantic image synthesis. The code is available at https://github.com/Ha0Tang/SelectionGAN.

Download full-text PDF	Source
http://dx.doi.org/10.1109/TPAMI.2022.3212915	DOI Listing

Publication Analysis

Top Keywords

multi-channel attention

attention selection

guided image-to-image

image-to-image translation

semantic guidance

input image

selection module

selection

selection gans

gans guided

Similar Publications

Multi-channel spatio-temporal graph attention contrastive network for brain disease diagnosis.

Neuroimage

January 2025

College of Artificial Intelligence, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, China; Key Laboratory of Brain-Machine Intelligence Technology, Ministry of Education, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, China. Electronic address:

Chaojun Li Kai Ma Shengrong Li Xiangshui Meng Ran Wang

Dynamic brain networks (DBNs) can capture the intricate connections and temporal evolution among brain regions, becoming increasingly crucial in the diagnosis of neurological disorders. However, most existing researches tend to focus on isolated brain network sequence segmented by sliding windows, and they are difficult to effectively uncover the higher-order spatio-temporal topological pattern in DBNs. Meantime, it remains a challenge to utilize the structure connectivity prior in the DBNs analysis.

View Article and Find Full Text PDF

Similar Publications

Linking multiple-channel information seeking and lifestyle among Chinese older adults: A moderation and mediation analysis.

Digit Health

January 2025

Department of Communication, University of Macau, Taipa, Macao, China.

Qingrui Li Yifang Wu Xinshu Zhao

Background: Healthy lifestyle improvement of older Chinese adults has drawn a lot of attention due to an exceeding ageing population in mainland China. The current study aims to investigate the beneficial functions of the multi-channel health information seeking on elders' lifestyle self-management.

Objective: We conducted a mediation analysis to test the association between multi-channel information seeking behavior and lifestyle self-management, which mediates by perceived self-management competence.

View Article and Find Full Text PDF

Similar Publications

MambaPose: A Human Pose Estimation Based on Gated Feedforward Network and Mamba.

Sensors (Basel)

December 2024

School of Electronic Information Engineering, Taiyuan University of Science and Technology, Taiyuan 030024, China.

Jianqiang Zhang Jing Hou Qiusheng He Zhengwei Yuan Hao Xue

Human pose estimation is an important research direction in the field of computer vision, which aims to accurately identify the position and posture of keypoints of the human body through images or videos. However, multi-person pose estimation yields false detection or missed detection in dense crowds, and it is still difficult to detect small targets. In this paper, we propose a Mamba-based human pose estimation.

View Article and Find Full Text PDF

Similar Publications

Improving the Performance of Electrotactile Brain-Computer Interface Using Machine Learning Methods on Multi-Channel Features of Somatosensory Event-Related Potentials.

Sensors (Basel)

December 2024

School of Electrical Engineering, University of Belgrade, 11000 Belgrade, Serbia.

Marija Novičić Olivera Djordjević Vera Miler-Jerković Ljubica Konstantinović Andrej M Savić

Traditional tactile brain-computer interfaces (BCIs), particularly those based on steady-state somatosensory-evoked potentials, face challenges such as lower accuracy, reduced bit rates, and the need for spatially distant stimulation points. In contrast, using transient electrical stimuli offers a promising alternative for generating tactile BCI control signals: somatosensory event-related potentials (sERPs). This study aimed to optimize the performance of a novel electrotactile BCI by employing advanced feature extraction and machine learning techniques on sERP signals for the classification of users' selective tactile attention.

View Article and Find Full Text PDF

Similar Publications

Emotion Recognition Model of EEG Signals Based on Double Attention Mechanism.

Brain Sci

December 2024

West China Institute of Children's Brain and Cognition, Chongqing University of Education, Chongqing 400065, China.

Yahong Ma Zhentao Huang Yuyao Yang Shanwen Zhang Qi Dong

Background: Emotions play a crucial role in people's lives, profoundly affecting their cognition, decision-making, and interpersonal communication. Emotion recognition based on brain signals has become a significant challenge in the fields of affective computing and human-computer interaction.

Methods: Addressing the issue of inaccurate feature extraction and low accuracy of existing deep learning models in emotion recognition, this paper proposes a multi-channel automatic classification model for emotion EEG signals named DACB, which is based on dual attention mechanisms, convolutional neural networks, and bidirectional long short-term memory networks.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!