We propose a novel model named Multi-Channel Attention Selection Generative Adversarial Network (SelectionGAN) for guided image-to-image translation, where we translate an input image into another while respecting external semantic guidance. The proposed SelectionGAN explicitly utilizes the semantic guidance information and consists of two stages. In the first stage, the input image and the conditional semantic guidance are fed into a cycled semantic-guided generation network to produce initial coarse results. In the second stage, we refine the initial results using the proposed multi-scale spatial pooling and channel selection module and the multi-channel attention selection module. Moreover, uncertainty maps automatically learned from attention maps are used to guide the pixel loss for better network optimization. Extensive experiments on four challenging guided image-to-image translation tasks (face, hand, body, and street view) demonstrate that SelectionGAN generates significantly better results than state-of-the-art methods. The proposed framework and modules are also unified solutions that can be applied to other generation tasks such as semantic image synthesis. The code is available at https://github.com/Ha0Tang/SelectionGAN.
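The abstract does not spell out how the multi-channel attention selection module combines its inputs. As a rough illustration only, the sketch below assumes the module blends N candidate images using per-pixel attention weights normalised with a softmax across candidates. The function names `softmax` and `select_pixels`, the flat-list image layout, and the softmax normalisation are all assumptions made for this example and are not taken from the SelectionGAN code.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def select_pixels(candidates, attention_logits):
    """Blend N candidate images into one using per-pixel attention.

    candidates:       N images, each a flat list of P pixel intensities
    attention_logits: N maps, each a flat list of P unnormalised scores
    Returns a flat list of P blended pixel intensities.

    Hypothetical helper: assumes the selection is a softmax-weighted sum.
    """
    n = len(candidates)
    num_pixels = len(candidates[0])
    output = []
    for p in range(num_pixels):
        # Normalise the N scores at this pixel so the weights sum to 1.
        weights = softmax([attention_logits[i][p] for i in range(n)])
        # Weighted sum of the N candidate values at this pixel.
        output.append(sum(w * candidates[i][p] for i, w in enumerate(weights)))
    return output


# Two 2-pixel candidates with equal scores: each pixel is a plain average.
blended = select_pixels([[0.0, 1.0], [1.0, 0.0]], [[0.0, 0.0], [0.0, 0.0]])
```

With equal scores every candidate contributes equally; as one candidate's score at a pixel grows, the output at that pixel approaches that candidate's value, which is the "selection" behaviour the module's name suggests.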
DOI: http://dx.doi.org/10.1109/TPAMI.2022.3212915
Neuroimage
January 2025
College of Artificial Intelligence, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, China; Key Laboratory of Brain-Machine Intelligence Technology, Ministry of Education, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, China.
Dynamic brain networks (DBNs) can capture the intricate connections and temporal evolution among brain regions and are becoming increasingly crucial in the diagnosis of neurological disorders. However, most existing studies focus on isolated brain network sequences segmented by sliding windows and struggle to uncover the higher-order spatio-temporal topological patterns in DBNs. Meanwhile, it remains a challenge to utilize structural connectivity priors in DBN analysis.
Digit Health
January 2025
Department of Communication, University of Macau, Taipa, Macao, China.
Background: Improving the lifestyles of older Chinese adults has drawn considerable attention owing to the exceedingly large ageing population in mainland China. The current study investigates the benefits of multi-channel health information seeking for older adults' lifestyle self-management.
Objective: We conducted a mediation analysis to test the association between multi-channel information-seeking behavior and lifestyle self-management, as mediated by perceived self-management competence.
Sensors (Basel)
December 2024
School of Electronic Information Engineering, Taiyuan University of Science and Technology, Taiyuan 030024, China.
Human pose estimation is an important research direction in computer vision that aims to accurately identify the positions of human body keypoints, and hence posture, from images or videos. However, multi-person pose estimation suffers from false and missed detections in dense crowds, and detecting small targets remains difficult. In this paper, we propose a Mamba-based human pose estimation method.
Sensors (Basel)
December 2024
School of Electrical Engineering, University of Belgrade, 11000 Belgrade, Serbia.
Traditional tactile brain-computer interfaces (BCIs), particularly those based on steady-state somatosensory-evoked potentials, face challenges such as lower accuracy, reduced bit rates, and the need for spatially distant stimulation points. In contrast, using transient electrical stimuli offers a promising alternative for generating tactile BCI control signals: somatosensory event-related potentials (sERPs). This study aimed to optimize the performance of a novel electrotactile BCI by employing advanced feature extraction and machine learning techniques on sERP signals for the classification of users' selective tactile attention.
Brain Sci
December 2024
West China Institute of Children's Brain and Cognition, Chongqing University of Education, Chongqing 400065, China.
Background: Emotions play a crucial role in people's lives, profoundly affecting their cognition, decision-making, and interpersonal communication. Emotion recognition based on brain signals has become a significant challenge in the fields of affective computing and human-computer interaction.
Methods: To address the inaccurate feature extraction and low accuracy of existing deep learning models in emotion recognition, this paper proposes DACB, a multi-channel automatic classification model for emotion EEG signals based on dual attention mechanisms, convolutional neural networks, and bidirectional long short-term memory networks.