Graph convolutional networks (GCN) have recently been studied to exploit the graph topology of the human body for skeleton-based action recognition. However, most of these methods unfortunately aggregate messages via an inflexible pattern for various action samples, lacking the awareness of intra-class variety and the suitableness for skeleton sequences, which often contain redundant or even detrimental connections. In this paper, we propose a novel Deformable Graph Convolutional Network (DeGCN) to adaptively capture the most informative joints. The proposed DeGCN learns the deformable sampling locations on both spatial and temporal graphs, enabling the model to perceive discriminative receptive fields. Notably, considering human action is inherently continuous, the corresponding temporal features are defined in a continuous latent space. Furthermore, we design an innovative multi-branch framework, which not only strikes a better trade-off between accuracy and model size, but also elevates the effect of ensemble between the joint and bone modalities remarkably. Extensive experiments show that our proposed method achieves state-of-the-art performances on three widely used datasets, NTU RGB+D, NTU RGB+D 120, and NW-UCLA.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TIP.2024.3378886DOI Listing

Publication Analysis

Top Keywords

graph convolutional
12
deformable graph
8
convolutional networks
8
skeleton-based action
8
action recognition
8
ntu rgb+d
8
degcn deformable
4
graph
4
networks skeleton-based
4
action
4

Similar Publications

MHNet: Multi-view High-Order Network for Diagnosing Neurodevelopmental Disorders Using Resting-State fMRI.

J Imaging Inform Med

January 2025

Department of Chinese and Bilingual Studies, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong Special Administrative Region, China.

Deep learning models have shown promise in diagnosing neurodevelopmental disorders (NDD) like ASD and ADHD. However, many models either use graph neural networks (GNN) to construct single-level brain functional networks (BFNs) or employ spatial convolution filtering for local information extraction from rs-fMRI data, often neglecting high-order features crucial for NDD classification. We introduce a Multi-view High-order Network (MHNet) to capture hierarchical and high-order features from multi-view BFNs derived from rs-fMRI data for NDD prediction.

View Article and Find Full Text PDF

Re-locative guided search optimized self-sparse attention enabled deep learning decoder for quantum error correction.

Sci Rep

January 2025

Department of Mathematics, School of Advanced Sciences, VIT-AP University, Besides AP Secretariate, Amaravati, Andhra Pradesh, 522237, India.

Heavy hexagonal coding is a type of quantum error-correcting coding in which the edges and vertices of a low-degree graph are assigned auxiliary and physical qubits. While many topological code decoders have been presented, it is still difficult to construct the optimal decoder due to leakage errors and qubit collision. Therefore, this research proposes a Re-locative Guided Search optimized self-sparse attention-enabled convolutional Neural Network with Long Short-Term Memory (RlGS2-DCNTM) for performing effective error correction in quantum codes.

View Article and Find Full Text PDF

Identification of potential drug-target interactions (DTIs) is a crucial step in drug discovery and repurposing. Although deep learning effectively deciphers DTIs, most deep learning-based methods represent drug features from only a single perspective. Moreover, the fusion method of drug and protein features needs further refinement.

View Article and Find Full Text PDF

 Combination therapy, which synergistically enhances treatment efficacy and inhibits disease progression through the combined effects of multiple drugs, has emerged as a mainstream approach for treating complex diseases and alleviating symptoms. However, drug-drug interactions (DDIs) can sometimes lead to adverse reactions, potentially endangering lives. Therefore, developing efficient and accurate DDI prediction methods is crucial for elucidating drug mechanisms and preventing side effects.

View Article and Find Full Text PDF

EEG involves recording electrical activity generated by the brain through electrodes placed on the scalp. Imagined speech classification has emerged as an essential area of research in brain-computer interfaces (BCIs). Despite significant advances, accurately classifying imagined speech signals remains challenging due to their complex and non-stationary nature.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!