Graph Convolutional Networks (GCNs) are widely used for skeleton-based action recognition and achieved remarkable performance. Due to the locality of graph convolution, GCNs can only utilize short-range node dependencies but fail to model long-range node relationships. In addition, existing graph convolution based methods normally use a uniform skeleton topology for all frames, which limits the ability of feature learning. To address these issues, we present the Graph Convolution Network with Self-Attention (SelfGCN), which consists of a mixing features across self-attention and graph convolution (MFSG) module and a temporal-specific spatial self-attention (TSSA) module. The MFSG module models local and global relationships between joints by executing graph convolution and self-attention branches in parallel. Its bi-directional interactive learning strategy utilizes complementary clues in the channel dimensions and the spatial dimensions across both of these branches. The TSSA module uses self-attention to learn the spatial relationships between joints of each frame in a skeleton sequence. It also models the unique spatial features of the single frames. We conduct extensive experiments on three popular benchmark datasets, NTU RGB+D, NTU RGB+D120, and Northwestern-UCLA. The results of the experiment demonstrate that our method achieves or exceeds the record accuracies on all three benchmarks. Our project website is available at https://github.com/SunPengP/SelfGCN.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TIP.2024.3433581DOI Listing

Publication Analysis

Top Keywords

graph convolution
24
convolution network
8
network self-attention
8
skeleton-based action
8
action recognition
8
mfsg module
8
tssa module
8
relationships joints
8
convolution
6
self-attention
6

Similar Publications

Real-time and accurate traffic forecasting aids in traffic planning and design and helps to alleviate congestion. Addressing the negative impacts of partial data loss in traffic forecasting, and the challenge of simultaneously capturing short-term fluctuations and long-term trends, this paper presents a traffic forecasting model, D-MGDCN-CLSTM, based on Multi-Graph Gated Dilated Convolution and Conv-LSTM. The model uses the DTWN algorithm to fill in missing data.

View Article and Find Full Text PDF

In sports training, personalized skill assessment and feedback are crucial for athletes to master complex movements and improve performance. However, existing research on skill transfer predominantly focuses on skill evaluation through video analysis, addressing only a single facet of the multifaceted process required for skill acquisition. Furthermore, in the limited studies that generate expert comments, the learner's skill level is predetermined, and the spatial-temporal information of human movement is often overlooked.

View Article and Find Full Text PDF

Interleukin-6 (IL-6) is a potent glycoprotein that plays a crucial role in regulating innate and adaptive immunity, as well as metabolism. The expression and release of IL-6 are closely correlated with the severity of various diseases. IL-6-inducing peptides are critical for the development of immunotherapy and diagnostic biomarkers for some diseases.

View Article and Find Full Text PDF

Graph Convolutional Network with Neural Collaborative Filtering for Predicting miRNA-Disease Association.

Biomedicines

January 2025

Major of Big Data Convergence, Division of Data Information Science, Pukyong National University, Busan 48513, Republic of Korea.

Over the past few decades, micro ribonucleic acids (miRNAs) have been shown to play significant roles in various biological processes, including disease incidence. Therefore, much effort has been devoted to discovering the pivotal roles of miRNAs in disease incidence to understand the underlying pathogenesis of human diseases. However, identifying miRNA-disease associations using biological experiments is inefficient in terms of cost and time.

View Article and Find Full Text PDF

: Alzheimer's disease is a progressive neurological condition marked by a decline in cognitive abilities. Early diagnosis is crucial but challenging due to overlapping symptoms among impairment stages, necessitating non-invasive, reliable diagnostic tools. : We applied information geometry and manifold learning to analyze grayscale MRI scans classified into No Impairment, Very Mild, Mild, and Moderate Impairment.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!