A human activity recognition method based on Vision Transformer.

Sci Rep

School of Computer Science and Technology, North University of China, Taiyuan, 030051, China.

Published: July 2024

Human activity recognition has a wide range of applications in various fields, such as video surveillance, virtual reality and human-computer intelligent interaction. It has emerged as a significant research area in computer vision. GCN (Graph Convolutional networks) have recently been widely used in these fields and have made great performance. However, there are still some challenges including over-smoothing problem caused by stack graph convolutions and deficient semantics correlation to capture the large movements between time sequences. Vision Transformer (ViT) is utilized in many 2D and 3D image fields and has surprised results. In our work, we propose a novel human activity recognition method based on ViT (HAR-ViT). We integrate enhanced AGCL (eAGCL) in 2s-AGCN to ViT to make it process spatio-temporal data (3D skeleton) and make full use of spatial features. The position encoder module orders the non-sequenced information while the transformer encoder efficiently compresses sequence data features to enhance calculation speed. Human activity recognition is accomplished through multi-layer perceptron (MLP) classifier. Experimental results demonstrate that the proposed method achieves SOTA performance on three extensively used datasets, NTU RGB+D 60, NTU RGB+D 120 and Kinetics-Skeleton 400.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11222487PMC
http://dx.doi.org/10.1038/s41598-024-65850-3DOI Listing

Publication Analysis

Top Keywords

human activity
16
activity recognition
16
recognition method
8
method based
8
vision transformer
8
ntu rgb+d
8
human
4
recognition
4
based vision
4
transformer human
4

Similar Publications

A conifer metabolite corrects episodic ataxia type 1 by voltage sensor-mediated ligand activation of Kv1.1.

Proc Natl Acad Sci U S A

January 2025

Bioelectricity Laboratory, Department of Physiology and Biophysics, School of Medicine, University of California, Irvine, CA 92697.

Loss-of-function sequence variants in , which encodes the voltage-gated potassium channel Kv1.1, cause Episodic Ataxia Type 1 (EA1) and epilepsy. Due to a paucity of drugs that directly rescue mutant Kv1.

View Article and Find Full Text PDF

Psychological Distress as a Mediator Between Work-Family Conflict and Nurse Managers' Professional and Organizational Turnover Intentions.

J Nurs Adm

December 2024

Author Affiliation: Assistant Professor, School of Nursing and Healthcare Leadership, University of Washington, Tacoma.

Objective: This study aimed to investigate the mediating role of psychological distress in the relationship between work-family conflict and nurse managers' (NMs') professional and organizational turnover intentions.

Background: Work-family conflict is prevalent among NMs. It can have a significant impact on their intent to leave their organization and the profession.

View Article and Find Full Text PDF

Malignant gliomas are heterogeneous tumors, mostly incurable, arising in the central nervous system (CNS) driven by genetic, epigenetic, and metabolic aberrations. Mutations in isocitrate dehydrogenase (IDH1/2) enzymes are predominantly found in low-grade gliomas and secondary high-grade gliomas, with IDH1 mutations being more prevalent. Mutant-IDH1/2 confers a gain-of-function activity that favors the conversion of a-ketoglutarate (α-KG) to the oncometabolite 2-hydroxyglutarate (2-HG), resulting in an aberrant hypermethylation phenotype.

View Article and Find Full Text PDF

Collaborative management partnerships (CMPs) between state wildlife authorities and nonprofit conservation organizations to manage protected areas (PAs) have been used increasingly across Sub-Saharan Africa since the 2000s. They aim to attract funding, build capacity, and increase the environmental effectiveness of PAs. Our study documents the rise of CMPs, examines their current extent, and measures their effectiveness in protecting habitats.

View Article and Find Full Text PDF

The widespread application of genome editing to treat and cure disease requires the delivery of genome editors into the nucleus of target cells. Enveloped delivery vehicles (EDVs) are engineered virally derived particles capable of packaging and delivering CRISPR-Cas9 ribonucleoproteins (RNPs). However, the presence of lentiviral genome encapsulation and replication proteins in EDVs has obscured the underlying delivery mechanism and precluded particle optimization.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!