An enhanced real-time human pose estimation method based on modified YOLOv8 framework.

Sci Rep

Nanjing University of Aeronautics and Astronautics, Nanjing, 210000, Jiangsu, China.

Published: April 2024

The objective of human pose estimation (HPE) derived from deep learning aims to accurately estimate and predict the human body posture in images or videos via the utilization of deep neural networks. However, the accuracy of real-time HPE tasks is still to be improved due to factors such as partial occlusion of body parts and limited receptive field of the model. To alleviate the accuracy loss caused by these issues, this paper proposes a real-time HPE model called based on the YOLOv8 framework. Specifically, we have improved the backbone and neck of the YOLOv8x-pose real-time HPE model to alleviate the feature loss and receptive field constraints. Secondly, we introduce the context coordinate attention module (CCAM) to augment the model's focus on salient features, reduce background noise interference, alleviate key point regression failure caused by limb occlusion, and improve the accuracy of pose estimation. Our approach attains competitive results on multiple metrics of two open-source datasets, MS COCO 2017 and CrowdPose. Compared with the baseline model YOLOv8x-pose, CCAM-Person improves the average precision by 2.8% and 3.5% on the two datasets, respectively.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10997650	PMC
http://dx.doi.org/10.1038/s41598-024-58146-z	DOI Listing

Publication Analysis

Top Keywords

pose estimation

real-time hpe

human pose

yolov8 framework

receptive field

model alleviate

hpe model

enhanced real-time

real-time human

estimation method

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!