Center point to pose: Multiple views 3D human pose estimation for multi-person.

PLoS One

The State Key Laboratory of Automotive Simulation and Control, Jilin University, Changchun, China.

Published: December 2022

3D human pose estimation is an important task in computer vision, especially in crowded scenes where multiple people interact with each other. Many state-of-the-art object detection methods operate on a single view. However, a single view lacks depth information, so locating people in crowded and occluded scenes is difficult and such methods lack robustness. Multi-view, multi-person human pose estimation has therefore become an effective approach. Previous multi-view 3D human pose estimation methods largely rely on associating the joints of the same person across 2D pose estimates. However, incompleteness and noise in the 2D poses are inevitable, and associating the joints is itself challenging. To address these issues, we propose CTP (Center Point to Pose), a multi-view network that operates directly in 3D space. The 2D joint features from all cameras are projected into a 3D voxel space. The CTP network regresses the center of each person as that person's location and a 3D bounding box as that person's activity area, then estimates a detailed 3D pose within each bounding box. In addition, the CTP network requires no Non-Maximum Suppression when regressing person centers, which makes it simpler and more efficient. Our method performs competitively on several public datasets, which shows the efficacy of our center-point-to-pose network representation.
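
The step in which 2D features from all cameras are projected into a 3D voxel space can be illustrated with a minimal sketch. It is not the CTP network itself: the toy camera matrices, the single-peak heatmaps, the nearest-neighbour sampling, the averaging across views, and the helper names (`project`, `sample`, `voxel_scores`) are all assumptions made for illustration. The final maximum over voxels is only a stand-in for the learned center regression described in the abstract.

```python
from itertools import product

def project(P, X):
    """Project 3D point X with a 3x4 camera matrix P; return (u, v) or None if behind the camera."""
    xh = (X[0], X[1], X[2], 1.0)
    u, v, w = (sum(p * x for p, x in zip(row, xh)) for row in P)
    if w <= 0:
        return None
    return u / w, v / w

def sample(heatmap, u, v):
    """Nearest-neighbour heatmap lookup; returns 0.0 outside the image."""
    r, c = int(round(v)), int(round(u))
    if 0 <= r < len(heatmap) and 0 <= c < len(heatmap[0]):
        return heatmap[r][c]
    return 0.0

def voxel_scores(cameras, heatmaps, centers):
    """Average, over all views, the 2D heatmap value each voxel center projects onto."""
    scores = []
    for X in centers:
        vals = []
        for P, hm in zip(cameras, heatmaps):
            uv = project(P, X)
            vals.append(sample(hm, *uv) if uv is not None else 0.0)
        scores.append(sum(vals) / len(vals))
    return scores

# Hypothetical setup: two cameras with focal length 4 and principal point (4, 4).
# Camera 1 sits at the origin looking along +z; camera 2 sits at (-4, 0, 4) looking along +x.
cam1 = [[4, 0, 4, 0], [0, 4, 4, 0], [0, 0, 1, 0]]
cam2 = [[4, 0, -4, 32], [4, 4, 0, 16], [1, 0, 0, 4]]

def peak_heatmap(size=9, row=4, col=4):
    """Toy 2D detection: a single response of 1.0 at (row, col)."""
    hm = [[0.0] * size for _ in range(size)]
    hm[row][col] = 1.0
    return hm

heatmaps = [peak_heatmap(), peak_heatmap()]

# A small 3x3x3 grid of voxel centers around the expected person location.
centers = list(product([-1, 0, 1], [-1, 0, 1], [3, 4, 5]))
scores = voxel_scores([cam1, cam2], heatmaps, centers)

# Pick the single best-supported voxel (a stand-in for the learned center regression).
best_score, best = max(zip(scores, centers))
print(best, best_score)
```

In this toy setup, voxels lying on only one camera's viewing ray receive partial support, while the voxel where both rays intersect receives full support. This shows why aggregating in 3D avoids an explicit cross-view joint association step.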

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9469997 (PMC)
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0274450 (PLOS)

Similar Publications

Double bowtie design for high sensitivity pediatric spectral CT.

Conf Proc Int Conf Image Form Xray Comput Tomogr

August 2024

Department of Radiology, Perelman School of Medicine, Philadelphia, PA USA.

Despite the evident benefits of spectral computed tomography (CT) in delivering qualitative imaging superior to that of conventional CT in adults, its application in pediatric diagnostic imaging is still relatively limited due to various reasons, including design limitations and radiation dose considerations. The use of specialized K-edge filters, in conjunction with other spectral technologies, has been demonstrated to improve spectral quantification accuracy. X-ray flux limitations generally pose challenges in these concepts when applied to adults.

Background: Vancomycin-resistant Enterococcus (VRE) are present across the One Health continuum and pose a considerable risk for transmission along the food chain. This systematic review and meta-analysis estimates the prevalence of VRE colonization in livestock, food of animal origin, and in human populations.

Methods: Embase, MEDLINE and CAB Abstracts were searched for eligible literature.

Complex traits influenced by multiple genes pose challenges for marker-assisted selection (MAS) in breeding. Genomic selection (GS) is a promising strategy for achieving higher genetic gains in quantitative traits by stacking favorable alleles into elite cultivars. Resistance to Fusarium oxysporum f.

Swin-transformer for weak feature matching.

Sci Rep

January 2025

Department of Computer Science and Technology, Qilu University of Technology, No. 3501 Daxue Road, Jinan, 250300, Shandong, China.

Feature matching in computer vision is crucial but challenging in weakly textured scenes due to the lack of pattern repetition. We introduce the SwinMatcher feature matching method, aimed at addressing the issues of low matching quantity and poor matching precision in weakly textured scenes. Given the inherently significant local characteristics of image features, we employ a local self-attention mechanism to learn from weakly textured areas, maximally preserving the features of weak textures.

Spatiotemporal estimates of anthropogenic NOx emissions across China during 2015-2022 using a deep learning model.

J Hazard Mater

January 2025

Key Laboratory of Geographic Information Science of the Ministry of Education, School of Geographic Sciences, East China Normal University, Shanghai 200241, PR China; Institute of Eco-Chongming (IEC), 20 Cuiniao Road, Chenjia Town, Chongming District, Shanghai 202162, PR China.

As one of the significant air pollutants, nitrogen oxides (NOx = NO + NO2) not only pose a great threat to human health, but also contribute to the formation of secondary pollutants such as ozone and nitrate particles. Due to substantial uncertainties in bottom-up emission inventories, concentrations of air pollutants simulated with the GEOS-Chem model are often largely biased relative to ground-level observations. To address this issue, we developed a new deep learning model to simulate the inverse process of the GEOS-Chem model.
