Hand detection and classification is a very important pre-processing step in building applications based on three-dimensional (3D) hand pose estimation and hand activity recognition. To automatically limit the hand data area on egocentric vision (EV) datasets, especially to see the development and performance of the "You Only Live Once" (YOLO) network over the past seven years, we propose a study comparing the efficiency of hand detection and classification based on the YOLO-family networks. This study is based on the following problems: (1) systematizing all architectures, advantages, and disadvantages of YOLO-family networks from version (v)1 to v7; (2) preparing ground-truth data for pre-trained models and evaluation models of hand detection and classification on EV datasets (FPHAB, HOI4D, RehabHand); (3) fine-tuning the hand detection and classification model based on the YOLO-family networks, hand detection, and classification evaluation on the EV datasets. Hand detection and classification results on the YOLOv7 network and its variations were the best across all three datasets. The results of the YOLOv7-w6 network are as follows: FPHAB is = 97% with = 0.5; HOI4D is = 95% with = 0.5; RehabHand is larger than 95% with = 0.5; the processing speed of YOLOv7-w6 is 60 fps with a resolution of 1280 × 1280 pixels and that of YOLOv7 is 133 fps with a resolution of 640 × 640 pixels.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10058182PMC
http://dx.doi.org/10.3390/s23063255DOI Listing

Publication Analysis

Top Keywords

detection classification
28
hand detection
24
yolo-family networks
12
hand
10
based yolo-family
8
fps resolution
8
detection
7
classification
7
yolo series
4
series human
4

Similar Publications

The shape characteristics of flow hydrographs hold essential information for understanding, monitoring and assessing changes in flow and flood hydrology at reach and catchment scales. However, the analysis of individual hydrographs is time consuming, making the analysis of hundreds or thousands of them unachievable. A method or protocol is needed to ensure that the datasets being generated, and the metrics produced, have been consistently derived and validated.

View Article and Find Full Text PDF

Ensuring the supply of safe and high-quality drinking water can be compromised by the presence of chironomid larvae in drinking water treatment plants (DWTPs), which may contaminate municipal water systems through freshwater resources. Chironomids are dominant species known for their resilience to a broad range of extreme aquatic environments. This study aimed to identify the morphological characteristics and obtain genetic information of the chironomid Paratanytarsus grimmii found in the water intake source and freshwater resource of DWTPs in Korea, highlighting the potential possibility of a parthenogenetic chironomid outbreak within DWTP networks.

View Article and Find Full Text PDF

Bacteria and archaea acquire resistance to genetic parasites by preferentially integrating short fragments of foreign DNA at one end of a Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR). "Leader" DNA upstream of CRISPR loci regulates transcription and foreign DNA integration into the CRISPR. Here, we analyze 37,477 CRISPRs from 39,277 bacterial and 556 archaeal genomes to identify conserved sequence motifs in CRISPR leaders.

View Article and Find Full Text PDF

Osteoarthritis (OA) is heterogeneous and involves structural changes in the whole joint, such as cartilage, meniscus/labrum, ligaments, and tendons, mainly with short T2 relaxation times. Detecting OA before the onset of irreversible changes is crucial for early proactive management and limit growing disease burden. The more recent advanced quantitative imaging techniques and deep learning (DL) algorithms in musculoskeletal imaging have shown great potential for visualizing "pre-OA.

View Article and Find Full Text PDF

Background: Falls are among the most prevalent workplace accidents, necessitating thorough screening for susceptibility to falls and customization of individualized fall prevention programs. The aim of this study was to develop and validate a high fall risk prediction model using machine learning (ML) and video-based first three steps in middle-aged workers.

Methods: Train data (n=190, age 54.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!