Two-Stream Mixed Convolutional Neural Network for American Sign Language Recognition.

Sensors (Basel)

Department of Computer Engineering, Chonnam National University, Yeosu 59626, Korea.

Published: August 2022

The Convolutional Neural Network (CNN) has demonstrated excellent performance in image recognition and has brought new opportunities for sign language recognition. However, the features undergo many nonlinear transformations while performing the convolutional operation and the traditional CNN models are insufficient in dealing with the correlation between images. In American Sign Language (ASL) recognition, J and Z with moving gestures bring recognition challenges. This paper proposes a novel Two-Stream Mixed (TSM) method with feature extraction and fusion operation to improve the correlation of feature expression between two time-consecutive images for the dynamic gestures. The proposed TSM-CNN system is composed of preprocessing, the TSM block, and CNN classifiers. Two consecutive images in the dynamic gesture are used as inputs of streams, and resizing, transformation, and augmentation are carried out in the preprocessing stage. The fusion feature map obtained by addition and concatenation in the TSM block is used as inputs of the classifiers. Finally, a classifier classifies images. The TSM-CNN model with the highest performance scores depending on three concatenation methods is selected as the definitive recognition model for ASL recognition. We design 4 CNN models with TSM: TSM-LeNet, TSM-AlexNet, TSM-ResNet18, and TSM-ResNet50. The experimental results show that the CNN models with the TSM are better than models without TSM. The TSM-ResNet50 has the best accuracy of 97.57% for MNIST and ASL datasets and is able to be applied to a RGB image sensing system for hearing-impaired people.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9414785PMC
http://dx.doi.org/10.3390/s22165959DOI Listing

Publication Analysis

Top Keywords

sign language
12
cnn models
12
models tsm
12
two-stream mixed
8
convolutional neural
8
neural network
8
american sign
8
language recognition
8
asl recognition
8
images dynamic
8

Similar Publications

Automatic Sign Language Recognition Systems (ASLR) offers smooth communication between hearing-impaired and normal-hearing individuals, enhancing educational opportunities for impaired. However, it struggles with "curse of dimensionality" due to excessive features resulting in prolonged training time and exhaustive computational demand. This paper proposes technique that integrates machine learning and swarm intelligence to effectively address this issue.

View Article and Find Full Text PDF

South African parents' views on oral, signing, and bilingual communication for Deaf or hard-of-hearing children.

Afr J Disabil

December 2024

Department of Audiology, Faculty of Human and Community Development, University of the Witwatersrand, Braamfontein, South Africa.

Background: Parents of Deaf or hard-of-hearing (DHH) children are faced with a plethora of overwhelming decisions concerning their children, particularly during the early stages of development. Among these decisions are those concerning assistive devices and the modes of communication for their child.

Objectives: The aim of this study was to explore the perceptions of parents of DHH children towards the various modes of communication for their children within the South African context.

View Article and Find Full Text PDF

Background: Addressing language barriers through accurate interpretation is crucial for providing quality care and establishing trust. While the ability of artificial intelligence (AI) to translate medical documentation has been studied, its role for patient-provider communication is less explored. This review evaluates AI's effectiveness in clinical translation by assessing accuracy, usability, satisfaction, and feedback on its use.

View Article and Find Full Text PDF

Excess cement and peri-implant disease: A cross-sectional clinical endoscopic study.

J Periodontol

January 2025

Department of Biomedical and Neuromotor Sciences, School of Dentistry - Division of Periodontology and Implantology, Alma Mater Studiorum - University of Bologna, Bologna, Italy.

Background: Crown cementation is a common technique for implant-supported prosthodontics. However, for possible slipping of the cement below the mucosal margin, its thorough removal poses some issues. The objective of this study was to evaluate the presence of submucosal cement residues in patients with peri-implant disease by endoscopic visualization and to investigate the potential correlation between the pathological scenario and the spatial position of cement residues.

View Article and Find Full Text PDF

Advancements in sign language processing technology hinge on the availability of extensive, reliable datasets, comprehensive instructions, and adherence to ethical guidelines. To facilitate progress in gesture recognition and translation systems and to support the Azerbaijani sign language community we present the Azerbaijani Sign Language Dataset (AzSLD). This comprehensive dataset was collected from a diverse group of sign language users, encompassing a range of linguistic parameters.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!