Two-Stream Mixed Convolutional Neural Network for American Sign Language Recognition.

Sensors (Basel)

Department of Computer Engineering, Chonnam National University, Yeosu 59626, Korea.

Published: August 2022

The Convolutional Neural Network (CNN) has demonstrated excellent performance in image recognition and has brought new opportunities for sign language recognition. However, the features undergo many nonlinear transformations while performing the convolutional operation and the traditional CNN models are insufficient in dealing with the correlation between images. In American Sign Language (ASL) recognition, J and Z with moving gestures bring recognition challenges. This paper proposes a novel Two-Stream Mixed (TSM) method with feature extraction and fusion operation to improve the correlation of feature expression between two time-consecutive images for the dynamic gestures. The proposed TSM-CNN system is composed of preprocessing, the TSM block, and CNN classifiers. Two consecutive images in the dynamic gesture are used as inputs of streams, and resizing, transformation, and augmentation are carried out in the preprocessing stage. The fusion feature map obtained by addition and concatenation in the TSM block is used as inputs of the classifiers. Finally, a classifier classifies images. The TSM-CNN model with the highest performance scores depending on three concatenation methods is selected as the definitive recognition model for ASL recognition. We design 4 CNN models with TSM: TSM-LeNet, TSM-AlexNet, TSM-ResNet18, and TSM-ResNet50. The experimental results show that the CNN models with the TSM are better than models without TSM. The TSM-ResNet50 has the best accuracy of 97.57% for MNIST and ASL datasets and is able to be applied to a RGB image sensing system for hearing-impaired people.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9414785	PMC
http://dx.doi.org/10.3390/s22165959	DOI Listing

Publication Analysis

Top Keywords

sign language

cnn models

models tsm

two-stream mixed

convolutional neural

neural network

american sign

language recognition

asl recognition

images dynamic

Similar Publications

Improved feature reduction framework for sign language recognition using autoencoders and adaptive Grey Wolf Optimization.

Sci Rep

January 2025

University Institute of Computing, Chandigarh University, Punjab, India.

Rajeev Goel Sandhya Bansal Kavita Gupta

Automatic Sign Language Recognition Systems (ASLR) offers smooth communication between hearing-impaired and normal-hearing individuals, enhancing educational opportunities for impaired. However, it struggles with "curse of dimensionality" due to excessive features resulting in prolonged training time and exhaustive computational demand. This paper proposes technique that integrates machine learning and swarm intelligence to effectively address this issue.

View Article and Find Full Text PDF

Similar Publications

South African parents' views on oral, signing, and bilingual communication for Deaf or hard-of-hearing children.

Afr J Disabil

December 2024

Department of Audiology, Faculty of Human and Community Development, University of the Witwatersrand, Braamfontein, South Africa.

Katijah Khoza-Shangase Jasmine Bent

Background: Parents of Deaf or hard-of-hearing (DHH) children are faced with a plethora of overwhelming decisions concerning their children, particularly during the early stages of development. Among these decisions are those concerning assistive devices and the modes of communication for their child.

Objectives: The aim of this study was to explore the perceptions of parents of DHH children towards the various modes of communication for their children within the South African context.

View Article and Find Full Text PDF

Similar Publications

Artificial intelligence in clinical settings: a systematic review of its role in language translation and interpretation.

Ann Transl Med

December 2024

Division of Advanced Gastrointestinal and Bariatric Surgery, Mayo Clinic, Jacksonville, FL, USA.

Ariana Genovese Sahar Borna Cesar A Gomez-Cabello Syed Ali Haider Srinivasagam Prabha

Background: Addressing language barriers through accurate interpretation is crucial for providing quality care and establishing trust. While the ability of artificial intelligence (AI) to translate medical documentation has been studied, its role for patient-provider communication is less explored. This review evaluates AI's effectiveness in clinical translation by assessing accuracy, usability, satisfaction, and feedback on its use.

View Article and Find Full Text PDF

Similar Publications

Excess cement and peri-implant disease: A cross-sectional clinical endoscopic study.

J Periodontol

January 2025

Department of Biomedical and Neuromotor Sciences, School of Dentistry - Division of Periodontology and Implantology, Alma Mater Studiorum - University of Bologna, Bologna, Italy.

Marco Montevecchi Leoluca Valeriani Maria Francesca Salvadori Martina Stefanini Giovanni Zucchelli

Background: Crown cementation is a common technique for implant-supported prosthodontics. However, for possible slipping of the cement below the mucosal margin, its thorough removal poses some issues. The objective of this study was to evaluate the presence of submucosal cement residues in patients with peri-implant disease by endoscopic visualization and to investigate the potential correlation between the pathological scenario and the spatial position of cement residues.

View Article and Find Full Text PDF

Similar Publications

AzSLD: Azerbaijani sign language dataset for fingerspelling, word, and sentence translation with baseline software.

Data Brief

February 2025

ADA University, Baku, Azerbaijan.

Nigar Alishzade Jamaladdin Hasanov

Advancements in sign language processing technology hinge on the availability of extensive, reliable datasets, comprehensive instructions, and adherence to ethical guidelines. To facilitate progress in gesture recognition and translation systems and to support the Azerbaijani sign language community we present the Azerbaijani Sign Language Dataset (AzSLD). This comprehensive dataset was collected from a diverse group of sign language users, encompassing a range of linguistic parameters.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!