Communication between Deaf and hearing individuals remains a persistent challenge requiring attention to foster inclusivity. Despite notable efforts in the development of digital solutions for sign language recognition (SLR), several issues persist, such as cross-platform interoperability and strategies for tokenizing signs to enable continuous conversations and coherent sentence construction. To address such issues, this paper proposes a non-invasive Portuguese Sign Language ( or LGP) interpretation system-as-a-service, leveraging skeletal posture sequence inference powered by long-short term memory (LSTM) architectures. To address the scarcity of examples during machine learning (ML) model training, dataset augmentation strategies are explored. Additionally, a buffer-based interaction technique is introduced to facilitate LGP terms tokenization. This technique provides real-time feedback to users, allowing them to gauge the time remaining to complete a sign, which aids in the construction of grammatically coherent sentences based on inferred terms/words. To support human-like conditioning rules for interpretation, a large language model (LLM) service is integrated. Experiments reveal that LSTM-based neural networks, trained with 50 LGP terms and subjected to data augmentation, achieved accuracy levels ranging from 80% to 95.6%. Users unanimously reported a high level of intuition when using the buffer-based interaction strategy for terms/words tokenization. Furthermore, tests with an LLM-specifically ChatGPT-demonstrated promising semantic correlation rates in generated sentences, comparable to expected sentences.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10672430 | PMC |
http://dx.doi.org/10.3390/jimaging9110235 | DOI Listing |
J Vis
January 2025
Department of Communicative Disorders, University of Alabama, Tuscaloosa, AL, USA.
The visual environment of sign language users is markedly distinct in its spatiotemporal parameters compared to that of non-signers. Although the importance of temporal and spectral resolution in the auditory modality for language development is well established, the spectrotemporal parameters of visual attention necessary for sign language comprehension remain less understood. This study investigates visual temporal resolution in learners of American Sign Language (ASL) at various stages of acquisition to determine how experience with sign language affects perceptual sampling.
View Article and Find Full Text PDFLinguist Vanguard
December 2024
Laboratoire de Sciences Cognitives et Psycholinguistique (ENS, EHESS, CNRS), Ecole Normale Supérieure - PSL, 29 rue d'Ulm, 75005 Paris, France.
We investigate the degree to which mispronounced signs can be accommodated by signers of French Sign Language (LSF). Using an offline judgment task, we examine both the individual contributions of three parameters - handshape, movement, and location - to sign recognition, and the impact of the individual features that were manipulated to obtain the mispronounced signs. Results indicate that signers judge mispronounced handshapes to be less damaging for well-formedness than mispronounced locations or movements.
View Article and Find Full Text PDFmLife
December 2024
State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology Shanghai Jiao Tong University Shanghai China.
Curr Diab Rep
December 2024
College of Nursing, University of Utah, 10 South 2000 East, Salt Lake City, UT, 84112, USA.
Purpose Of Review: Describe the connection between Deaf/hard of hearing (DHH) and diabetes, explain the bidirectional relationship of blind/low vision (BLV) and diabetes, characterize challenges DHH and BLV populations face when seeking healthcare regarding their diabetes management. Highlight the inaccessibility of diabetes technology in these populations. Provide best practices when communicating with DHH and BLV people in the clinical setting.
View Article and Find Full Text PDFACS Appl Mater Interfaces
December 2024
National Engineering Lab of Special Display Technology, Special Display and Imaging Technology Innovation Center of Anhui Province, Academy of Optoelectronic Technology, Hefei University of Technology, Hefei 230009, China.
Flexible sensors mimic the sensing ability of human skin, and have unique flexibility and adaptability, allowing users to interact with intelligent systems in a more natural and intimate way. To overcome the issues of low sensitivity and limited operating range of flexible strain sensors, this study presents a highly innovative preparation method to develop a conductive elastomeric sensor with a cracked thin film by combining polydimethylsiloxane (PDMS) with multiwalled carbon nanotubes (MCNT). This novel design significantly increases both the sensitivity and operating range of the sensor (strain range 0-50%; the maximum tensile sensitivity of this sensor reaches 4.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!