Communication between Deaf and hearing individuals remains a persistent challenge requiring attention to foster inclusivity. Despite notable efforts in the development of digital solutions for sign language recognition (SLR), several issues persist, such as cross-platform interoperability and strategies for tokenizing signs to enable continuous conversations and coherent sentence construction. To address such issues, this paper proposes a non-invasive Portuguese Sign Language ( or LGP) interpretation system-as-a-service, leveraging skeletal posture sequence inference powered by long-short term memory (LSTM) architectures. To address the scarcity of examples during machine learning (ML) model training, dataset augmentation strategies are explored. Additionally, a buffer-based interaction technique is introduced to facilitate LGP terms tokenization. This technique provides real-time feedback to users, allowing them to gauge the time remaining to complete a sign, which aids in the construction of grammatically coherent sentences based on inferred terms/words. To support human-like conditioning rules for interpretation, a large language model (LLM) service is integrated. Experiments reveal that LSTM-based neural networks, trained with 50 LGP terms and subjected to data augmentation, achieved accuracy levels ranging from 80% to 95.6%. Users unanimously reported a high level of intuition when using the buffer-based interaction strategy for terms/words tokenization. Furthermore, tests with an LLM-specifically ChatGPT-demonstrated promising semantic correlation rates in generated sentences, comparable to expected sentences.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10672430PMC
http://dx.doi.org/10.3390/jimaging9110235DOI Listing

Publication Analysis

Top Keywords

sign language
12
buffer-based interaction
8
lgp terms
8
empowering deaf-hearing
4
deaf-hearing communication
4
communication exploring
4
exploring synergies
4
synergies predictive
4
predictive generative
4
generative ai-based
4

Similar Publications

The visual environment of sign language users is markedly distinct in its spatiotemporal parameters compared to that of non-signers. Although the importance of temporal and spectral resolution in the auditory modality for language development is well established, the spectrotemporal parameters of visual attention necessary for sign language comprehension remain less understood. This study investigates visual temporal resolution in learners of American Sign Language (ASL) at various stages of acquisition to determine how experience with sign language affects perceptual sampling.

View Article and Find Full Text PDF

Sign recognition: the effect of parameters and features in sign mispronunciations.

Linguist Vanguard

December 2024

Laboratoire de Sciences Cognitives et Psycholinguistique (ENS, EHESS, CNRS), Ecole Normale Supérieure - PSL, 29 rue d'Ulm, 75005 Paris, France.

We investigate the degree to which mispronounced signs can be accommodated by signers of French Sign Language (LSF). Using an offline judgment task, we examine both the individual contributions of three parameters - handshape, movement, and location - to sign recognition, and the impact of the individual features that were manipulated to obtain the mispronounced signs. Results indicate that signers judge mispronounced handshapes to be less damaging for well-formedness than mispronounced locations or movements.

View Article and Find Full Text PDF
Article Synopsis
  • Optimizing enzyme thermostability is crucial for protein science and industry, but combining multiple mutations can lead to inactivation, making traditional methods slow and inefficient.
  • Researchers developed an AI-driven method to enhance enzyme thermostability by efficiently recombining beneficial single-point mutations, using data from various mutant groups.
  • After two design rounds, the study achieved 50 combinatorial mutants with 100% success, including one exceptional mutant that significantly increased melting temperature and half-life, while also revealing complex interactions (epistasis) among mutations.
View Article and Find Full Text PDF

Purpose Of Review: Describe the connection between Deaf/hard of hearing (DHH) and diabetes, explain the bidirectional relationship of blind/low vision (BLV) and diabetes, characterize challenges DHH and BLV populations face when seeking healthcare regarding their diabetes management. Highlight the inaccessibility of diabetes technology in these populations. Provide best practices when communicating with DHH and BLV people in the clinical setting.

View Article and Find Full Text PDF

Intelligent Gesture Recognition Gloves for Real-Time Monitoring in Wireless Human-Computer Interaction.

ACS Appl Mater Interfaces

December 2024

National Engineering Lab of Special Display Technology, Special Display and Imaging Technology Innovation Center of Anhui Province, Academy of Optoelectronic Technology, Hefei University of Technology, Hefei 230009, China.

Flexible sensors mimic the sensing ability of human skin, and have unique flexibility and adaptability, allowing users to interact with intelligent systems in a more natural and intimate way. To overcome the issues of low sensitivity and limited operating range of flexible strain sensors, this study presents a highly innovative preparation method to develop a conductive elastomeric sensor with a cracked thin film by combining polydimethylsiloxane (PDMS) with multiwalled carbon nanotubes (MCNT). This novel design significantly increases both the sensitivity and operating range of the sensor (strain range 0-50%; the maximum tensile sensitivity of this sensor reaches 4.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!