Introduction: In the field of medical listening assessments,accurate transcription and effective cognitive load management are critical for enhancing healthcare delivery. Traditional speech recognition systems, while successful in general applications often struggle in medical contexts where the cognitive state of the listener plays a significant role. These conventional methods typically rely on audio-only inputs and lack the ability to account for the listener's cognitive load, leading to reduced accuracy and effectiveness in complex medical environments.
Methods: To address these limitations, this study introduces ClinClip, a novel multimodal model that integrates EEG signals with audio data through a transformer-based architecture. ClinClip is designed to dynamically adjust to the cognitive state of the listener, thereby improving transcription accuracy and robustness in medical settings. The model leverages cognitive-enhanced strategies, including EEG-based modulation and hierarchical fusion of multimodal data, to overcome the challenges faced by traditional methods.
Results And Discussion: Experiments conducted on four datasets-EEGEyeNet, DEAP, PhyAAt, and eSports Sensors-demonstrate that ClinClip significantly outperforms six state-of-the-art models in both Word Error Rate (WER) and Cognitive Modulation Efficiency (CME). These results underscore the model's effectiveness in handling complex medical audio scenarios and highlight its potential to improve the accuracy of medical listening assessments. By addressing the cognitive aspects of the listening process. ClinClip contributes to more reliable and effective healthcare delivery, offering a substantial advancement over traditional speech recognition approaches.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11755411 | PMC |
http://dx.doi.org/10.3389/fnins.2024.1493163 | DOI Listing |
Trends Hear
January 2025
Bionics Institute, East Melbourne, VIC, Australia.
This study used functional near-infrared spectroscopy (fNIRS) to measure aspects of the speech discrimination ability of sleeping infants. We examined the morphology of the fNIRS response to three different speech contrasts, namely "Tea/Ba," "Bee/Ba," and "Ga/Ba." Sixteen infants aged between 3 and 13 months old were included in this study and their fNIRS data were recorded during natural sleep.
View Article and Find Full Text PDFFront Neurosci
January 2025
The Basic Department, The Tourism College of Changchun University, Changchun, China.
Introduction: In the field of medical listening assessments,accurate transcription and effective cognitive load management are critical for enhancing healthcare delivery. Traditional speech recognition systems, while successful in general applications often struggle in medical contexts where the cognitive state of the listener plays a significant role. These conventional methods typically rely on audio-only inputs and lack the ability to account for the listener's cognitive load, leading to reduced accuracy and effectiveness in complex medical environments.
View Article and Find Full Text PDFInt J Psychiatry Med
January 2025
Department of Public Health, Official University of Ruwenzori, Goma, North-Kivu Democratic Republic of the Congo.
Objective: Although religious leaders play an important role in providing informal mental health care to individuals struggling to seek religious and spiritual care, existing studies have not explored the magnitude of psychiatric symptoms and motivators to seek mental health services from religious leaders in religious listening centers and mental hospitals. This paper presents preliminary data from a survey aimed at assessing psychiatric symptoms and factors associated with access to spiritual services among 151 individuals at a religious listening center, as well as to determine the pathways of care among 150 patients attending a mental health clinic in conflict zones of the Eastern Democratic Republic of the Congo.
Method: Three hundred and one participants were screened for psychiatric symptoms and factors motivating access to religious leaders using a semi-structured questionnaire.
Ear Hear
January 2025
Department of Otorhinolaryngology and Head and Neck Surgery, Erasmus Medical Center, University Medical Center, Rotterdam, The Netherlands.
Objectives: The acoustic change complex (ACC) is a cortical auditory evoked potential that shows promise as an objective test of the neural capacity for speech and sound discrimination, particularly for difficult-to-test populations, for example, cognitively impaired adults. There is uncertainty, however, surrounding the performance of the ACC with behavioral measures. The objective of this study was to systematically review the literature, focusing on adult studies, to investigate the relationship between ACC responses and behavioral psychophysical measures.
View Article and Find Full Text PDFJ Acad Nutr Diet
January 2025
Department of Nutritional Sciences, University of Michigan School of Public Health. Electronic address:
Background: Parents are important conduits of weight- and health-related messaging. Weight-related communication and approaches to child feeding used by parents may reflect their past experiences with weight stigma and are understudied pathways through which intergenerational weight stigma may be transmitted.
Objective: To examine how experienced and internalized weight stigma among parents of children with higher weights are associated with weight-related communication and the feeding practices they use.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!