Automatic Speech Recognition (ASR) is a technology that converts spoken words into text, facilitating interaction between humans and machines. One of the most common applications of ASR is Speech-To-Text (STT) technology, which simplifies user workflows by transcribing spoken words into text. In the medical field, STT has the potential to significantly reduce the workload of clinicians who rely on typists to transcribe their voice recordings. However, developing an STT model for the medical domain is challenging due to the lack of sufficient speech and text datasets. To address this issue, we propose a medical-domain text correction method that modifies the output text of a general STT system using the Vision Language Pre-training (VLP) method. VLP combines textual and visual information to correct text based on image knowledge. Our extensive experiments demonstrate that the proposed method offers quantitatively and clinically significant improvements in STT performance in the medical field. We further show that multi-modal understanding of image and text information outperforms single-modal understanding using only text information.

Download full-text PDF

Source
http://dx.doi.org/10.1109/JBHI.2023.3345897DOI Listing

Publication Analysis

Top Keywords

text
8
spoken text
8
medical field
8
stt
5
improving medical
4
medical speech-to-text
4
speech-to-text accuracy
4
accuracy vision-language
4
vision-language pre-training
4
pre-training models
4

Similar Publications

Sleep quality in lung cancer and specifically non-small-cell lung cancer: a rapid review.

Support Care Cancer

January 2025

Supportive and Palliative Care Service, Strasbourg University Hospital, Strasbourg Translational Medicine Federation (FMTS), Université de Strasbourg, Strasbourg, France.

Purpose: Sleep quality contributes to the improvement of quality of life in cancer patients. However, sleep disturbances, of variable and heterogeneous etiologies, are common and frequently overlooked in lung cancer patients. The present study undertakes a rapid review of available peer-reviewed literature on sleep quality in lung cancer patients, specifically non-small-cell lung cancer patients.

View Article and Find Full Text PDF

Study Objectives: Both the (ICSD) and the sleep-wake disorders section of the (DSM) emphasize the importance of clinical judgment in distinguishing the normal from the pathological in sleep medicine. The fourth edition of the DSM (DSM-IV, 1994) introduced the clinical significance criterion (CSC) to standardize this judgment and enhance diagnostic reliability.

Methods: This review conducts a theoretical and historical content analysis of CSC presence, frequency, and formulation in the diagnostic criteria of sleep disorders.

View Article and Find Full Text PDF

In the context of Chinese clinical texts, this paper aims to propose a deep learning algorithm based on Bidirectional Encoder Representation from Transformers (BERT) to identify privacy information and to verify the feasibility of our method for privacy protection in the Chinese clinical context. We collected and double-annotated 33,017 discharge summaries from 151 medical institutions on a municipal regional health information platform, developed a BERT-based Bidirectional Long Short-Term Memory Model (BiLSTM) and Conditional Random Field (CRF) model, and tested the performance of privacy identification on the dataset. To explore the performance of different substructures of the neural network, we created five additional baseline models and evaluated the impact of different models on performance.

View Article and Find Full Text PDF

Patient Performance of Care Tasks During Acute Hospitalisation: A Scoping Review.

J Clin Nurs

January 2025

The Cheryl Spencer Department of Nursing, Faculty of Social Welfare and Health Sciences, University of Haifa, Haifa, Israel.

Background: Patient self-care is established as improving outcomes, yet acute care in hospitals is provided such that patients tend to be passive recipients of care. Little is known about the extent and type of patient participation in treatment care tasks in acute hospital settings.

Aims: To map and synthesise available literature on self-performance of care tasks in acute hospital settings.

View Article and Find Full Text PDF

Long-acting and extended-release drug delivery strategies have greatly improved treatment for a variety of medical conditions. Special populations, specifically infants, children, young people, and pregnant and postpartum women, could greatly benefit from access to these strategies but are often excluded from clinical trials. We conducted a systematic review of all clinical studies involving the use of a long-acting intramuscular injection or implant in infants, children, young people, and pregnant and postpartum people.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!