With ongoing technical refinements speech recognition systems (SRS) are becoming an increasingly attractive alternative to traditional methods of preparing and transcribing medical reports. The two main components of any SRS are the acoustic model and the language model. Features of modern SRS with continuous speech recognition are macros with individually definable texts and report templates as well as the option to navigate in a text or to control SRS or RIS functions by speech recognition. The best benefit from SRS can be obtained if it is integrated into a RIS/RIS-PACS installation. Report availability and time efficiency of the reporting process (related to recognition rate, time expenditure for editing and correcting a report) are the principal determinants of the clinical performance of any SRS. For practical purposes the recognition rate is estimated by the error rate (unit "word"). Error rates range from 4 to 28%. Roughly 20% of them are errors in the vocabulary which may result in clinically relevant misinterpretation. It is thus mandatory to thoroughly correct any transcribed text as well as to continuously train and adapt the SRS vocabulary. The implementation of SRS dramatically improves report availability. This is most pronounced for CT and CR. However, the individual time expenditure for (SRS-based) reporting increased by 20-25% (CR) and according to literature data there is an increase by 30% for CT and MRI. The extent to which the transcription staff profits from SRS depends largely on its qualification. Online dictation implies a workload shift from the transcription staff to the reporting radiologist.

Download full-text PDF

Source
http://dx.doi.org/10.1007/s00117-005-1253-7DOI Listing

Publication Analysis

Top Keywords

speech recognition
12
srs
9
report availability
8
recognition rate
8
time expenditure
8
transcription staff
8
report
5
recognition
5
[speech recognition
4
recognition impact
4

Similar Publications

VoxRad: Building an open-source locally-hosted radiology reporting system.

Clin Imaging

January 2025

LN Medical College & JK Hospital, Kolar Road, Bhopal, Madhya Pradesh, India. Electronic address:

VoxRad is an open-source application designed to enhance radiology reporting by leveraging generative AI. Utilizing locally hosted Automatic Speech Recognition (ASR) and Large Language Models (LLM), VoxRad enables continuous dictation, transcribing reports into standardized formats with high accuracy, efficiency, and data security. The modular design allows flexible integration of user-selected ASR and LLM models via OpenAI-compatible APIs, ensuring HIPAA compliance with secure local storage of data.

View Article and Find Full Text PDF

Hearing in categories and speech perception at the "cocktail party".

PLoS One

January 2025

School of Communication Sciences & Disorders, University of Memphis, Memphis, Tennessee, United States of America.

We aimed to test whether hearing speech in phonetic categories (as opposed to a continuous/gradient fashion) affords benefits to "cocktail party" speech perception. We measured speech perception performance (recognition, localization, and source monitoring) in a simulated 3D cocktail party environment. We manipulated task difficulty by varying the number of additional maskers presented at other spatial locations in the horizontal soundfield (1-4 talkers) and via forward vs.

View Article and Find Full Text PDF

Analog In-memory Computing (IMC) has demonstrated energy-efficient and low latency implementation of convolution and fully-connected layers in deep neural networks (DNN) by using physics for computing in parallel resistive memory arrays. However, recurrent neural networks (RNN) that are widely used for speech-recognition and natural language processing have tasted limited success with this approach. This can be attributed to the significant time and energy penalties incurred in implementing nonlinear activation functions that are abundant in such models.

View Article and Find Full Text PDF

Objectives: Hearing impairment during childhood is a widespread health issue. Prompt recognition and timely intervention are vital for the advancement of language skills. Insufficient parental knowledge can lead to a delay in diagnosing and treating a condition, which can have a negative impact on academic performance.

View Article and Find Full Text PDF

 Minimally invasive Ponto surgery (MIPS) enables the installation of percutaneous bone-anchored hearing implants (BAHIs) with a drill guide through a hole punch incision. Despite being well established for adults, there is a lack of studies in the literature regarding its use in pediatric patients.  The aim of the present study was to investigate the hearing performance and soft-tissue outcomes of the use of MIPS under local anesthesia in children with unilateral craniofacial malformation (UCM).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!