Purpose: The application of Chinese Mandarin electrolaryngeal (EL) speech for laryngectomees has been limited by drawbacks such as a single fundamental frequency, a mechanical sound, and large radiation noise. To improve the intelligibility of Chinese Mandarin EL speech, a new approach using an automatic speech recognition (ASR) system was proposed, which can convert EL speech into healthy speech when combined with text-to-speech.
Method: An ASR system was designed to recognize EL speech based on the deep learning model WaveNet and connectionist temporal classification (WaveNet-CTC). The system consists of 3 main parts: the acoustic model, the language model, and the decoding model. Acoustic features are extracted during speech preprocessing, and 3,230 utterances of EL speech mixed with 10,000 utterances of healthy speech are used to train the ASR system. A comparative experiment was designed to evaluate the performance of the proposed method.
Results: The results show that the proposed ASR system has higher stability and generalizability than the traditional methods, performing better on Chinese characters, Chinese words, short sentences, and long sentences. Phoneme confusion occurs more easily in the stops and affricates of EL speech than in those of healthy speech. Nevertheless, the highest accuracy of the ASR system reached 83.24% when 3,230 utterances of EL speech were used to train it.
Conclusions: This study indicates that EL speech can be recognized effectively by an ASR system based on WaveNet-CTC. The proposed method has better generalization performance and stability than the traditional methods. The WaveNet-CTC-based ASR system can achieve higher accuracy, which means that EL speech can be converted into healthy speech.
Supplemental Material: https://doi.org/10.23641/asha.8250830
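As a rough illustration of the CTC step named in the Method section, the sketch below shows greedy (best-path) CTC decoding: take the highest-scoring label in each frame, collapse consecutive repeats, then drop the blank symbol. This is not the authors' decoding model, which the abstract does not describe in detail; the function name, the blank index, the toy vocabulary, and the scores are all assumptions made up for the example.

```python
from itertools import groupby

BLANK = 0  # assumption: label index 0 is the CTC blank symbol


def ctc_greedy_decode(frame_scores, blank=BLANK):
    """Greedy CTC decoding over a list of per-frame score lists.

    Returns the decoded label indices (blank removed).
    """
    # Pick the highest-scoring label index in every frame.
    best_path = [max(range(len(f)), key=f.__getitem__) for f in frame_scores]
    # Collapse runs of identical consecutive labels into one label.
    collapsed = [label for label, _ in groupby(best_path)]
    # Remove blanks; a blank between two equal labels keeps both of them.
    return [label for label in collapsed if label != blank]


# Hypothetical toy vocabulary and frame scores (not from the paper).
vocab = {1: "n", 2: "i", 3: "h", 4: "ao"}
scores = [
    [0.10, 0.80, 0.05, 0.03, 0.02],  # n
    [0.10, 0.70, 0.10, 0.05, 0.05],  # n (repeat, collapsed)
    [0.90, 0.02, 0.03, 0.03, 0.02],  # blank
    [0.05, 0.05, 0.80, 0.05, 0.05],  # i
    [0.10, 0.05, 0.75, 0.05, 0.05],  # i (repeat, collapsed)
    [0.85, 0.05, 0.04, 0.03, 0.03],  # blank
    [0.05, 0.05, 0.05, 0.80, 0.05],  # h
    [0.05, 0.05, 0.05, 0.05, 0.80],  # ao
    [0.10, 0.05, 0.05, 0.10, 0.70],  # ao (repeat, collapsed)
]

decoded = ctc_greedy_decode(scores)
decoded_text = " ".join(vocab[i] for i in decoded)
print(decoded_text)
```

In a full system, a language model and a beam search would typically replace this greedy pass; the sketch only shows how frame-level outputs map to a label sequence.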
DOI: http://dx.doi.org/10.1044/2019_JSLHR-S-18-0313
Waste Manag
December 2024
Department of Civil Engineering, University of Birmingham, Birmingham B15 2TT, United Kingdom.
Recycling waste glass (WG) can be time-consuming, costly, and impractical. However, its incorporation into concrete significantly reduces environmental impact and carbon emissions. This paper introduces machine learning (ML) to civil engineering to optimise WG utilisation in concrete, supporting sustainability objectives.
Surv Geophys
May 2024
NOAA/Pacific Marine Environmental Laboratory, Seattle, WA 98115 USA.
Satellite observations from the Clouds and the Earth's Radiant Energy System show that Earth's energy imbalance has doubled from 0.5 ± 0.2 W m⁻² during the first 10 years of this century to 1.
BMJ Open
December 2024
Department of Pediatric Surgery, The Affiliated Hospital of Guizhou Medical University, Guiyang, Guizhou, China
Objectives: We aim to delineate the digestive congenital abnormalities burden in children under 14 years old between 1990 and 2021.
Design: We used data from the Global Burden of Disease (GBD) 2021 database to evaluate the burden of digestive congenital abnormalities with different measures in 204 countries and territories from 1990 to 2021. We present precise estimations with 95% uncertainty intervals.
BMC Emerg Med
December 2024
Health Management and Economics Research Center, Health Management Research Institute, Iran University of Medical Sciences, Tehran, Iran.
Background: The ability of hospitals to provide special care services for critically ill or injured patients during emergencies and disasters poses significant challenges that necessitate in-place response plans. The increasing frequency of such events, coupled with limited hospital resources and increasing patient volumes, underscores the urgency of addressing these issues. Specifically, this study aims to identify the challenges faced by hospitals in providing special services during disasters.
JAMIA Open
December 2024
Center for Home Care Policy & Research, VNS Health, New York, NY 10017, United States.
Objectives: As artificial intelligence evolves, integrating speech processing into home healthcare (HHC) workflows is increasingly feasible. Audio-recorded communications enhance risk identification models, with automatic speech recognition (ASR) systems as a key component. This study evaluates the transcription accuracy and equity of 4 ASR systems, namely Amazon Web Services (AWS) General, AWS Medical, Whisper, and Wave2Vec, in transcribing patient-nurse communication in US HHC, focusing on their ability to accurately transcribe speech from Black and White English-speaking patients.