Comparison of predictive measures of speech recognition after noise reduction processing.

Karolina Smeds Arne Leijon Florian Wolters Anders Hammarstedt Sara Båsjö Sofia Hertzman

J Acoust Soc Am

ORCA Europe, Widex A/S, Maria Bangata 4, SE-118 63 Stockholm, Sweden.

Published: September 2014

A number of measures were evaluated with regard to their ability to predict the speech-recognition benefit of single-channel noise reduction (NR) processing. Three NR algorithms and a reference condition were used in the evaluation. Twenty listeners with impaired hearing and ten listeners with normal hearing participated in a blinded laboratory study. An adaptive speech test was used. The speech test produces results in terms of signal-to-noise ratios that correspond to equal speech recognition performance (in this case 80% correct) with and without the NR algorithms. This facilitates a direct comparison between predicted and experimentally measured effects of noise reduction algorithms on speech recognition. The experimental results were used to evaluate nine different predictive measures, one in two variants. The best predictions were found with the Coherence Speech Intelligibility Index (CSII) [Kates and Arehart (2005), J. Acoust. Soc. Am. 117(4), 2224-2237]. In general, measures using correlation between the clean speech and the processed noisy speech, as well as other measures that are based on short-time analysis of speech and noise, seemed most promising.

Download full-text PDF	Source
http://dx.doi.org/10.1121/1.4892766	DOI Listing

Publication Analysis

Top Keywords

speech recognition

noise reduction

speech

predictive measures

reduction processing

speech test

measures

comparison predictive

measures speech

noise

Similar Publications

ClinClip: a Multimodal Language Pre-training model integrating EEG data for enhanced English medical listening assessment.

Front Neurosci

January 2025

The Basic Department, The Tourism College of Changchun University, Changchun, China.

Guangyu Sun

Introduction: In the field of medical listening assessments,accurate transcription and effective cognitive load management are critical for enhancing healthcare delivery. Traditional speech recognition systems, while successful in general applications often struggle in medical contexts where the cognitive state of the listener plays a significant role. These conventional methods typically rely on audio-only inputs and lack the ability to account for the listener's cognitive load, leading to reduced accuracy and effectiveness in complex medical environments.

View Article and Find Full Text PDF

Similar Publications

Ghadeer-speech-crowd-corpus: Speech dataset.

Data Brief

February 2025

Computer Science Department, College of Science, University of Baghdad, Iraq.

Ghadeer Qasim Ali Husam Ali Abdulmohsin

The availability of raw data is a considerable challenge across most branches of science. In the absence of data, neither experiments can be conducted nor development can be undertaken. Despite their importance, raw data are still lacking across many scientific fields.

View Article and Find Full Text PDF

Similar Publications

Biological, linguistic, and individual factors govern voice qualitya).

J Acoust Soc Am

January 2025

USC Viterbi School of Engineering, University of Southern California, Los Angeles, California 90089-1455, USA.

Jody Kreiman Yoonjeong Lee

Voice quality serves as a rich source of information about speakers, providing listeners with impressions of identity, emotional state, age, sex, reproductive fitness, and other biologically and socially salient characteristics. Understanding how this information is transmitted, accessed, and exploited requires knowledge of the psychoacoustic dimensions along which voices vary, an area that remains largely unexplored. Recent studies of English speakers have shown that two factors related to speaker size and arousal consistently emerge as the most important determinants of quality, regardless of who is speaking.

View Article and Find Full Text PDF

Similar Publications

LipBengal: Pioneering Bengali lip-reading dataset for pronunciation mapping through lip gestures.

Data Brief

February 2025

Department of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology (MIST), Dhaka 1216, Bangladesh.

Md Tanvir Rahman Sahed Md Tanjil Islam Aronno Hussain Nyeem Md Abdul Wahed Tashrif Ahsan

The dataset represents a significant advancement in Bengali lip-reading and visual speech recognition research, poised to drive future applications and technological progress. Despite Bengali's global status as the seventh most spoken language with approximately 265 million speakers, linguistically rich and widely spoken languages like Bengali have been largely overlooked by the research community. fills this gap by offering a pioneering dataset tailored for Bengali lip-reading, comprising visual data from 150 speakers across 54 classes, encompassing Bengali phonemes, alphabets, and symbols.

View Article and Find Full Text PDF

Similar Publications

Pupillometry and perceived listening effort for cochlear implant users-a comparison of three speech-in-noise tests.

Int J Audiol

January 2025

Department of Otorhinolaryngology and Head & Neck Surgery, Leiden University Medical Center, Leiden, Netherlands.

Hendrik Christiaan Stronks Paula Louisa Jansen Robin van Deurzen Jeroen Johannes Briaire Johan Hubertus Maria Frijns

Objective: Measuring listening effort using pupillometry is challenging in cochlear implant (CI) users. We assess three validated speech tests (Matrix, LIST, and DIN) to identify the optimal speech material for measuring peak-pupil-dilation (PPD) in CI users as a function of signal-to-noise ratio (SNR).

Design: Speech tests were administered in quiet and two noisy conditions, namely at the speech recognition threshold (0 dB re SRT), i.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!