This study examined the redundancy of spectral and temporal information in everyday sentences, which were reduced to 16 rectangular spectral bands having center frequencies ranging from 250 to 8000 Hz, spaced at 1/3 octave intervals. High-order filtering eliminated contributions from transition bands, and the widths of the resulting effectively rectangular speech bands were varied from 4% down to 0.5%. Intelligibility of these sub-critical bandwidth stimuli ranged from nearly perfect in the 4% bandwidth conditions down to nearly zero in the 0.5% bandwidth conditions. However, a large intelligibility increase was obtained under the narrower filtering conditions when the speech bands were used to vocode broader noise bands that approximated critical bandwidths (ERBn) at the 16 center frequencies. For example, the 0.5%- and 1%-bandwidth speech stimuli were only about 1% and 20% intelligible, respectively, whereas scores of about 26% and 60%, respectively, were obtained for the ERBn-wide noise bands modulated by the speech bands. These large intelligibility increases occurred despite elimination of spectral fine structure and the addition of stochastic fluctuations to the speech-envelope cues. Results from additional experiments indicate that optimal temporal processing requires that envelope cues stimulate a majority of the fibers comprising an ERBn.
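
To make the band layout concrete, the following minimal sketch computes the 16 center frequencies (250 to 8000 Hz in 1/3-octave steps) and compares the narrow speech-band widths with an ERBn estimate at each center frequency. Two points are assumptions, not statements from the abstract: the percentage bandwidths are taken to be relative to center frequency, and ERBn is approximated with the widely used Glasberg and Moore formula, ERBn = 24.7(4.37 f/1000 + 1) with f in Hz. The function names are illustrative only.

```python
# Sketch only: band layout from the abstract plus an ASSUMED ERBn formula.

def center_frequencies(f_low=250.0, n_bands=16, steps_per_octave=3):
    """Center frequencies spaced at 1/3-octave intervals, starting at f_low."""
    return [f_low * 2.0 ** (k / steps_per_octave) for k in range(n_bands)]

def erbn_hz(f_hz):
    """ERBn in Hz. Assumption: Glasberg and Moore approximation;
    the abstract does not say which ERBn formula was used."""
    return 24.7 * (4.37 * f_hz / 1000.0 + 1.0)

def band_width_hz(f_hz, percent):
    """Band width in Hz. Assumption: the percentage is relative to
    the band's center frequency."""
    return f_hz * percent / 100.0

centers = center_frequencies()

# Compare the widest (4%) and narrowest (0.5%) speech bands with ERBn.
for fc in centers:
    print(f"{fc:8.1f} Hz  4%: {band_width_hz(fc, 4.0):7.2f} Hz  "
          f"0.5%: {band_width_hz(fc, 0.5):6.2f} Hz  "
          f"ERBn: {erbn_hz(fc):7.2f} Hz")
```

Under these assumptions even the 4% bands are narrower than the ERBn estimate at every center frequency, which is consistent with the abstract's description of the stimuli as sub-critical.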

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2897725
DOI: http://dx.doi.org/10.1121/1.3460364
