A psychoacoustic method to find the perceptual cues of stop consonants in natural speech.

J Acoust Soc Am

Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA.

Published: April 2010

Article Abstract

Synthetic speech has been widely used in the study of speech cues. A serious disadvantage of this method is that it requires prior knowledge about the cues to be identified in order to synthesize the speech. Incomplete or inaccurate hypotheses about the cues often lead to speech sounds of low quality. In this research, a psychoacoustic method named three-dimensional deep search (3DDS) is developed to explore the perceptual cues of stop consonants in naturally produced speech. For a given sound, it measures the contribution of each subcomponent to perception by time-truncating the speech, highpass/lowpass filtering it, or masking it with white noise. The AI-gram, a visualization tool that simulates auditory peripheral processing, is used to predict the audible components of the speech sound. The results generally agree with the classical finding that stops are characterized by a short-duration burst followed by an F2 transition, which suggests that the 3DDS method is effective. However, it is also shown that /ba/ and /pa/ may have a wideband click as the dominant cue, and that the F2 transition is not necessary for the perception of /ta/ and /ka/. Moreover, many stop consonants contain conflicting cues that are characteristic of competing sounds. The robustness of a consonant sound to noise is determined by the intensity of the dominant cue.
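The three manipulations named in the abstract (time truncation, highpass/lowpass filtering, and white-noise masking) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the filter design, cutoff frequencies, truncation steps, or signal-to-noise ratios, so the first-order filters and every function name and parameter below are assumptions chosen for illustration.

```python
import math
import random


def truncate(signal, sample_rate, start_s):
    """Time truncation: drop everything before start_s seconds."""
    return signal[int(round(start_s * sample_rate)):]


def one_pole_lowpass(signal, sample_rate, cutoff_hz):
    """First-order lowpass filter (an assumed stand-in for the study's filters)."""
    alpha = 1.0 - math.exp(-2.0 * math.pi * cutoff_hz / sample_rate)
    out, state = [], 0.0
    for x in signal:
        state += alpha * (x - state)  # move the state toward the input
        out.append(state)
    return out


def one_pole_highpass(signal, sample_rate, cutoff_hz):
    """Complementary highpass: the input minus its lowpassed version."""
    low = one_pole_lowpass(signal, sample_rate, cutoff_hz)
    return [x - l for x, l in zip(signal, low)]


def power(signal):
    """Mean squared amplitude of a non-empty signal."""
    return sum(x * x for x in signal) / len(signal)


def add_white_noise(signal, snr_db, seed=0):
    """Mask the signal with Gaussian white noise at the requested SNR in dB."""
    rng = random.Random(seed)
    noise = [rng.gauss(0.0, 1.0) for _ in signal]
    # Scale the noise so that signal power / noise power equals the target SNR.
    target_noise_power = power(signal) / (10.0 ** (snr_db / 10.0))
    gain = math.sqrt(target_noise_power / power(noise))
    return [x + gain * n for x, n in zip(signal, noise)]


if __name__ == "__main__":
    fs = 16000  # assumed sample rate in Hz
    # Toy stand-in for a speech token: a 100 Hz tone plus a 6 kHz tone.
    toy = [math.sin(2 * math.pi * 100 * n / fs)
           + math.sin(2 * math.pi * 6000 * n / fs) for n in range(fs // 10)]
    print(len(truncate(toy, fs, 0.02)))
    print(power(one_pole_lowpass(toy, fs, 500.0)))
    print(power(add_white_noise(toy, 12.0)))
```

In a listening experiment of this kind, each function would be swept over a range of values (truncation time, cutoff frequency, SNR) and the point at which recognition collapses would locate the cue in time, frequency, and intensity respectively. The sweep ranges and scoring are not given in the abstract and are left out here.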

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2865708
DOI: http://dx.doi.org/10.1121/1.3295689

Publication Analysis

Top Keywords

psychoacoustic method: 8
perceptual cues: 8
cues consonants: 8
speech: 8
speech sound: 8
dominant cue: 8
cues: 6
method find: 4
find perceptual: 4
consonants natural: 4

Similar Publications

Purpose:  The purpose of this study is to examine whether the gap detection thresholds (GDT) obtained are similar between an adaptive and nonadaptive procedure in children and adults.

Study Design:  Standard group comparison.

Study Sample:  Eighteen typically developing children and 20 young adults with hearing thresholds of 25 dB HL or lower participated in this study.

Perceptually enhanced spectral distance metric for head-related transfer function quality prediction.

J Acoust Soc Am

December 2024

Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190, China.

Given the substantial time and complexity involved in the perceptual evaluation of head-related transfer function (HRTF) processing, there is considerable value in adopting numerical assessment. Although many numerical methods have been introduced in recent years, monaural spectral distance metrics such as log-spectral distortion (LSD) remain widely used despite their significant limitations. In this study, listening tests were conducted to investigate the correlation between LSD and the auditory perception of HRTFs.

Background: Difficulties with speech-in-noise perception in autism spectrum disorders (ASD) may be associated with impaired analysis of speech sounds, such as vowels, which represent the fundamental phoneme constituents of human speech. Vowels elicit early (< 100 ms) sustained processing negativity (SPN) in the auditory cortex that reflects the detection of an acoustic pattern based on the presence of formant structure and/or periodic envelope information (f0) and its transformation into an auditory "object".

Methods: We used magnetoencephalography (MEG) and individual brain models to investigate whether SPN is altered in children with ASD and whether this deficit is associated with impairment in their ability to perceive speech in the background of noise.

Data on neurophysiological and psychological responses to audio immersive experience in stereo and 3D audio formats.

BMC Res Notes

December 2024

Tecnologico de Monterrey, Escuela de Ingenieria y Ciencias, Ave. Eugenio Garza Sada 2501, 64849, Monterrey, N.L, México.

Objectives: This dataset presents demographic, psychological, auditory and neurophysiological information of 31 volunteers, who participated in an experiment measuring the auditory immersive experience in two audio formats: stereophonic downmix and three-dimensional audio. This dataset could help understand the objectiveness (based on the nervous system response) behind the subjectiveness of immersion brought about by the audio format (based on the listener evaluation). The final objective of this dataset is to study the psychological and neurophysiological responses of immersive attributes in auditory events in future studies.

Effect of Simultaneous Use of Neuromodulation and Acoustic Stimulation in the Management of Tinnitus.

Indian J Otolaryngol Head Neck Surg

December 2024

Department of Audiology, School of Rehabilitation Sciences, Hamadan University of Medical Sciences, Fahmideh Street, Pazhoohesh Square, Hamadan, 6517838736 Iran.

Tinnitus is a relatively common disorder with a heterogeneous nature. Combining methods in its treatment may offer greater effectiveness. We aim to explore the impact of concurrently applying tRNS neuromodulation and acoustic stimulation for tinnitus control.
