Synthetic speech has been widely used in the study of speech cues. A serious disadvantage of this method is that it requires prior knowledge about the cues to be identified in order to synthesize the speech. Incomplete or inaccurate hypotheses about the cues often lead to speech sounds of low quality. In this research a psychoacoustic method, named three-dimensional deep search (3DDS), is developed to explore the perceptual cues of stop consonants from naturally produced speech. For a given sound, it measures the contribution of each subcomponent to perception by time truncating, highpass/lowpass filtering, or masking the speech with white noise. The AI-gram, a visualization tool that simulates the auditory peripheral processing, is used to predict the audible components of the speech sound. The results are generally in agreement with the classical studies that stops are characterized by a short duration burst followed by a F2 transition, suggesting the effectiveness of the 3DDS method. However, it is also shown that /ba/ and /pa/ may have a wide band click as the dominant cue. F2 transition is not necessary for the perception of /ta/ and /ka/. Moreover, many stop consonants contain conflicting cues that are characteristic of competing sounds. The robustness of a consonant sound to noise is determined by the intensity of the dominant cue.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2865708 | PMC |
http://dx.doi.org/10.1121/1.3295689 | DOI Listing |
J Am Acad Audiol
July 2024
USM Audiology Clinic, School of Speech and Hearing Sciences, The University of Southern Mississippi, Hattiesburg, Mississippi.
Purpose: The purpose of this study is to examine whether the gap detection thresholds (GDT) obtained are similar between an adaptive and nonadaptive procedure in children and adults.
Study Design: Standard group comparison.
Study Sample: Eighteen typically developing children and 20 young adults with hearing thresholds of 25 dB HL or lower participated in this study.
J Acoust Soc Am
December 2024
Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190, China.
Given the substantial time and complexity involved in the perceptual evaluation of head-related transfer function (HRTF) processing, there is considerable value in adopting numerical assessment. Although many numerical methods have been introduced in recent years, monaural spectral distance metrics such as log-spectral distortion (LSD) remain widely used despite their significant limitations. In this study, listening tests were conducted to investigate the correlation between LSD and the auditory perception of HRTFs.
View Article and Find Full Text PDFJ Neurodev Disord
December 2024
Center for Neurocognitive Research (MEG Center), Moscow State University of Psychology and Education, Moscow, Russian Federation.
Background: Difficulties with speech-in-noise perception in autism spectrum disorders (ASD) may be associated with impaired analysis of speech sounds, such as vowels, which represent the fundamental phoneme constituents of human speech. Vowels elicit early (< 100 ms) sustained processing negativity (SPN) in the auditory cortex that reflects the detection of an acoustic pattern based on the presence of formant structure and/or periodic envelope information (f0) and its transformation into an auditory "object".
Methods: We used magnetoencephalography (MEG) and individual brain models to investigate whether SPN is altered in children with ASD and whether this deficit is associated with impairment in their ability to perceive speech in the background of noise.
BMC Res Notes
December 2024
Tecnologico de Monterrey, Escuela de Ingenieria y Ciencias, Ave. Eugenio Garza Sada 2501, 64849, Monterrey, N.L, México.
Objectives: This dataset presents demographic, psychological, auditory and neurophysiological information of 31 volunteers, who participated in an experiment measuring the auditory immersive experience in two audio formats: stereophonic downmix and three-dimensional audio. This dataset could help understand the objectiveness (based on the nervous system response) behind the subjectiveness of immersion brought about by the audio format (based on the listener evaluation). The final objective of this dataset is to study the psychological and neurophysiological responses of immersive attributes in auditory events in future studies.
View Article and Find Full Text PDFIndian J Otolaryngol Head Neck Surg
December 2024
Department of Audiology, School of Rehabilitation Sciences, Hamadan University of Medical Sciences, Fahmideh Street, Pazhoohesh Square, Hamadan, 6517838736 Iran.
Tinnitus is a relatively common disorder with a heterogeneous nature. Combining methods in its treatment may offergreater effectiveness. We aim to explore the impact of concurrently applying tRNS neuromodulation and acousticstimulation for tinnitus control.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!