Two vocal qualities, twang and yawn, were synthesized and rated perceptually. The stimuli consisted of synthesized vocal productions of a sentence-length utterance 'ya ya ya ya ya,' which had speech-like intonation. In a continuum transformation from normal to twang, the area in the pharynx was gradually decreased, along with vocal tract shortening and a decreased open quotient in the glottal airflow. In a continuum transformation toward yawn, the area in the pharynx was gradually increased, along with vocal tract lengthening and an increased open quotient. The normal (untransformed) vocal tract area was pre-determined by earlier studies involving MRI scans of a human subject's vocal tract. Listeners were asked to rate (on a scale from 1-10) the 'amount of twang' in one listening session and the 'amount of yawn' in another listening session. Overall, the perception of twang increased directly with pharyngeal area narrowing, vocal tract shortening, and decreased open quotient. The perception of yawn increased with pharyngeal area widening, vocal tract lengthening, and increased open quotient. Adjustments of one parameter alone yielded less significant perceptual changes than the above combinations, with open quotient showing the greatest effect in isolation. Listeners demonstrated variable perceptions in both continua with poor inter-subject, intra-subject, and inter-group reliability.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1080/14015430310018874 | DOI Listing |
Indian J Otolaryngol Head Neck Surg
February 2025
Department of ENT, Mahatma Gandhi Medical College and Research Institute, Sri Balaji Vidyapeeth University, Pillaiyarkuppam, Pondicherry, 607402 India.
Laryngopharyngeal reflux disease (LPRD) is characterized by the backflow of gastric contents into the laryngopharynx, distinct from gastroesophageal reflux disease (GERD). Prevalence among otolaryngology patients ranges from 4 to 30% and being the major cause for hoarseness of voice. Common symptoms include hoarseness, chronic coughing, globus sensation, throat clearing and endoscopic evaluation reveals signs like posterior commissure hypertrophy and vocal fold edema.
View Article and Find Full Text PDFJ Voice
March 2025
Roy J. Carver Department of Biomedical Engineering, University of Iowa, Iowa City, Iowa; Department of Radiology, University of Iowa, Iowa City, Iowa. Electronic address:
Objectives: Accurate segmentation of the vocal tract from MRI data is essential for various voice, speech, and singing applications. Manual segmentation is time-intensive and susceptible to errors. This study aimed to evaluate the efficacy of deep learning algorithms for automatic vocal tract segmentation from 3D MRI.
View Article and Find Full Text PDFAnnu Int Conf IEEE Eng Med Biol Soc
July 2024
Speech impairment resulting from laryngectomy causes severe physiological and psychological distress to laryngectomee. In clinical practice, the upper vocal tract articulatory organs function normally in most laryngectomee. The potential to reconstruct speech by leveraging articulatory information is of significant importance, offering a meaningful contribution to the effective rehabilitation of speech in these patients.
View Article and Find Full Text PDFAnnu Int Conf IEEE Eng Med Biol Soc
July 2024
This study focuses on how different modalities of human communication can be used to distinguish between healthy controls and subjects with schizophrenia who exhibit strong positive symptoms. We developed a multi-modal schizophrenia classification system using audio, video, and text. Facial action units and vocal tract variables were extracted as low-level features from video and audio respectively, which were then used to compute high-level coordination features that served as the inputs from the audio and video modalities.
View Article and Find Full Text PDFNat Hum Behav
March 2025
Department of Neurological Surgery, University of California, San Francisco, CA, USA.
Voluntary, flexible stopping of speech output is an essential aspect of speech motor control, especially during natural conversations. The cognitive and neural mechanisms of speech inhibition are not well understood. Here we have recorded direct high-density cortical activity while participants engaged in continuous speech production and were visually cued to stop speaking.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!