Decoding the dancing of the tongue: A model-based learning approach to phonetic targets in coarticulation.

Jianguo Wei, Guochen Bai, Wenhuan Lu, Jianwu Dang

J Acoust Soc Am

Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518000, China.

Published: October 2024

A model that synthesizes average frequency components from selected sentences in an electromagnetic articulography database has been developed. It revealed the dual roles of the tongue: the dorsum acts like a carrier wave, and the tip acts as a modulation signal within the articulatory domain. The model clarifies how anticipatory coarticulation arises during speech planning. It is optimized in two stages, statistical estimation followed by refinement, to represent both carryover and anticipation. The model's base layer, grounded in physiological insights, recovers carryover targets, while its upper layer captures anticipation. Optimization identified a unique phonetic target for each phoneme, giving insight into how virtual targets are formed during speech planning. The simulations align closely with empirical data, with an average error of 0.18 cm, and, together with extensive listening tests, attest to the model's accuracy and to improved speech synthesis quality.
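The abstract reports an average error of 0.18 cm but does not say how that figure is computed. As a minimal sketch, assuming the error is the mean point-wise Euclidean distance between simulated and measured sensor positions, it could be calculated as follows. The function name `mean_euclidean_error` and the sample coordinates are hypothetical and are not taken from the paper.

```python
import math


def mean_euclidean_error(simulated, measured):
    """Average point-wise Euclidean distance between two trajectories.

    Each trajectory is a sequence of (x, y) sensor positions in cm,
    sampled at the same time instants. This metric is an assumption;
    the abstract does not define its error measure.
    """
    if len(simulated) != len(measured):
        raise ValueError("trajectories must have the same number of samples")
    if not simulated:
        raise ValueError("trajectories must not be empty")
    distances = [math.dist(s, m) for s, m in zip(simulated, measured)]
    return sum(distances) / len(distances)


# Hypothetical tongue-tip positions in cm; illustrative values only.
measured = [(0.0, 0.0), (1.0, 0.5), (2.0, 1.0)]
simulated = [(0.0, 0.3), (1.0, 0.5), (2.4, 1.0)]
error_cm = mean_euclidean_error(simulated, measured)
```

With real articulography data, the same calculation would typically be run per sensor (for example tongue tip and tongue dorsum) and then averaged, though the abstract does not confirm that procedure.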

DOI: http://dx.doi.org/10.1121/10.0032362

Similar Publications

Impact of High- and Low-Pass Acoustic Filtering on Audiovisual Speech Redundancy and Benefit in Children.

Ear Hear

December 2024

Center for Hearing Research, Boys Town National Research Hospital, Omaha, Nebraska, USA.

Kaylah Lalonde, Grace Dwyer, Adam Bosen, Abby Pitts

Objectives: To investigate the influence of frequency-specific audibility on audiovisual benefit in children, this study examined the impact of high- and low-pass acoustic filtering on auditory-only and audiovisual word and sentence recognition in children with typical hearing. Previous studies show that visual speech provides greater access to consonant place of articulation than to other consonant features, and that low-pass filtering strongly affects perception of acoustic cues to consonant place of articulation. This suggests visual speech may be particularly useful when acoustic speech is low-pass filtered because it provides complementary information about consonant place of articulation.

Articulatory correlates of consonantal length contrasts: The case of Japanese mimetic geminates.

JASA Express Lett

January 2025

Department of Linguistics, Yale University, New Haven, Connecticut 06520,

Francesco Burroni, Shigeto Kawahara, Jason A Shaw

This study investigates the articulatory correlates of consonantal length contrasts in Japanese mimetic words using electromagnetic articulography data. Regression and dynamic time warping analyses applied to intragestural timing, kinematic properties, and intergestural timing reveal that Japanese geminates are characterized by longer closure phases, longer gestural plateaus, higher tongue tip positions, larger movements, and lower stiffness. Geminates also exhibit distinct timing relationships with adjacent vowels, specifically, longer times to target that allow for longer preceding vowels.

The impact of typological similarities and differences between German and Italian on the acquisition of language-specific phonetic cues in bilingual children: insights from the T-complex.

Front Hum Neurosci

December 2024

Ph.D. Program in Speech-Language-Hearing Sciences, The Graduate Center, The City University of New York Graduate Center, New York, NY, United States.

Theresa Bloder, Yasuaki Shinohara, Tanja Rinker, Valerie L Shafer

Introduction: Lateral temporal neural measures (Na and T-complex Ta and Tb) of the auditory evoked potential (AEP) index auditory/speech processing and have been observed in children and adults. While Na is already present in children under 4 years of age, Ta emerges from 4 years of age, and Tb appears even later. The T-complex has been found to be sensitive to language experience in Spanish-English and Turkish-German children and adults.

Politeness and Prosody: The Effect of Power, Distance, and Imposition on Pitch Contours in Spanish.

Lang Speech

January 2025

School of Languages and Cultures, Purdue University, USA.

Bruno Staszkiewicz

Research in the last few decades has examined the intersection between phonetics and politeness in multiple languages. While most of the studies have analyzed the role of politeness on suprasegmental features (i.e.

The Phonological Mapping Negativity (PMN) as a language-specific component: Exploring responses to linguistic vs musical mismatch.

PLoS One

December 2024

Department of Linguistics, The University of Kansas, Lawrence, Kansas, United States of America.

Jen Lewendon, James Britton, Stephen Politzer-Ahles

Article Synopsis

The Phonological Mapping Negativity (PMN) is a brain response that reflects how the brain processes phonological (speech sound) information, particularly when an expected phoneme is violated.
In a study, participants listened to three-syllable words and three-note tunes, focusing either on the language or music, and were tested for their reactions when the first sounds mismatched what they expected.
Results showed the PMN only occurred with phoneme mismatches and not with musical mismatches, suggesting it might be specifically sensitive to language, but further investigation is needed to clarify its relationship with other brain responses like the N400.
