A model synthesizing average frequency components from select sentences in an electromagnetic articulography database has been crafted. This revealed the dual roles of the tongue: its dorsum acts like a carrier wave, and the tip acts as a modulation signal within the articulatory realm. This model illuminates anticipatory coarticulation's subtleties during speech planning. It undergoes rigorous, two-stage optimization: statistical estimation and refinement to depict carryover and anticipation. The model's base, rooted in physiological insights, deciphers carryover targets while its upper layer captures anticipation. Optimization has pinpointed unique phonetic targets for each phoneme, providing deep insights into virtual target formation during speech planning. These simulations, aligning closely with empirical data and marked by a mere 0.18 cm average error, along with extensive listening tests attest to the model's accuracy and enhanced speech synthesis quality.

Download full-text PDF

Source
http://dx.doi.org/10.1121/10.0032362DOI Listing

Publication Analysis

Top Keywords

phonetic targets
8
speech planning
8
decoding dancing
4
dancing tongue
4
tongue model-based
4
model-based learning
4
learning approach
4
approach phonetic
4
targets coarticulationa
4
coarticulationa model
4

Similar Publications

Objectives: To investigate the influence of frequency-specific audibility on audiovisual benefit in children, this study examined the impact of high- and low-pass acoustic filtering on auditory-only and audiovisual word and sentence recognition in children with typical hearing. Previous studies show that visual speech provides greater access to consonant place of articulation than other consonant features and that low-pass filtering has a strong impact on perception on acoustic consonant place of articulation. This suggests visual speech may be particularly useful when acoustic speech is low-pass filtered because it provides complementary information about consonant place of articulation.

View Article and Find Full Text PDF

This study investigates the articulatory correlates of consonantal length contrasts in Japanese mimetic words using electromagnetic articulography data. Regression and dynamic time warping analyses applied to intragestural timing, kinematic properties, and intergestural timing reveal that Japanese geminates are characterized by longer closure phases, longer gestural plateaus, higher tongue tip positions, larger movements, and lower stiffness. Geminates also exhibit distinct timing relationships with adjacent vowels, specifically, longer times to target that allow for longer preceding vowels.

View Article and Find Full Text PDF

Introduction: Lateral temporal neural measures (Na and T-complex Ta and Tb) of the auditory evoked potential (AEP) index auditory/speech processing and have been observed in children and adults. While Na is already present in children under 4 years of age, Ta emerges from 4 years of age, and Tb appears even later. The T-complex has been found to be sensitive to language experience in Spanish-English and Turkish-German children and adults.

View Article and Find Full Text PDF

Research in the last few decades has examined the intersection between phonetics and politeness in multiple languages. While most of the studies have analyzed the role of politeness on suprasegmental features (i.e.

View Article and Find Full Text PDF
Article Synopsis
  • The Phonological Mismatch Negativity (PMN) is a brain response indicating how the brain processes phonological (speech sound) information, particularly when there's a violation of expected phonemes.
  • In a study, participants listened to three-syllable words and three-note tunes, focusing either on the language or music, and were tested for their reactions when the first sounds mismatched what they expected.
  • Results showed the PMN only occurred with phoneme mismatches and not with musical mismatches, suggesting it might be specifically sensitive to language, but further investigation is needed to clarify its relationship with other brain responses like the N400.
View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!