A model of acoustic interspeaker variability based on the concept of formant-cavity affiliation.

Lian Apostol Pascal Perrier Gérard Bailly

J Acoust Soc Am

Institut de la Communication Parlée, UMR CNRS 5009, INPG, Grenoble, France.

Published: January 2004

A method is proposed to model the interspeaker variability of formant patterns for oral vowels. It is assumed that this variability originates in the differences existing among speakers in the respective lengths of their front and back vocal-tract cavities. In order to characterize, from the spectral description of the acoustic speech signal, these vocal-tract differences between speakers, each formant is interpreted, according to the concept of formant-cavity affiliation, as a resonance of a specific vocal-tract cavity. Its frequency can thus be directly related to the corresponding cavity length, and a transformation model can be proposed from a speaker A to a speaker B on the basis of the frequency ratios of the formants corresponding to the same resonances. In order to minimize the number of sounds to be recorded for each speaker in order to carry out this speaker transformation, the frequency ratios are exactly computed only for the three extreme cardinal vowels [i, a, u] and they are approximated for the remaining vowels through an interpolation function. The method is evaluated through its capacity to transform the (F1,F2) formant patterns of eight oral vowels pronounced by five male speakers into the (F1,F2) patterns of the corresponding vowels generated by an articulatory model of the vocal tract. The resulting formant patterns are compared to those provided by normalization techniques published in the literature. The proposed method is found to be efficient, but a number of limitations are also observed and discussed. These limitations can be associated with the formant-cavity affiliation model itself or with a possible influence of speaker-specific vocal-tract geometry in the cross-sectional direction, which the model might not have taken into account.

Download full-text PDF	Source
http://dx.doi.org/10.1121/1.1631946	DOI Listing

Publication Analysis

Top Keywords

formant-cavity affiliation

formant patterns

interspeaker variability

concept formant-cavity

patterns oral

oral vowels

frequency ratios

model

vowels

model acoustic

Similar Publications

A magnetic resonance imaging-based articulatory and acoustic study of "retroflex" and "bunched" American English /r/.

J Acoust Soc Am

June 2008

Speech Communication Laboratory, Institute of Systems Research, and Department of Electrical and Computer Engineering, University of Maryland, College Park, Maryland 20742, USA.

Xinhui Zhou Carol Y Espy-Wilson Suzanne Boyce Mark Tiede Christy Holland

Speakers of rhotic dialects of North American English show a range of different tongue configurations for /r/. These variants produce acoustic profiles that are indistinguishable for the first three formants [Delattre, P., and Freeman, D.

View Article and Find Full Text PDF

Similar Publications

A model of acoustic interspeaker variability based on the concept of formant-cavity affiliation.

J Acoust Soc Am

January 2004

Institut de la Communication Parlée, UMR CNRS 5009, INPG, Grenoble, France.

Lian Apostol Pascal Perrier Gérard Bailly

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!