On normalized MSE analysis of speech fundamental frequency in the cochlear implant-like spectrally reduced speech.

Cong-Thanh Do Dominique Pastor André Goalic

IEEE Trans Biomed Eng

Institut TELECOM, TELECOM Bretagne, Unit Mixte de Recherche Centre National de la Recherche Scientifique 3192 Laboratoire des Sciences et Techniques de l'Information, de la Communication et de la Connaissance, Brest, France.

Published: March 2010

In this paper, we present a quantitative study on the speech fundamental frequency (F0) of the cochlear implant-like spectrally reduced speech (SRS). The SRS was synthesized from the subband amplitude and frequency modulations (AM and FM) of original clean speech utterances selected from the TI-digits database. The SRS synthesis algorithm was derived from the frequency amplitude modulation encoding (FAME) strategy, proposed by Nie et al., 2005. The normalized mses (NMSEs), calculated between the F0 of the original clean speech and that of the SRSs, were analyzed. The NMSEs analysis of F0 revealed the greater F0 distortion in the AM-based SRS, which is the acoustic simulation of present-day cochlear implants, compared to the FAME-based SRS. This evidence supports the fact that current cochlear implant users have difficulty in the speaker recognition task as reported by Zeng et al., 2005. Further, the analysis results showed that it is better to keep the rapidly varying FM components to reduce the F0 distortion in the FAME-based SRS at low spectral resolution.

Download full-text PDF	Source
http://dx.doi.org/10.1109/TBME.2009.2031097	DOI Listing

Publication Analysis

Top Keywords

speech fundamental

fundamental frequency

frequency cochlear

cochlear implant-like

implant-like spectrally

spectrally reduced

reduced speech

original clean

clean speech

fame-based srs

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!