Processing group delay spectrograms for study of formant and harmonic contours in speech signals.

J Acoust Soc Am

Department of Artificial Intelligence and Data Science, Koneru Lakshmaiah Education Foundation, Hyderabad 500075, India.

Published: October 2024

AI Article Synopsis

  • The paper studies formant and harmonic contours by processing group delay (GD) spectrograms of speech signals.
  • It builds on a recent result that the GD spectrogram can be obtained without phase wrapping. With modified single frequency filtering (SFF) analysis, peaks of the wideband-equivalent GD spectrogram trace formant contours, and peaks of the narrowband-equivalent GD spectrogram trace harmonic contours.
  • For synthetic speech, the observed formant contours match the ground truth. For natural speech (utterances from the TIMIT database), they match only approximately, and mostly in voiced regions. Automatic extraction of formant frequencies still needs further processing to eliminate spurious points.

Article Abstract

This paper studies formant and harmonic contours by processing the group delay (GD) spectrograms of speech signals. The GD spectrum is the negative derivative of the phase spectrum with respect to frequency. A recent study shows that the GD spectrogram can be obtained without phase wrapping. Formant frequency contours can be observed in a display of the peaks of the instantaneous wideband-equivalent GD spectrogram, derived using the modified single frequency filtering (SFF) analysis of speech signals. Harmonic frequency contours can be observed in a display of the peaks of the instantaneous narrowband-equivalent GD spectrogram, also derived using the modified SFF analysis. For synthetic speech signals, the observed formant contours match the ground truth formant contours from which the signal is derived. For natural speech signals, the observed formant contours approximately match the given ground truth formant contours, mostly in the voiced regions. The results are illustrated for several randomly selected utterances from the TIMIT database. While this study makes the formant contours visible in the display, automatic extraction of the formant frequencies needs further processing, including logic for eliminating spurious points without forcing the number of formants.
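
To make the definition of the GD spectrum concrete, here is a minimal sketch of one well-known way to compute group delay without unwrapping (or even explicitly computing) the phase. It relies on the textbook identity tau(w) = Re{Y(w) / X(w)}, where X is the DFT of a frame x[n] and Y is the DFT of n·x[n]. This is an illustration only: it is not the modified SFF analysis used in the paper, and the function name `group_delay` and the parameters `n_fft` and `eps` are assumptions introduced here.

```python
import numpy as np

def group_delay(frame, n_fft=512, eps=1e-12):
    """Group delay (in samples) of one frame, with no phase unwrapping.

    Illustrative sketch only; NOT the paper's modified SFF method.
    Uses tau(w) = -d(phase)/dw = Re{Y(w) / X(w)}, where
    X = DFT(x[n]) and Y = DFT(n * x[n]).
    """
    x = np.asarray(frame, dtype=float)
    n = np.arange(len(x))
    X = np.fft.rfft(x, n_fft)        # spectrum of the frame
    Y = np.fft.rfft(n * x, n_fft)    # spectrum of the time-weighted frame
    power = X.real**2 + X.imag**2
    # Re{Y/X} = (X_R*Y_R + X_I*Y_I) / |X|^2; eps guards against spectral nulls.
    return (X.real * Y.real + X.imag * Y.imag) / np.maximum(power, eps)

# Sanity check: an impulse delayed by 5 samples has a phase of -5*w,
# so its group delay is 5 samples at every frequency.
x = np.zeros(32)
x[5] = 1.0
tau = group_delay(x, n_fft=64)
print(np.allclose(tau, 5.0))  # True
```

Stacking such frame-wise outputs over time gives a GD spectrogram in the generic sense. The wideband-equivalent and narrowband-equivalent displays described in the abstract depend on the paper's SFF-based analysis, which this sketch does not attempt to reproduce.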

Source
http://dx.doi.org/10.1121/10.0032364

Publication Analysis

Top Keywords

speech signals: 24
formant contours: 16
contours: 9
processing group: 8
group delay: 8
delay spectrograms: 8
formant: 8
study formant: 8
formant harmonic: 8
harmonic contours: 8
