Predicting fundamental frequency from mel-frequency cepstral coefficients to enable speech reconstruction.

J Acoust Soc Am

School of Computing Sciences, University of East Anglia, Norwich, NR4 7TJ, United Kingdom.

Published: August 2005

This work proposes a method to reconstruct an acoustic speech signal solely from a stream of mel-frequency cepstral coefficients (MFCCs) as may be encountered in a distributed speech recognition (DSR) system. Previous methods for speech reconstruction have required, in addition to the MFCC vectors, fundamental frequency and voicing components. In this work the voicing classification and fundamental frequency are predicted from the MFCC vectors themselves using two maximum a posteriori (MAP) methods. The first method enables fundamental frequency prediction by modeling the joint density of MFCCs and fundamental frequency using a single Gaussian mixture model (GMM). The second scheme uses a set of hidden Markov models (HMMs) to link together a set of state-dependent GMMs, which enables a more localized modeling of the joint density of MFCCs and fundamental frequency. Experimental results on speaker-independent male and female speech show that accurate voicing classification and fundamental frequency prediction is attained when compared to hand-corrected reference fundamental frequency measurements. The use of the predicted fundamental frequency and voicing for speech reconstruction is shown to give very similar speech quality to that obtained using the reference fundamental frequency and voicing.

Download full-text PDF

Source
http://dx.doi.org/10.1121/1.1953269DOI Listing

Publication Analysis

Top Keywords

fundamental frequency
40
speech reconstruction
12
frequency voicing
12
frequency
10
fundamental
9
mel-frequency cepstral
8
cepstral coefficients
8
mfcc vectors
8
voicing classification
8
classification fundamental
8

Similar Publications

STRAIGHTMORPH: A Voice Morphing Tool for Research in Voice Communication Sciences.

Open Res Eur

January 2025

Center for Innovative Research and Liaison, Wakayama University, Wakayama, Wakayama Prefecture, Japan.

The purpose of this paper is to make easily available to the scientific community an efficient voice morphing tool called STRAIGHTMORPH and provide a short tutorial on its use with examples. STRAIGHTMORPH consists of a set of Matlab functions allowing the generation of high-quality, parametrically-controlled morphs of an arbitrary number of voice samples. A first step consists in extracting an 'mObject' for each voice sample, with accurate tracking of the fundamental frequency contour and manual definition of Time and Frequency anchors corresponding across samples to be morphed.

View Article and Find Full Text PDF

Age-related dependencies of electric and spectral powers in conventional frequency bands were studied by the newly proposed method of detailed spectral analysis. The magnetic encephalograms (MEG) and magnetic resonance images (MRI) of the head were obtained from the open archive Cam-CAN. The spatial distributions of elementary spectral components (MEG-based functional tomograms) were reconstructed from MEG for 501 subjects (248 males and 253 females, ages 18-88 years, mean age 54.

View Article and Find Full Text PDF

Acceptance of Evidence Transfer Within German Early Benefit Assessment of New Drugs for Pediatric and Adolescents Target Populations.

Pharmacoeconomics

January 2025

Institute for Health Services Research and Health Economics, Centre for Health and Society, Medical Faculty, Heinrich Heine University, Building: 12.49, Moorenstr. 5, 40225, Düsseldorf, Germany.

Background And Objective: In Germany, all new drugs undergo an early benefit assessment (EBA) by the decision-making body (G-BA). Due to limited access to clinical data in pediatric healthcare since 2017, evidence transfer has allowed for data from adult studies to be used in the EBA of pediatric drugs. This study examines the acceptance of evidence transfer, aiming to understand its correlation with granted added benefit.

View Article and Find Full Text PDF

Vortex beams are currently drawing a great deal of interest, from fundamental research to several promising applications. While their generation in bulky optical devices limits their use in integrated complex systems, metasurfaces have recently proven successful in creating optical vortices, especially in the linear regime. In the nonlinear domain, of strategic importance for the future of classical and quantum information, to date orbital angular momentum has only been created in qualitative ways, without discussing discrepancies between design and experimental results.

View Article and Find Full Text PDF

The Relationship Between the Neighbourhood Environment and Mental Health: Integrating Subjective and Objective Perspectives.

J Adv Nurs

January 2025

Center for Wise Information Technology of Mental Health Nursing Research, School of Nursing, Wuhan University, Wuhan, China.

Aims: To explore the relationship between neighbourhood environments and mental health by integrating subjective and objective perspectives.

Design: A cross-sectional study.

Methods: From September 2023 to January 2024, adult residents at the physical examination centers of two public hospitals in China completed measurements of subjective neighbourhood environment, depressive and anxiety symptoms, psychological stress, and socio-demographic characteristics.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!