This work proposes a method to reconstruct an acoustic speech signal solely from a stream of mel-frequency cepstral coefficients (MFCCs) as may be encountered in a distributed speech recognition (DSR) system. Previous methods for speech reconstruction have required, in addition to the MFCC vectors, fundamental frequency and voicing components. In this work the voicing classification and fundamental frequency are predicted from the MFCC vectors themselves using two maximum a posteriori (MAP) methods. The first method enables fundamental frequency prediction by modeling the joint density of MFCCs and fundamental frequency using a single Gaussian mixture model (GMM). The second scheme uses a set of hidden Markov models (HMMs) to link together a set of state-dependent GMMs, which enables a more localized modeling of the joint density of MFCCs and fundamental frequency. Experimental results on speaker-independent male and female speech show that accurate voicing classification and fundamental frequency prediction is attained when compared to hand-corrected reference fundamental frequency measurements. The use of the predicted fundamental frequency and voicing for speech reconstruction is shown to give very similar speech quality to that obtained using the reference fundamental frequency and voicing.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1121/1.1953269 | DOI Listing |
Open Res Eur
January 2025
Center for Innovative Research and Liaison, Wakayama University, Wakayama, Wakayama Prefecture, Japan.
The purpose of this paper is to make easily available to the scientific community an efficient voice morphing tool called STRAIGHTMORPH and provide a short tutorial on its use with examples. STRAIGHTMORPH consists of a set of Matlab functions allowing the generation of high-quality, parametrically-controlled morphs of an arbitrary number of voice samples. A first step consists in extracting an 'mObject' for each voice sample, with accurate tracking of the fundamental frequency contour and manual definition of Time and Frequency anchors corresponding across samples to be morphed.
View Article and Find Full Text PDFGeroscience
January 2025
Keldysh Institute of Applied Mathematics, Russian Academy of Sciences, Moscow, 125047, Russia.
Age-related dependencies of electric and spectral powers in conventional frequency bands were studied by the newly proposed method of detailed spectral analysis. The magnetic encephalograms (MEG) and magnetic resonance images (MRI) of the head were obtained from the open archive Cam-CAN. The spatial distributions of elementary spectral components (MEG-based functional tomograms) were reconstructed from MEG for 501 subjects (248 males and 253 females, ages 18-88 years, mean age 54.
View Article and Find Full Text PDFPharmacoeconomics
January 2025
Institute for Health Services Research and Health Economics, Centre for Health and Society, Medical Faculty, Heinrich Heine University, Building: 12.49, Moorenstr. 5, 40225, Düsseldorf, Germany.
Background And Objective: In Germany, all new drugs undergo an early benefit assessment (EBA) by the decision-making body (G-BA). Due to limited access to clinical data in pediatric healthcare since 2017, evidence transfer has allowed for data from adult studies to be used in the EBA of pediatric drugs. This study examines the acceptance of evidence transfer, aiming to understand its correlation with granted added benefit.
View Article and Find Full Text PDFLight Sci Appl
January 2025
Laboratoire Matériaux et Phénomènes Quantiques, Université Paris Cité and CNRS, Paris, 75013, France.
Vortex beams are currently drawing a great deal of interest, from fundamental research to several promising applications. While their generation in bulky optical devices limits their use in integrated complex systems, metasurfaces have recently proven successful in creating optical vortices, especially in the linear regime. In the nonlinear domain, of strategic importance for the future of classical and quantum information, to date orbital angular momentum has only been created in qualitative ways, without discussing discrepancies between design and experimental results.
View Article and Find Full Text PDFJ Adv Nurs
January 2025
Center for Wise Information Technology of Mental Health Nursing Research, School of Nursing, Wuhan University, Wuhan, China.
Aims: To explore the relationship between neighbourhood environments and mental health by integrating subjective and objective perspectives.
Design: A cross-sectional study.
Methods: From September 2023 to January 2024, adult residents at the physical examination centers of two public hospitals in China completed measurements of subjective neighbourhood environment, depressive and anxiety symptoms, psychological stress, and socio-demographic characteristics.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!