Acoustic analysis of pathological voices compressed with MPEG system.

J Voice

Department of Basic and Clinical Psychology and Psychobiology, Universitat Jaume I, Castellon, Spain.

Published: June 2003

The MPEG-1 Layer 3 compression schema of audio signal, commonly known as mp3, has caused a great impact in recent years as it has reached high compression rates while conserving a high sound quality. Music and speech samples compressed at high bitrates are perceptually indistinguishable from the original samples, but very little was known about how compression acoustically affects the voice signal. A previous work with normal voices showed a high fidelity at high-bitrate compressions both in voice parameters and the amplitude-frequency spectrum. In the present work, dysphonic voices were tested through two studies. In the first study, spectrograms, long-term average spectra (LTAS), and fast Fourier transform (FFT) spectra of compressed and original samples of running speech were compared. In the second study, intensities, formant frequencies, formant bandwidths, and a multidimensional set of voice parameters were tested in a set of sustained phonations. Results showed that compression at high bitrates (96 and 128 kbps) preserved the relevant acoustic properties of the pathological voices. With compressions at lower bitrates, fidelity decreases, introducing some important alterations. Results from both works, Gonzalez and Cervera and this paper, open up the possibility of using MPEG-compression at high bitrates to store or transmit high-quality speech recordings, without altering their acoustic properties.

Download full-text PDF

Source
http://dx.doi.org/10.1016/s0892-1997(03)00007-9DOI Listing

Publication Analysis

Top Keywords

high bitrates
12
pathological voices
8
original samples
8
voice parameters
8
acoustic properties
8
high
6
acoustic analysis
4
analysis pathological
4
voices
4
voices compressed
4

Similar Publications

This study investigates the potential of long-wave infrared (LWIR) free-space optical (FSO) transmission using multilevel signals to achieve high spectral efficiency. The FSO transmission system includes a directly modulated-quantum cascade laser (DM-QCL) operating at 9.1 µm and a mercury cadmium telluride (MCT) detector.

View Article and Find Full Text PDF

Unipolar quantum optoelectronics for high speed direct modulation and transmission in 8-14 µm atmospheric window.

Nat Commun

September 2024

Laboratoire de Physique de l'ENS, Département de Physique, École Normale Supérieure, Université PSL, Sorbonne Université, Université Paris Cité, CNRS, 75005, Paris, France.

The large mid-infrared (MIR) spectral region, ranging from 2.5 µm to 25 µm, has remained under-exploited in the electromagnetic spectrum, primarily due to the absence of viable transceiver technologies. Notably, the 8-14 µm long-wave infrared (LWIR) atmospheric transmission window is particularly suitable for free-space optical (FSO) communication, owing to its combination of low atmospheric propagation loss and relatively high resilience to turbulence and other atmospheric disturbances.

View Article and Find Full Text PDF

In this paper, a novel interference-based nanostructure was designed and simulated to realize an all-optical 2-bit reversible comparator by employing a novel technique. The plane wave expansion (PWE) method was adopted to analyze the encoder design and frequency modes. Aside from downsizing, the finite-difference time-domain (FDTD) method was utilized for the simulation and numerical analysis of the design proposed herein.

View Article and Find Full Text PDF

Overcoming laser phase noise for low-cost coherent optical communication.

Nat Commun

July 2024

State Key Laboratory of Advanced Optical Communication Systems and Networks, Frontiers Science Center for Nano-optoelectronics, School of Electronics, Peking University, Beijing, 100871, China.

Artificial-intelligence-generated content has driven explosive data traffic growth in data-center interconnects. Traditional direct detection solutions struggle with limited spectral efficiency and distance, prompting the shift to coherent optics for cost-sensitive short-reach links. One specific challenge is integrating low-cost lasers while overcoming severe phase noise on high-order modulation formats.

View Article and Find Full Text PDF

Ophthalmic three-dimensional (3D) digital surgery can reproduce high-definition surgical images; however, 3D digital surgery is limited by recording capacities. We examined the relationship between the minimum image quality required to reproduce surgical images and recording capacity. Patients who underwent simultaneous vitrectomy and cataract surgery by the same surgeon using a 3D digital surgery system at Juntendo University Urayasu Hospital between February and October 2021 were evaluated.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!