Unlabelled: BACKGROUND Special consideration has recently been given to cepstral analysis with mel-frequency cepstral coefficients (MFCCs). The aim of this study was to assess the applicability of MFCCs in acoustic analysis for diagnosing occupational dysphonia in comparison to subjective and objective parameters of voice evaluation.

Materials And Methods: The study comprised 2 groups, one of 55 female teachers (mean age: 45 years) with occupational dysphonia confirmed by videostroboscopy and 40 female controls with normal voice (mean age: 43 years). The acoustic samples involving sustained vowels "a" and four standardized sentences were analyzed by computed analysis of MFCCs. The results were compared to acoustic parameters of jitter and shimmer groups, noise to harmonic ratio, Yanagihara index evaluating the grade of hoarseness, the aerodynamic parameter: maximum phonation time and also subjective parameters: GRBAS perceptual scale and Voice Handicap Index (VHI).

Results: The compared results revealed differences between the study and control groups, significant for MFCC2, MFCC3, MFCC5, MFCC6, MFCC8, MFCC10, particularly for MFCC6 (p < 0.001) and MFCC8 (p < 0.009), which may suggest their clinical applicability. In the study group, MFCC4, MFCC8 and MFCC10 correlated significantly with the major objective parameters of voice assessment. Moreover, MFCC8 coefficient, which in the female teachers correlated with all eight objective parameters, also showed the significant relation with perceptual voice feature A (asthenity) of subjective scale GRBAS, characteristic of weak tired voice.

Conclusions: The cepstral analysis with mel frequency cepstral coefficients is a promising tool for evaluating occupational voice disorders, capable of reflecting the perceptual voice features better than other methods of acoustic analysis.

Download full-text PDF

Source
http://dx.doi.org/10.13075/mp.5893.2013.0062DOI Listing

Publication Analysis

Top Keywords

cepstral coefficients
12
objective parameters
12
voice
8
cepstral analysis
8
acoustic analysis
8
occupational dysphonia
8
parameters voice
8
female teachers
8
age years
8
mfcc8 mfcc10
8

Similar Publications

This paper seeks to enhance the performance of Mel Frequency Cepstral Coefficients (MFCCs) for detecting abnormal heart sounds. Heart sounds are first pre-processed to remove noise and then segmented into S1, systole, S2, and diastole intervals, with thirteen MFCCs estimated from each segment, yielding 52 MFCCs per beat. Finally, MFCCs are used for heart sound classification.

View Article and Find Full Text PDF

Laryngeal disease classification using voice data: Octave-band vs. mel-frequency filters.

Heliyon

December 2024

Department of Industrial and Data Engineering, Hongik University, Seoul, South Korea.

Introduction: Laryngeal cancer diagnosis relies on specialist examinations, but non-invasive methods using voice data are emerging with artificial intelligence (AI) advancements. Mel Frequency Cepstral Coefficients (MFCCs) are widely used for voice analysis, but Octave Frequency Spectrum Energy (OFSE) may offer better accuracy in detecting subtle voice changes.

Problem Statement: Accurate early diagnosis of laryngeal cancer through voice data is challenging with current methods like MFCC.

View Article and Find Full Text PDF

Background: The age distribution of a mosquito population is a major determinant of its vectorial capacity. To contribute to disease transmission, a competent mosquito vector, carrying a pathogen, must live longer than the extrinsic incubation period of that pathogen to enable transmission to a new host. As such, determining the age of female mosquitoes is of significant interest for vector-borne diseases surveillance and control programs.

View Article and Find Full Text PDF

Accurate identification of coal and gangue is a crucial guarantee for efficient and safe mining of top coal caving face. This article proposes a coal-gangue recognition method based on an improved beluga whale optimization algorithm (IBWO), convolutional neural network, and long short-term memory network (CNN-LSTM) multi-modal fusion model. First, the mutation and memory library mechanisms are introduced into the beluga whale optimization to explore the solution space fully, prevent falling into local optimum, and accelerate the convergence process.

View Article and Find Full Text PDF

Sound-Based Unsupervised Fault Diagnosis of Industrial Equipment Considering Environmental Noise.

Sensors (Basel)

November 2024

Department of Naval Architecture and Ocean Engineering, Inha University, Incheon 22212, Republic of Korea.

The influence of environmental noise is generally excluded during research on machine fault diagnosis using acoustic signals. This study proposes a fault diagnosis method using a variational autoencoder (VAE) and domain adaptation neural network (DANN), both of which are based on unsupervised learning, to address this problem. The proposed method minimizes the impact of environmental noise and maintains the fault diagnosis performance in altered environments.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!