Can a Machine Distinguish High and Low Amount of Social Creak in Speech?

J Voice

Department of Information and Communications Engineering, Aalto University, Espoo, Finland. Electronic address:

Published: October 2024

Objectives: Increased prevalence of social creak particularly among female speakers has been reported in several studies. The study of social creak has been previously conducted by combining perceptual evaluation of speech with conventional acoustical parameters such as the harmonic-to-noise ratio and cepstral peak prominence. In the current study, machine learning (ML) was used to automatically distinguish speech of low amount of social creak from speech of high amount of social creak.

Methods: The amount of creak in continuous speech samples produced in Finnish by 90 female speakers was first perceptually assessed by two voice specialists. Based on their assessments, the speech samples were divided into two categories (low vs high amount of creak). Using the speech signals and their creak labels, seven different ML models were trained. Three spectral representations were used as feature for each model.

Results: The results show that the best performance (accuracy of 71.1%) was obtained by the following two systems: an Adaboost classifier using the mel-spectrogram feature and a decision tree classifier using the mel-frequency cepstral coefficient feature.

Conclusions: The study of social creak is becoming increasingly popular in sociolinguistic and vocological research. The conventional human perceptual assessment of the amount of creak is laborious and therefore ML technology could be used to assist researchers studying social creak. The classification systems reported in this study could be considered as baselines in future ML-based studies on social creak.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jvoice.2024.09.050DOI Listing

Publication Analysis

Top Keywords

social creak
28
amount social
12
amount creak
12
creak
11
low amount
8
social
8
female speakers
8
study social
8
creak speech
8
high amount
8

Similar Publications

Can a Machine Distinguish High and Low Amount of Social Creak in Speech?

J Voice

October 2024

Department of Information and Communications Engineering, Aalto University, Espoo, Finland. Electronic address:

Objectives: Increased prevalence of social creak particularly among female speakers has been reported in several studies. The study of social creak has been previously conducted by combining perceptual evaluation of speech with conventional acoustical parameters such as the harmonic-to-noise ratio and cepstral peak prominence. In the current study, machine learning (ML) was used to automatically distinguish speech of low amount of social creak from speech of high amount of social creak.

View Article and Find Full Text PDF

A Core Outcome Set for Efficacy of Acute Treatment of Hereditary Angioedema.

J Allergy Clin Immunol Pract

June 2024

Department of Vascular Medicine, Amsterdam Cardiovascular Sciences, Amsterdam UMC, University of Amsterdam, Amsterdam, the Netherlands. Electronic address:

Background: Clinical trials investigating drugs for the acute treatment of hereditary angioedema attacks have assessed many different outcomes. This heterogeneity limits the comparability of trial results and may lead to selective outcome reporting bias and a high burden on trial participants.

Objective: To achieve consensus on a core outcome set composed of key outcomes that ideally should be used in all clinical efficacy trials involving the acute treatment of hereditary angioedema attacks.

View Article and Find Full Text PDF

Creaky voice, a non-modal aperiodic phonation that is often associated with low pitch targets, has been found to not only correlate linguistically with prosodic boundary, tonal categories, and pitch range, but also socially with age, gender, and social status. However, it is still not clear whether co-varying factors such as prosodic boundary, pitch range, and tone could, in turn, affect listeners' identification of creak. To fill this gap, this current study examines how creaky voice is identified in Mandarin through experimental data, aiming to enhance our understanding of cross-linguistic perception of creaky voice and, more broadly, speech perception in multi-variable contexts.

View Article and Find Full Text PDF

Everyday observations indicate that creaky voice has become common in Finland in recent years. Previous studies suggest that this trend is also occurring in other countries. This cross-sectional study investigates the use of creaky voice among Finnish university students from the 1990's to the 2010's.

View Article and Find Full Text PDF

Does the Acoustic Voice Quality Index (AVQI) Correlate with Perceived Creak and Strain in Normophonic Young Adult Finnish Females?

Folia Phoniatr Logop

January 2022

Speech and Voice Research Laboratory, Faculty of Social Sciences, Tampere University, Tampere, Finland.

Background: The Acoustic Voice Quality Index (AVQI) is a correlate of dysphonia. It has been found to differentiate between dysphonic and normophonic speakers and to indicate the effects of voice therapy. This study investigates how the AVQI reacts towards creak and strain, which are common in normophonic speakers.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!