Stuttering is a neuro-developmental speech disorder that interrupts the flow of speech due to involuntary pauses and sound repetitions. It has profound psychological impacts that affect social interactions and professional advancements. Automatically detecting stuttering events in speech recordings could assist speech therapists or speech pathologists track the fluency of people who stutter (PWS). It will also assist in the improvement of the existing speech recognition system for PWS. In this paper, the SEP-28k dataset is utilized to perform comparative analysis to assess the performance of various machine learning models in classifying the five dysfluency types namely Prolongation, Interjection, Word Repetition, Sound Repetition and Blocks.•The study focuses on automatically detecting stuttering events in speech recordings to support speech therapists and improve speech recognition systems for people who stutter (PWS).•The SEP-28k dataset is used to perform a comparative analysis of different machine learning models.•The research examines the impact of key acoustic features on model accuracy while addressing challenges such as class imbalance.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11625209PMC
http://dx.doi.org/10.1016/j.mex.2024.103050DOI Listing

Publication Analysis

Top Keywords

machine learning
12
speech
9
automatically detecting
8
detecting stuttering
8
stuttering events
8
events speech
8
speech recordings
8
speech therapists
8
people stutter
8
speech recognition
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!