Estimation of phoneme-specific HMM topologies for the automatic recognition of dysarthric speech.

Comput Math Methods Med

Technological University of the Mixteca, Road to Acatlima K.m. 2.5, Huajuapan de León, 69000 Oaxaca, OAX, Mexico.

Published: April 2014

AI Article Synopsis

  • Dysarthria is a motor speech disorder often caused by neurological issues, impacting effective communication and quality of life for affected individuals.
  • Assistive technologies and speech applications are being developed to enhance spoken communication, particularly through improving speech recognition systems tailored for dysarthric speech.
  • The paper discusses a novel method using Genetic Algorithms to optimize Hidden Markov Model (HMM) topologies for different phonemes, resulting in significant recognition accuracy improvements over traditional single topology approaches.

Article Abstract

Dysarthria is a frequently occurring motor speech disorder which can be caused by neurological trauma, cerebral palsy, or degenerative neurological diseases. Because dysarthria affects phonation, articulation, and prosody, spoken communication of dysarthric speakers gets seriously restricted, affecting their quality of life and confidence. Assistive technology has led to the development of speech applications to improve the spoken communication of dysarthric speakers. In this field, this paper presents an approach to improve the accuracy of HMM-based speech recognition systems. Because phonatory dysfunction is a main characteristic of dysarthric speech, the phonemes of a dysarthric speaker are affected at different levels. Thus, the approach consists in finding the most suitable type of HMM topology (Bakis, Ergodic) for each phoneme in the speaker's phonetic repertoire. The topology is further refined with a suitable number of states and Gaussian mixture components for acoustic modelling. This represents a difference when compared with studies where a single topology is assumed for all phonemes. Finding the suitable parameters (topology and mixtures components) is performed with a Genetic Algorithm (GA). Experiments with a well-known dysarthric speech database showed statistically significant improvements of the proposed approach when compared with the single topology approach, even for speakers with severe dysarthria.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3816031PMC
http://dx.doi.org/10.1155/2013/297860DOI Listing

Publication Analysis

Top Keywords

dysarthric speech
12
spoken communication
8
communication dysarthric
8
dysarthric speakers
8
finding suitable
8
single topology
8
dysarthric
6
speech
6
topology
5
estimation phoneme-specific
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!