This study explores building and improving an automatic speech recognition (ASR) system for children aged 6-9 years and diagnosed with autism spectrum disorder (ASD), language impairment (LI), or both. Working with only 1.5 hours of target data in which children perform the Clinical Evaluation of Language Fundamentals Recalling Sentences task, we apply deep neural network (DNN) weight transfer techniques to adapt a large DNN model trained on the LibriSpeech corpus of adult speech. To begin, we aim to find the best proportional training rates of the DNN layers. Our best configuration yields a 29.38% word error rate (WER). Using this configuration, we explore the effects of quantity and similarity of data augmentation in transfer learning. We augment our training with portions of the OGI Kids' Corpus, adding 4.6 hours of typically developing speakers aged kindergarten through 3 grade. We find that 2 grade data alone - approximately the mean age of the target data - outperforms other grades and all the sets combined. Doubling the data for 1, 2, and 3 grade, we again compare each grade as well as pairs of grades. We find the combination of 1 and 2 grade performs best at a 26.21% WER.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7575194PMC
http://dx.doi.org/10.21437/Interspeech.2019-3161DOI Listing

Publication Analysis

Top Keywords

language impairment
8
transfer techniques
8
target data
8
data
5
grade
5
improving asr
4
asr systems
4
systems children
4
children autism
4
autism language
4

Similar Publications

Purpose: Speech-language pathologists (SLPs) are tasked with integrating the principles of evidence-based practice (EBP) to provide effective and efficient assessment and intervention services that best support clients and their families. As new research, technologies, and perspectives emerge, SLPs are required to adapt their clinical practices to meet these changes while maintaining high-quality evidence-based services. Through an illustrative case study, we aim to demonstrate the process of applying EBP principles - including research evidence, client and family perspectives, and clinical expertise - to a complexity-based speech sound intervention delivered via telepractice.

View Article and Find Full Text PDF

Prevalence and predictors of obstructive sleep apnea in patients with bipolar disorder: A systematic review and meta-analysis.

Pak J Med Sci

January 2025

Kailong Gu Department of Geriatric Psychiatry, Huzhou Third Municipal Hospital, The Affiliated Hospital of Huzhou University, Huzhou, Zhejiang Province 313000, China.

Background & Objective: Obstructive sleep apnea (OSA) has been increasingly recognized as a comorbidity in many psychiatric disorders, including bipolar disorder (BD). This study aimed to synthesize existing evidence to determine the frequency of OSA in patients diagnosed with BD and identify potential predictors of its occurrence.

Methods: PubMed, Scopus, CENTRAL (Cochrane Central Register of Controlled Trials), and Google Scholar databases were searched for English-language papers published up from 1 January 1960 to 31 October 2023 that reported incidences of OSA in patients with BP and provided sufficient data for quantitative analysis.

View Article and Find Full Text PDF

Introduction: People with schizophrenia spectrum disorders present with language dysfunctions, yet we know little about their use of reference markers (indefinite markers, definite markers, pronouns or names), a fundamental aspect of efficient speech production.

Methods: Twenty-five (25) participants with a recent-onset schizophrenia spectrum disorder (SZ) and 25 healthy controls (HC) completed two referential communication tasks. The tasks involved presenting to an interaction partner a series of movie characters (character identification task) and movie scenes composed of six images (narration task).

View Article and Find Full Text PDF

Introduction: As a hallmark feature of amyotrophic lateral sclerosis (ALS), bulbar involvement significantly impacts psychosocial, emotional, and physical health. A validated objective marker is however lacking to characterize and phenotype bulbar involvement, positing a major barrier to early detection, progress monitoring, and tailored care. This study aimed to bridge this gap by constructing a multiplex functional mandibular muscle network to provide a novel objective measurement tool of bulbar involvement.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!