The temporal characteristics of speech can be captured by examining the distributions of the durations of measurable speech components, namely speech segment durations and pause durations. However, several barriers prevent the easy analysis of pause durations: The first problem is that natural speech is noisy, and although recording contrived speech minimizes this problem, it also discards diagnostic information about cognitive processes inherent in the longer pauses associated with natural speech. The second issue concerns setting the distribution threshold, and consists of the problem of appropriately classifying pause segments as either short pauses reflecting articulation or long pauses reflecting cognitive processing, while minimizing the overall classification error rate. This article describes a fully automated system for determining the locations of speech-pause transitions and estimating the temporal parameters of both speech and pause distributions in natural speech. We use the properties of Gaussian mixture models at several stages of the analysis, in order to identify theoretical components of the data distributions, to classify speech components, to compute durations, and to calculate the relevant statistics.

Download full-text PDF

Source
http://dx.doi.org/10.3758/s13428-012-0222-0DOI Listing

Publication Analysis

Top Keywords

natural speech
12
speech
10
automated system
8
speech components
8
pause durations
8
pauses reflecting
8
durations
5
fluency profiling
4
profiling system
4
system automated
4

Similar Publications

EEG involves recording electrical activity generated by the brain through electrodes placed on the scalp. Imagined speech classification has emerged as an essential area of research in brain-computer interfaces (BCIs). Despite significant advances, accurately classifying imagined speech signals remains challenging due to their complex and non-stationary nature.

View Article and Find Full Text PDF

Using the syntactic priming paradigm, this study investigated abstract syntactic knowledge of Chinese transitive structures (i.e., subject-verb-object [SVO], BA, and BEI) in deaf children with cochlear implants (CIs).

View Article and Find Full Text PDF

Natural language processing-based classification of early Alzheimer's disease from connected speech.

Alzheimers Dement

January 2025

Laboratory for Cognitive Neurology, Department of Neurosciences, Leuven Brain Institute, KU Leuven, Leuven, Belgium.

Introduction: The automated analysis of connected speech using natural language processing (NLP) emerges as a possible biomarker for Alzheimer's disease (AD). However, it remains unclear which types of connected speech are most sensitive and specific for the detection of AD.

Methods: We applied a language model to automatically transcribed connected speech from 114 Flemish-speaking individuals to first distinguish early AD patients from amyloid negative cognitively unimpaired (CU) and then amyloid negative from amyloid positive CU individuals using five different types of connected speech.

View Article and Find Full Text PDF

Knowledge of the natural history of deficiency disorder (CDD) is limited to the results of cross-sectional analysis of largely pediatric cohorts. Assessment of outcomes in adulthood is critical for clinical decision-making and future precision medicine approaches but is challenging because of the diagnostic gap and duration of follow-up that would be required for prospective studies. We aimed to delineate the natural history retrospectively from adulthood.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!