Estimation of Speech Features Using a Wearable Inertial Sensor.

Zuyu Du, Yaodan Xu, Xinsheng Yu, Sen Wang, Lin Xu

J Voice

School of Information Science and Technology, ShanghaiTech University, Shanghai, China; Shanghai Frontiers Science Center of Human-centered Artificial Intelligence, Shanghai, China; MoE Key Lab of Intelligent Perception and Human-Machine Collaboration (ShanghaiTech University), Shanghai, China.

Published: October 2024

Speech features have been investigated as novel digital biomarkers for many psychiatric and neurocognitive diseases. Microphones are the most commonly used devices for speech recording but inevitably suffer from disadvantages such as privacy leakage and environmental noise, which limit their clinical application, particularly for long-term ambulatory monitoring. The aim of the present study is therefore to explore the feasibility of extracting speech features from the acceleration recorded on the sternum. Ten healthy subjects volunteered for the study. Each subject performed two speech tasks, namely repeating one sentence 20 times and reading 20 different sentences, with each task repeated eight times at different speech rates and loudness levels. Voice signals and speech-induced chest vibrations were recorded simultaneously by a microphone and an accelerometer placed on the sternum. Forty-two acoustic features and six time-related prosodic features were extracted from both signals using a standard toolbox and then compared by linear fit and correlation analysis. Good agreement between the acceleration features and the microphone features was observed in all six time-related prosodic features for both tasks, but in only 19 and 17 acoustic features for tasks 1 and 2, respectively, most of them loudness- or pitch-related. These results suggest that sternum acceleration tracks time-related speech prosody, loudness, and pitch very well, demonstrating the feasibility of deriving digital biomarkers from the acceleration signal for diseases strongly related to time-related prosodic and loudness features.
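
The comparison step described in the abstract (a linear fit plus a correlation analysis between the same feature measured from the two sensors) could be sketched roughly as follows. This is only an illustration under stated assumptions: the function name compare_features, the choice of Pearson correlation, and all sample values are hypothetical and do not come from the study, which does not specify its toolbox, fitting procedure, or data here.

```python
import math


def compare_features(mic, acc):
    """Fit acc ~ slope * mic + intercept by least squares and return
    (slope, intercept, r), where r is the Pearson correlation coefficient.

    Hypothetical helper: the abstract only states that features were
    "compared by a linear fit and correlation analysis".
    """
    if len(mic) != len(acc) or len(mic) < 2:
        raise ValueError("need two equal-length sequences with >= 2 values")
    n = len(mic)
    mean_x = sum(mic) / n
    mean_y = sum(acc) / n
    # Sums of squared deviations and cross-deviations
    sxx = sum((x - mean_x) ** 2 for x in mic)
    syy = sum((y - mean_y) ** 2 for y in acc)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(mic, acc))
    if sxx == 0 or syy == 0:
        raise ValueError("a constant feature has no defined fit or correlation")
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / math.sqrt(sxx * syy)
    return slope, intercept, r


# Invented values for one time-related feature (e.g. a speech-rate measure)
# across eight repetitions; these are NOT data from the paper.
mic_rate = [3.1, 3.8, 4.4, 5.0, 5.7, 6.2, 6.9, 7.5]
acc_rate = [3.0, 3.9, 4.3, 5.1, 5.6, 6.3, 6.8, 7.6]

slope, intercept, r = compare_features(mic_rate, acc_rate)
print(f"slope={slope:.3f} intercept={intercept:.3f} r={r:.3f}")
```

Under this reading, a slope near 1, an intercept near 0, and a correlation coefficient near 1 would indicate that the accelerometer-derived feature agrees well with its microphone counterpart. The thresholds the authors used to call agreement "good" are not given in the abstract.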

DOI: http://dx.doi.org/10.1016/j.jvoice.2024.09.012

Similar Publications

Sex differences in mitochondrial free-carnitine levels in subjects at-risk and with Alzheimer's disease in two independent study cohorts.

Mol Psychiatry

January 2025

Department of Psychiatry, New York University Grossman School of Medicine, New York, NY, USA.

Benedetta Bigio, Ricardo A S Lima-Filho, Olivia Barnhill, Felipe K Sudo, Claudia Drummond

A major challenge in the development of more effective therapeutic strategies for Alzheimer's disease (AD) is the identification of molecular mechanisms linked to specific pathophysiological features of the disease. Importantly, AD has a two-fold higher incidence in women than in men and a protracted prodromal phase characterized by amnestic mild cognitive impairment (aMCI), suggesting that biological processes occurring early can initiate vulnerability to AD. Here, we used a sample of 125 subjects from two independent study cohorts to determine the plasma levels (plasma being the most accessible specimen) of two essential mitochondrial markers, acetyl-L-carnitine (LAC) and its derivative free-carnitine. The work was motivated by a mechanistic model in rodents in which targeting mitochondrial metabolism of LAC ameliorates cognitive function and boosts epigenetic mechanisms of gene expression.

Outcomes of Tongue Reduction Surgery in Beckwith-Wiedemann Syndrome: A Systematic Review.

J Craniofac Surg

January 2025

Division of Plastic Surgery, Department of Surgery, Dalhousie University.

Beraki Abraha, Oliva Macintyre, Hannah Brennan, Paul Hong, Michael Bezuhly

Introduction: Macroglossia is a frequent clinical feature of Beckwith-Wiedemann syndrome (BWS), a congenital overgrowth disorder. Macroglossia can lead to abnormal breathing, feeding, speech, and dentoskeletal development. Partial glossectomy is a common intervention aimed at reducing these abnormalities.

Phonological and speech motor abilities in children with childhood apraxia of speech.

Clin Linguist Phon

January 2025

Centre for Language and Cognition, Groningen University, Groningen, The Netherlands.

Dora Knežević, Ben Maassen

Childhood apraxia of speech (CAS) is a motor speech disorder in which the precision and consistency of speech sounds are impaired due to deficits in motor planning and programming. The literature on CAS suggests that the clinical features of CAS cannot be limited to one level of speech processing and that a more comprehensive understanding of how all levels involved in speech production are part of a complex system is needed. The aim of this study was to investigate the relationship between phonological and speech motor abilities in children with CAS and to determine the extent to which speech motor performance accounts for phonological processing in children with CAS.

Conflicting Evidence for a Motor Timing Theory of Stuttering: Choral Speech Changes the Rhythm of Both Neurotypical and Stuttering Talkers, but in Opposite Directions.

J Speech Lang Hear Res

January 2025

Institute of Cognitive Neuroscience, University College London, United Kingdom.

Sophie Meekings, Lotte Eijk, Stefany Stankova, Santosh Maruthy, Sophie Kerttu Scott

Purpose: Talking in unison with a partner, otherwise known as choral speech, reliably induces fluency in people who stutter (PWS). This effect may arise because choral speech addresses a hypothesized motor timing deficit by giving PWS an external rhythm to align with and scaffold their utterances onto. This study tested this theory by comparing the choral speech rhythm of people who do and do not stutter to assess whether both groups change their rhythm in similar ways when talking chorally.

Correction: Sage et al. Hybridization of Acoustic and Visual Features of Polish Sibilants Produced by Children for Computer Speech Diagnosis. 2024, , 5360.

Sensors (Basel)

December 2024

Faculty of Biomedical Engineering, Silesian University of Technology, Roosevelta 40, 41-800 Zabrze, Poland.

Agata Sage, Zuzanna Miodońska, Michał Kręcichwost, Paweł Badura

In the original publication [...
