Harnessing Voice Analysis and Machine Learning for Early Diagnosis of Parkinson's Disease: A Comparative Study Across Three Datasets.

J Voice

Center of Innovation, Technology and Education (CITE) at Anhembi Morumbi University - Anima Institute, São José dos Campos, São Paulo, Brazil; Arena235 Research Lab, São José dos Campos, São Paulo, Brazil. Electronic address:

Published: May 2024

Objective: This study evaluates the efficacy of voice analysis combined with machine learning (ML) techniques in enabling the diagnosis of Parkinson's disease (PD).

Methods: Voice data, phonation of the vowel "a," from three distinct datasets (two from the University of California Irvine ML Repository and one from figshare) for 432 participants (278 PD patients) were analyzed. We employed four ML models-Artificial Neural Networks, Random Forest, Gradient Boosting (GB), and Support Vector Machine (SVM)-alongside two ensemble methods (soft voting classifier-Ensemble Voting Classifier and stacking method-Ensemble Stacking Model (ESM)). The models underwent 50 iterations of evaluation, involving various data splits and 10-fold cross-validation. Comparative analysis was done using one-way Analysis of Variance followed by Bonferroni posthoc corrections.

Results: The ESM, SVM, and GB models emerged as the top performers, demonstrating superior performance across metrics, including accuracy, sensitivity, specificity, precision, F1 score, and area under the receiver operating characteristic curve (ROC AUC). Despite data heterogeneity and variable selection limitations, the models showed high values for all metrics.

Conclusions: ML integration with voice analysis, mainly through ESM, SVM, and GB, is promising for early PD diagnosis. Using multi-source data and a large sample size enhances our findings' validity, reliability, and generalizability.

Significance: Integrating advanced ML techniques with voice analysis demonstrates substantial potential for improving early PD detection, offering valuable tools for speech-language pathologists (SLPs). These findings provide clinically relevant insights that can be applied within the scope of SLP practice to refine diagnostic processes and facilitate early intervention.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jvoice.2024.04.020DOI Listing

Publication Analysis

Top Keywords

voice analysis
16
machine learning
8
early diagnosis
8
diagnosis parkinson's
8
parkinson's disease
8
esm svm
8
analysis
6
harnessing voice
4
analysis machine
4
early
4

Similar Publications

Background: Head and neck cancer (HNC) is amongst the 10 most common cancers worldwide and has a major effect on patients' quality of life. Given the complexity of this unique group of patients, a multidisciplinary team approach is preferable. Amongst the debilitating sequels of HNC and/or its treatment, swallowing, speech and voice impairments are prevalent and require the involvement of speech-language pathologists (SLPs).

View Article and Find Full Text PDF

Background: Effective staff-to-staff and patient-provider communication in the Emergency Department (ED) is essential for safe, quality care. Routine wearing of Personal-Protective-Equipment (PPE) has introduced new challenges to communication. We aimed to understand the perspectives of ED staff about communicating while wearing PPE, and to identify factors contributing to communication success, breakdown, and repair.

View Article and Find Full Text PDF

A client-server based recognition system: Non-contact single/multiple emotional and behavioral state assessment methods.

Comput Methods Programs Biomed

December 2024

School of Communication and Information Engineering, Shanghai University, 200444, Shanghai, China. Electronic address:

Background And Objectives: In the current global health landscape, there is an increasing demand for rapid and accurate assessment of mental states. Traditional assessment methods typically rely on face-to-face interactions, which are not only time-consuming but also highly subjective. Addressing this issue, this study aims to develop a client-server-based, non-contact multimodal emotion and behavior recognition system to enhance the efficiency and accuracy of mental state assessments.

View Article and Find Full Text PDF

Acoustic Exaggeration Enhances Speech Discrimination in Young Autistic Children.

Autism Res

December 2024

Psychiatry and Addictology Department, CIUSSS-NIM Research Center, University of Montreal, Montreal, Quebec, Canada.

Child-directed speech (CDS), which amplifies acoustic and social features of speech during interactions with young children, promotes typical phonetic and language development. In autism, both behavioral and brain data indicate reduced sensitivity to human speech, which predicts absent, decreased, or atypical benefits of exaggerated speech signals such as CDS. This study investigates the impact of exaggerated fundamental frequency (F0) and voice-onset time on the neural processing of speech sounds in 22 Chinese-speaking autistic children aged 2-7 years old with a history of speech delays, compared with 25 typically developing (TD) peers.

View Article and Find Full Text PDF

Background: Understanding advocacy strategies is essential to improving dementia awareness, reducing stigma, supporting cognitive health promotion, and influencing policy to support people living with dementia. However, there is a dearth of evidence-based research on advocacy strategies used to support dementia awareness.

Objective: This study aimed to use posts from X (formerly known as Twitter) to understand dementia advocacy strategies during World Alzheimer's Awareness Month in September 2022.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!