Background: Changes in voice are a symptom of Parkinson's disease and used to assess the progression of the condition. However, natural differences in the voices of people can make this challenging. Computerized binary speech classification can identify people with PD (PwPD), but its multiclass application to detect the severity of the disease remains difficult.
Method: This study investigated six diadochokinetic (DDK) tasks, four features (phonation, articulation, prosody, and their fusion), and three machine learning models for four severity levels of PwPD. The four binary classifications were: (i) Normal vs Not Normal, (ii) Slight vs Not Slight, (iii) Mild vs Not Mild and (iv) Moderate vs. Not Moderate. The best task and features for each class were identified and the models were ensembled to develop a multiclass model to distinguish between Normal vs. Slight vs. Mild vs. Moderate.
Results: For Normal vs Not-normal, logistic regression (LR) using the prosody from "ka-ka-ka" task, Random Forest (RF) using articulation from "petaka" for Slight vs Not Slight, RF for the fusion from "ka-ka-ka" for Mild vs Not Mild and Gradient Boosting (GB) using prosody from "ta-ta-ta" for Moderate vs Not Moderate gave the best results. Combining these using LR achieved an accuracy of 72%.
Conclusion: Dividing the multiclass problem into four binary problems gives the optimum speech features for each class. This pilot study, conducted on a small public dataset, shows the potential of computerized speech analysis using DDK to evaluate the severity of Parkinson's disease voice symptoms.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.compbiomed.2024.109565 | DOI Listing |
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!