An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection.

Sensors (Basel)

Department of Computer Science and Statistics, Institute of Biosciences, Letters and Exact Sciences, São Paulo State University, São José do Rio Preto 15054-000, SP, Brazil.

Published: May 2023

Biometrics-based authentication has become the most well-established form of user recognition in systems that demand a certain level of security, including commonplace activities such as access to the work environment or to one's own bank account. Among all biometrics, voice receives special attention due to factors such as ease of collection, the low cost of reading devices, and the large body of literature and software packages available for use. However, the ability of this biometric to represent the individual may be impaired by the phenomenon known as dysphonia, which consists of a change in the sound signal due to some disease that acts on the vocal apparatus. As a consequence, a user with the flu, for example, may not be properly authenticated by the recognition system. Therefore, it is important that automatic voice dysphonia detection techniques be developed. In this work, we propose a new framework based on the representation of the voice signal by the multiple projection of cepstral coefficients to promote the detection of dysphonic alterations in the voice through machine learning techniques. Most of the best-known cepstral coefficient extraction techniques in the literature are mapped and analyzed, separately and together with measures related to the fundamental frequency of the voice signal, and their representation capacity is evaluated on three classifiers. Finally, experiments on a subset of the Saarbruecken Voice Database demonstrate the effectiveness of the proposed framework in detecting the presence of dysphonia in the voice.
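To make the two feature families named in the abstract more concrete, the following is a minimal sketch of a real cepstrum and a cepstrum-based fundamental frequency estimate. It is not the authors' pipeline: the function names (`dft`, `real_cepstrum`, `estimate_f0`), the 80 to 400 Hz search range, the 8 kHz sample rate, and the synthetic pulse train standing in for a voiced frame are all assumptions made for illustration. It uses a naive DFT so that it runs with the Python standard library alone.

```python
import cmath
import math


def dft(values, inverse=False):
    """Naive O(n^2) discrete Fourier transform (illustrative, not optimized)."""
    n = len(values)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t, v in enumerate(values):
            acc += v * cmath.exp(sign * 2j * math.pi * k * t / n)
        out.append(acc / n if inverse else acc)
    return out


def real_cepstrum(frame, eps=1e-10):
    """Real cepstrum: inverse DFT of the log magnitude spectrum."""
    spectrum = dft(frame)
    # eps avoids log(0) on empty frequency bins (an assumed floor value).
    log_mag = [math.log(abs(s) + eps) for s in spectrum]
    return [c.real for c in dft(log_mag, inverse=True)]


def estimate_f0(frame, sample_rate, f0_min=80.0, f0_max=400.0):
    """Estimate f0 from the strongest cepstral peak in a plausible pitch range."""
    cep = real_cepstrum(frame)
    q_lo = int(sample_rate / f0_max)  # shortest pitch period, in samples
    q_hi = int(sample_rate / f0_min)  # longest pitch period, in samples
    peak = max(range(q_lo, q_hi + 1), key=lambda q: cep[q])
    return sample_rate / peak


# Hypothetical test frame: a pulse train with one pulse every 64 samples,
# a crude stand-in for glottal pulses in a voiced frame.
sample_rate = 8000
period = 64
frame = [1.0 if t % period == 0 else 0.0 for t in range(256)]
f0 = estimate_f0(frame, sample_rate)
```

On this synthetic frame the cepstral peak falls at a quefrency of 64 samples, which at 8 kHz corresponds to 125 Hz. In a detection setting, the low-order cepstral coefficients and pitch-related measures of this kind would be computed per frame and passed to a classifier; real recordings would also need windowing and a fast FFT implementation.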

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10256083
DOI: http://dx.doi.org/10.3390/s23115196
