Biometrics-based authentication has become the most well-established form of user recognition in systems that demand a certain level of security. For example, the most commonplace social activities stand out, such as access to the work environment or to one's own bank account. Among all biometrics, voice receives special attention due to factors such as ease of collection, the low cost of reading devices, and the high quantity of literature and software packages available for use. However, these biometrics may have the ability to represent the individual impaired by the phenomenon known as dysphonia, which consists of a change in the sound signal due to some disease that acts on the vocal apparatus. As a consequence, for example, a user with the flu may not be properly authenticated by the recognition system. Therefore, it is important that automatic voice dysphonia detection techniques be developed. In this work, we propose a new framework based on the representation of the voice signal by the multiple projection of cepstral coefficients to promote the detection of dysphonic alterations in the voice through machine learning techniques. Most of the best-known cepstral coefficient extraction techniques in the literature are mapped and analyzed separately and together with measures related to the fundamental frequency of the voice signal, and its representation capacity is evaluated on three classifiers. Finally, the experiments on a subset of the Saarbruecken Voice Database prove the effectiveness of the proposed material in detecting the presence of dysphonia in the voice.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10256083 | PMC |
http://dx.doi.org/10.3390/s23115196 | DOI Listing |
Ther Adv Neurol Disord
December 2024
Neurology Unit, Department of Neuroscience and Mental Health, Fondazione IRCCS Ca' Granda Ospedale Maggiore Policlinico, Dino Ferrari Centre, Milan, Italy.
Indian J Otolaryngol Head Neck Surg
December 2024
Department of Gastroenterology, Christian Medical College, Vellore, Tamil Nadu 632004 India.
Objective: To study the prevalence of synchronous oesophageal cancer in patients with head and neck cancers using Narrow Band Imaging and Lugol's chromoendoscopy.
Materials And Methods: Study design: Prospective cross sectional diagnostic study. Method: 63 recruited patients with head and neck cancers, underwent haematologic evaluation, histological confirmation, imaging which included contrast enhanced computerised tomography(CECT) of the Neck and when indicated an additional Magnetic Resonance Imaging(MRI) scan followed by UGI endoscopy using white light followed by Narrow Band Imaging(NBI) and Lugol's chromoendoscopy(LCE).
Front Digit Health
October 2024
Department of Otolaryngology, University of South Florida, Tampa, FL, United States.
touchREV Endocrinol
October 2024
CaTaLiNA: Cancer de tiroides en Latinoamerica, Quito, Ecuador.
: Amyloid goiter (AG) is a rare cause of thyroid swelling, characterized by deposits of amyloid protein in the thyroid tissue. It can be associated with primary or secondary amyloidosis. Its prevalence in multinodular goiter cases is 0.
View Article and Find Full Text PDFJ Voice
November 2024
Department of Otolaryngology-Head and Neck Surgery, Mayo Clinic, 4500 San Pablo Road, Jacksonville, Florida, 32224.
Objective: This study aimed to evaluate the influence of voice therapy on maximum phonation time (MPT) and S:Z ratio in patients diagnosed with primary muscle tension dysphonia (pMTD). The goal was to investigate whether pMTD is associated with reduced S:Z ratio and prolonged MPT.
Study Design: Prospective cohort study.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!