Voice Signal Typing Using a Pattern Recognition Approach.

J Voice

Instituto de Investigación y Desarrollo en Bioingeniería y Bioinformática, UNER-CONICET, Oro Verde, Entre Ríos, Argentina.

Published: January 2022

Voice signal classification in three types according to their degree of periodicity, a task known as signal typing, is a relevant preprocessing step before computing any perturbation measures. However, it is a time consuming and subjective activity. This has given rise to interest in automatic systems that use objective measures to distinguish among the different signal types. The purpose of this paper is twofold. First, to propose a pattern recognition approach for automatic voice signal typing based on a multi-class linear Support Vector Machine, and using rather well-known parameters like Jitter, Shimmer, Harmonic-to-Noise Ratio, and Cepstral Prominence Peak in combination with nonlinear dynamics measures. Two novel features are also proposed as objective parameters. Second, to validate this approach using a large amount of signals coming from two well-known corpora using cross-dataset experiments to assess the generalizability of the system. A total amount of 1262 signals labeled by professional voice pathologists were used with this purpose. Statistically significant differences between all types were found for all features. Accuracies over 82.71% were estimated in all intra-datasets and inter-datasets using cross-validation. Finally, the use of posterior probabilities is proposed as a measure of the reliability of the assigned type. This could help clinicians to make a more informed decision about the type assigned to a voice. These outcomes suggest that the proposed approach can successfully discriminate among the three voice types, paving the way to a fully automatic tool for voice signal typing in the future.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jvoice.2020.03.006DOI Listing

Publication Analysis

Top Keywords

voice signal
16
signal typing
16
pattern recognition
8
recognition approach
8
voice
7
signal
5
typing
4
typing pattern
4
approach
4
approach voice
4

Similar Publications

Fine social discrimination of siblings in mice: Implications for early detection of Alzheimer's disease.

Neurobiol Dis

January 2025

Centre de Recherches sur la Cognition Animale, Centre de Biologie Intégrative, Université de Toulouse, CNRS, UPS, 31062, France. Electronic address:

The ability to distinguish between individuals is crucial for social species and supports behaviors such as reproduction, hierarchy formation, and cooperation. In rodents, social discrimination relies on memory and the recognition of individual-specific cues, known as "individual signatures". While olfactory signals are central, other sensory cues - such as auditory, visual, and tactile inputs - also play a role.

View Article and Find Full Text PDF

Hindko is a language primarily spoken in Northwestern areas of Pakistan. Approximately eight million people speak the Hindko language. According to its native speakers, it is 7 largest language of Pakistan and 2 largest language of Khyber Pakhtunkhwa.

View Article and Find Full Text PDF

Purpose: Mental health screening is recommended by the US Preventive Services Task Force for all patients in areas where treatment options are available. Still, it is estimated that only 4% of primary care patients are screened for depression. The goal of this study was to evaluate the efficacy of machine learning technology (Kintsugi Voice, v1, Kintsugi Mindful Wellness, Inc) to detect and analyze voice biomarkers consistent with moderate to severe depression, potentially allowing for greater compliance with this critical primary care public health need.

View Article and Find Full Text PDF

Affective voice signaling has significant biological and social relevance across various species, and different affective signaling types have emerged through the evolution of voice communication. These types range from basic affective voice bursts and nonverbal affective up to affective intonations superimposed on speech utterances in humans in the form of paraverbal prosodic patterns. These different types of affective signaling should have evolved to be acoustically and perceptually distinctive, allowing accurate and nuanced affective communication.

View Article and Find Full Text PDF

Introduction: Spinal cord injury is a physiological disruption often caused by trauma, leading to severe physical and psychological effects, including irreversible impairment and disability. Cervical injuries, particularly between C1 and C8, are the most severe, potentially causing diaphragm paralysis and requiring mechanical ventilation. Reduced respiratory muscle strength not only affects respiratory function but also significantly impacts voice, speech, and communication, which are crucial for quality of life.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!