Voice signal classification in three types according to their degree of periodicity, a task known as signal typing, is a relevant preprocessing step before computing any perturbation measures. However, it is a time consuming and subjective activity. This has given rise to interest in automatic systems that use objective measures to distinguish among the different signal types. The purpose of this paper is twofold. First, to propose a pattern recognition approach for automatic voice signal typing based on a multi-class linear Support Vector Machine, and using rather well-known parameters like Jitter, Shimmer, Harmonic-to-Noise Ratio, and Cepstral Prominence Peak in combination with nonlinear dynamics measures. Two novel features are also proposed as objective parameters. Second, to validate this approach using a large amount of signals coming from two well-known corpora using cross-dataset experiments to assess the generalizability of the system. A total amount of 1262 signals labeled by professional voice pathologists were used with this purpose. Statistically significant differences between all types were found for all features. Accuracies over 82.71% were estimated in all intra-datasets and inter-datasets using cross-validation. Finally, the use of posterior probabilities is proposed as a measure of the reliability of the assigned type. This could help clinicians to make a more informed decision about the type assigned to a voice. These outcomes suggest that the proposed approach can successfully discriminate among the three voice types, paving the way to a fully automatic tool for voice signal typing in the future.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.jvoice.2020.03.006 | DOI Listing |
Neurobiol Dis
January 2025
Centre de Recherches sur la Cognition Animale, Centre de Biologie Intégrative, Université de Toulouse, CNRS, UPS, 31062, France. Electronic address:
The ability to distinguish between individuals is crucial for social species and supports behaviors such as reproduction, hierarchy formation, and cooperation. In rodents, social discrimination relies on memory and the recognition of individual-specific cues, known as "individual signatures". While olfactory signals are central, other sensory cues - such as auditory, visual, and tactile inputs - also play a role.
View Article and Find Full Text PDFData Brief
February 2025
Department of Information & Communication Technology, University of Agder (UiA), Norway.
Hindko is a language primarily spoken in Northwestern areas of Pakistan. Approximately eight million people speak the Hindko language. According to its native speakers, it is 7 largest language of Pakistan and 2 largest language of Khyber Pakhtunkhwa.
View Article and Find Full Text PDFAnn Fam Med
January 2025
Departments of Psychiatry and Emergency Medicine, University of Arkansas for Medical Sciences, Little Rock, Arkansas.
Purpose: Mental health screening is recommended by the US Preventive Services Task Force for all patients in areas where treatment options are available. Still, it is estimated that only 4% of primary care patients are screened for depression. The goal of this study was to evaluate the efficacy of machine learning technology (Kintsugi Voice, v1, Kintsugi Mindful Wellness, Inc) to detect and analyze voice biomarkers consistent with moderate to severe depression, potentially allowing for greater compliance with this critical primary care public health need.
View Article and Find Full Text PDFEmotion
January 2025
Department of Psychology, Cognitive and Affective Neuroscience Unit, University of Zurich.
Affective voice signaling has significant biological and social relevance across various species, and different affective signaling types have emerged through the evolution of voice communication. These types range from basic affective voice bursts and nonverbal affective up to affective intonations superimposed on speech utterances in humans in the form of paraverbal prosodic patterns. These different types of affective signaling should have evolved to be acoustically and perceptually distinctive, allowing accurate and nuanced affective communication.
View Article and Find Full Text PDFJ Spinal Cord Med
January 2025
Speech-Language-Hearing Sciences, Medical School, Federal University of Minas Gerais, Belo Horizonte, Brazil.
Introduction: Spinal cord injury is a physiological disruption often caused by trauma, leading to severe physical and psychological effects, including irreversible impairment and disability. Cervical injuries, particularly between C1 and C8, are the most severe, potentially causing diaphragm paralysis and requiring mechanical ventilation. Reduced respiratory muscle strength not only affects respiratory function but also significantly impacts voice, speech, and communication, which are crucial for quality of life.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!