Perceptual Error Identification of Human and Synthesized Voices.

J Voice

Department of Speech Language Pathology and Audiology, Universidade Federal de São Paulo, São Paulo, Brazil; Voice Department, Centro de Estudos da Voz-CEV, São Paulo, Brazil.

Published: September 2016

Objectives/hypothesis: To verify listeners' ability to discriminate between human and synthesized voice samples.

Study Design: This is a prospective study.

Methods: A total of 70 subjects, 20 voice specialist speech-language pathologists (V-SLPs), 20 general SLPs (G-SLPs), and 30 naive listeners (NLs), participated in a listening task, which was simply to classify the stimuli as human or synthesized. Samples of 36 voices, 18 human and 18 synthesized vowels, male and female (9 each), with different types and degrees of deviation, were presented with 50% repetition to verify intrarater consistency. Human voices were collected from a vocal clinic database. Voice disorders were simulated by perturbations of vocal frequency, jitter (roughness), additive noise (breathiness), and by increasing tension and decreasing separation of the vocal folds (strain).
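The stimulus design described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the abstract does not say how the repeated samples were chosen or ordered, nor how intrarater consistency was scored, so the random selection, the shuffling, and the percent-agreement measure below (along with the names `build_playlist` and `intrarater_agreement`) are hypothetical.

```python
import random

def build_playlist(n_unique=36, repeat_fraction=0.5, seed=0):
    """Return a shuffled playlist of stimulus ids plus the ids that repeat.

    Assumption: repeats are drawn at random and mixed into the playlist;
    the abstract only states that 50% of the samples were repeated.
    """
    rng = random.Random(seed)
    stimuli = list(range(n_unique))
    repeated = rng.sample(stimuli, int(n_unique * repeat_fraction))
    playlist = stimuli + repeated
    rng.shuffle(playlist)
    return playlist, repeated

def intrarater_agreement(first, second):
    """Fraction of repeated stimuli given the same label on both hearings.

    Assumption: simple percent agreement; the study may have used a
    different consistency statistic.
    """
    same = sum(first[s] == second[s] for s in second)
    return same / len(second)

playlist, repeated = build_playlist()
print(len(playlist), len(repeated))
```

Under these assumptions each listener would judge 54 presentations: the 36 unique samples plus 18 repeats.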

Results: The mean error rate across all groups was 37.8%: 31.9% for V-SLP, 39.3% for G-SLP, and 40.8% for NL. V-SLP had a smaller mean percentage error for synthesized (24.7%), breathy (36.7%), synthesized breathy (30.8%), tense (25%), and female (27.5%) voices. G-SLP and NL presented equal mean percentage error across all voice classifications. Taken together, the groups showed no difference in mean percentage error between human and synthesized voices (P value = 0.452).
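As a quick arithmetic check, the overall error rate is consistent with a mean of the three group rates weighted by group size. The sketch below uses only the figures reported in the abstract; treating the overall figure as a size-weighted mean is an assumption, since the abstract does not state how it was computed, and the helper name `weighted_mean_error` is illustrative.

```python
# Group sizes and mean percentage errors as reported in the abstract.
group_sizes = {"V-SLP": 20, "G-SLP": 20, "NL": 30}
group_error = {"V-SLP": 31.9, "G-SLP": 39.3, "NL": 40.8}

def weighted_mean_error(sizes, errors):
    """Mean of the group error rates, weighted by the number of listeners."""
    total = sum(sizes.values())
    return sum(sizes[g] * errors[g] for g in sizes) / total

overall = weighted_mean_error(group_sizes, group_error)
print(round(overall, 1))
```

The weighted sum is 2648 over 70 listeners, which rounds to the reported 37.8%.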

Conclusions: The quality of the synthesized samples was very high. V-SLP presented a lower error rate, which allows us to infer that auditory training assists in vocal analysis tasks.


Source
http://dx.doi.org/10.1016/j.jvoice.2015.07.017


