Voice disorders are increasingly common among groups such as voice professionals, elderly people and smokers, which underlines the importance of pathological voice repair. Previous work on pathological voice repair addressed only the sustained vowel /a/; repairing multiple vowels remains challenging because pitch extraction is unstable and formant reconstruction is unsatisfactory. This paper proposes a multiple-vowel repair method for voice disorders based on pitch extraction and the Line Spectrum Pair (LSP) feature, which broadens the scope of voice repair from the single vowel /a/ to the vowels /a/, /i/ and /u/ and repairs all three successfully. A deep neural network is used as a classifier to distinguish normal from pathological voices. Wavelet Transform and Hilbert-Huang Transform are applied for pitch extraction, and the formant is reconstructed from the LSP feature. The final repaired voice is obtained by synthesizing the pitch and the formant. The proposed method is validated on the Saarbrücken Voice Database (SVD). The improvements achieved on three metrics, Segmental Signal-to-Noise Ratio, LSP distance measure and Mel cepstral distance measure, are 45.87%, 50.37% and 15.56%, respectively. In addition, an intuitive spectrogram-based analysis shows a prominent repair effect.
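As a rough illustration of the first metric named above, the following is a minimal sketch of a Segmental Signal-to-Noise Ratio computation in its commonly used form: the per-frame SNR in dB, clamped to a fixed range and averaged over frames. The function name `segmental_snr`, the frame length of 256 samples, and the clamping range of -10 dB to 35 dB are assumptions chosen for illustration; the abstract does not state the exact settings used in the paper.

```python
import math


def segmental_snr(reference, estimate, frame_len=256, min_db=-10.0, max_db=35.0):
    """Average per-frame SNR in dB between a reference and an estimated signal.

    frame_len, min_db and max_db are illustrative assumptions, not values
    taken from the paper.
    """
    n = min(len(reference), len(estimate))
    n_frames = n // frame_len  # trailing samples that do not fill a frame are ignored
    if n_frames == 0:
        raise ValueError("signals are shorter than one frame")

    eps = 1e-12  # guards against division by zero and log of zero
    total_db = 0.0
    for k in range(n_frames):
        start = k * frame_len
        ref = reference[start:start + frame_len]
        est = estimate[start:start + frame_len]
        signal_energy = sum(r * r for r in ref)
        noise_energy = sum((r - e) ** 2 for r, e in zip(ref, est))
        snr_db = 10.0 * math.log10((signal_energy + eps) / (noise_energy + eps))
        # Clamp each frame so silent or perfectly matched frames do not dominate.
        total_db += min(max(snr_db, min_db), max_db)
    return total_db / n_frames
```

In a repair setting, one plausible use would be to compute this value between a normal reference voice and the pathological voice, then again between the reference and the repaired voice, and compare the two; whether the paper evaluates it exactly this way is not stated in the abstract.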

Source
http://dx.doi.org/10.1109/JBHI.2020.2978103

Publication Analysis

Top Keywords

multiple vowels: 16
vowels repair: 12
pitch extraction: 12
spectrum pair: 12
voice disorder: 12
voice repair: 12
voice: 9
repair: 8
repair based: 8
based pitch: 8

Similar Publications

Background and Aims: An unanticipated difficult airway is one of the greatest challenges for anesthesiologists. Proper preoperative airway assessment is crucial to reducing complications. However, current screening tests based on anthropometric features are of uncertain benefit.

Participants tend to produce a higher or lower vocal pitch in response to upward or downward visual motion, suggesting a pitch-motion correspondence between the visual and speech production processes. However, previous studies were contaminated by factors such as the meaning of vocalized words and the intrinsic pitch or tongue movements associated with the vowels. To address these issues, we examined the pitch-motion correspondence between simple visual motion and pitched speech production.

Purpose: Speech disorders associated with velopharyngeal dysfunction (VPD) are common. Some require surgical management, while others are responsive to speech therapy. This is related to whether the speech error is obligatory (passive) or compensatory (active).

Introduction: Parkinson's patients with dysarthria often suffer from multiple impairments in speech subsystems, including phonation. The Acoustic Voice Quality Index (AVQI) may be considered as a predictor of the onset and severity of Parkinson's disease.

Objective: To investigate the AVQI in Persian-speaking Parkinson's patients compared with healthy controls, and its association with disease severity based on the Unified Parkinson's Disease Rating Scale-Part III (UPDRS-III) and with dysarthria severity.

Purpose: The objective of the present study is to investigate nasal and oral vowel production in French-speaking children with cochlear implants (CIs) and children with typical hearing (TH). Vowel nasality relies primarily on acoustic cues that may be less effectively transmitted by the implant. The study investigates how children with CIs manage to produce these segments in French, a language with contrastive vowel nasalization.