A Two-Stage Cepstral Analysis Procedure for the Classification of Rough Voices.

J Voice

Department of Statistics, Penn State University, University Park, Pennsylvania. Electronic address:

Published: January 2020

Objectives: The objective of this study was to investigate the ability of a two-stage method of cepstral peak identification to effectively discriminate rough vs breathy vs typical voice in sustained vowel productions. It was hypothesized that a dual-stage search for cepstral peak prominences (CPP's) above and below specified quefrency/F cutoffs would result in a CPP difference that would be characteristic of the rough, diplophonic voice type.

Methodology: Central one-second portions of sustained vowel /a/ productions were obtained from 90 subjects (rough, breathy, and normophonic voices). All voice samples were analyzed using a a two-stage cepstral analysis process in which a CPP difference value was obtained by identifying cepstral peaks above and below a lower limit for expected F (150 Hz for females and 90 Hz for males), called CPP and CPP respectively.

Results: The CPP difference value was observed to be a highly significant predictor, with negative values for this parameter characteristic of a dominant subharmonic in the voice signal and the perception of diplophonic, rough voice. Correct classification of rough vs nonrough voice samples was 82.2% (sensitivity 0.80 and specificity 0.833). In the consideration of three group classification (breathy vs. normophonic vs. rough), models incorporating two predictors (the CPP obtained from a single search through a 60 to 300 Hz frequency range (CPP) and the CPP difference value) correctly classified 78.88% of the voice samples.

Conclusions: Rough, diplophonic voices were consistently observed to have a subharmonic peak that was greater in amplitude than the cepstral peak obtained within the region of the expected F, resulting in a negative value for the CPP difference. The two-stage cepstral analysis process described herein is visually intuitive from the graphical display of a cepstrum and is a simple extended calculation derived from cepstral analysis procedures that have been recommended as essential in the acoustic description of vocal quality.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jvoice.2018.07.003DOI Listing

Publication Analysis

Top Keywords

cpp difference
20
cepstral analysis
16
two-stage cepstral
12
cepstral peak
12
cpp
9
rough
8
classification rough
8
rough breathy
8
sustained vowel
8
rough diplophonic
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!