Two signal-processing algorithms, derived from those described by Stubbs and Summerfield [R.J. Stubbs and Q. Summerfield, J. Acoust. Soc. Am. 84, 1236-1249 (1988)], were used to separate the voiced speech of two talkers speaking simultaneously, at similar intensities, in a single channel. Both algorithms use fundamental frequency (FO) as the basis for segregation. One attenuates the interfering voice by filtering the cepstrum of the signal. The other is a hybrid algorithm that combines cepstral filtering with the technique of harmonic selection [T.W. Parsons, J. Acoust. Soc. Am. 60, 911-918 (1976)]. The algorithms were evaluated and compared in perceptual experiments involving listeners with normal hearing and listeners with cochlear hearing impairments. In experiment 1 the processing was used to separate voiced sentences spoken on a monotone. Both algorithms gave significant increases in intelligibility to both groups of listeners. The improvements were equivalent to an increase of 3-4 dB in the effective signal-to-noise ratio (SNR). In experiment 2 the processing was used to separate voiced sentences spoken with time-varying intonation. For normal-hearing listeners, cepstral filtering gave a significant increase in intelligibility, while the hybrid algorithm gave an increase that was on the margins of significance (p = 0.06). The improvements were equivalent to an increase of 2-3 dB in the effective SNR. For impaired listeners, no intelligibility improvements were demonstrated with intoned sentences. The decrease in performance for intoned material is attributed to limitations of the algorithms when FO is nonstationary.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1121/1.399257 | DOI Listing |
Healthcare (Basel)
January 2025
Department of Computer Science, Institute of Mathematics and Statistics, University of São Paulo (USP), São Paulo 05508-220, SP, Brazil.
Background/objectives: The aim of this paper was to compare voice and speech characteristics between post-COVID-19 and control subjects. The hypothesis was that acoustic parameters of voice and speech may differentiate subjects infected by COVID-19 from control subjects. Additionally, we expected to observe the persistence of symptoms in women.
View Article and Find Full Text PDFJ Voice
January 2025
Department of Audio, Video, and Electronic Forensics, Academy of Forensic Science, Shanghai, China; Shanghai Forensic Service Platform, Key Laboratory of Forensic Science, Ministry of Justice, Shanghai, China.
Drug abuse can cause severe damage to the human speech organs. The vocal folds are one of the important speech organs that produce voice through vibration when airflow passes through. Previous studies have reported the negative effects of drugs on speech organs, including the vocal folds, but there is still limited research on relevant field.
View Article and Find Full Text PDFEar Hear
December 2024
Center for Hearing Research, Boys Town National Research Hospital, Omaha, Nebraska, USA.
Objectives: To investigate the influence of frequency-specific audibility on audiovisual benefit in children, this study examined the impact of high- and low-pass acoustic filtering on auditory-only and audiovisual word and sentence recognition in children with typical hearing. Previous studies show that visual speech provides greater access to consonant place of articulation than other consonant features and that low-pass filtering has a strong impact on perception on acoustic consonant place of articulation. This suggests visual speech may be particularly useful when acoustic speech is low-pass filtered because it provides complementary information about consonant place of articulation.
View Article and Find Full Text PDFJ Voice
January 2025
Department of Communication Sciences and Disorders, Bowling Green State University, Bowling Green, OH.
Objectives: This study aimed to identify voice instabilities across registration shifts produced by untrained female singers and describe them relative to changes in fundamental frequency, airflow, intensity, inferred adduction, and acoustic spectra.
Study Design: Multisignal descriptive study.
Methods: Five untrained female singers sang up to 30 repetitions of octave scales.
J Voice
January 2025
School of Behavioral and Brain Sciences, Department of Speech, Language, and Hearing, Callier Center for Communication Disorders, University of Texas at Dallas, Richardson, TX; Department of Otolaryngology - Head and Neck Surgery, University of Texas Southwestern Medical Center, Dallas, TX. Electronic address:
Introduction: Patients with primary muscle tension dysphonia (pMTD) commonly report symptoms of vocal effort, fatigue, discomfort, odynophonia, and aberrant vocal quality (eg, vocal strain, hoarseness). However, voice symptoms most salient to pMTD have not been identified. Furthermore, how standard vocal fatigue and vocal tract discomfort indices that capture persistent symptoms-like the Vocal Fatigue Index (VFI) and Vocal Tract Discomfort Scale (VTDS)-relate to acute symptoms experienced at the time of the voice evaluation is unclear.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!