We present a method for speech enhancement of data collected in extremely noisy environments, such as those obtained during magnetic resonance imaging (MRI) scans. We propose an algorithm based on dictionary learning to perform this enhancement. We use complex nonnegative matrix factorization with intra-source additivity (CMF-WISA) to learn dictionaries of the noise and speech+noise portions of the data and use these to factor the noisy spectrum into estimated speech and noise components. We augment the CMF-WISA cost function with spectral and temporal regularization terms to improve the noise modeling. Based on both objective and subjective assessments, we find that our algorithm significantly outperforms traditional techniques such as Least Mean Squares (LMS) filtering, while not requiring prior knowledge or specific assumptions such as periodicity of the noise waveforms that current state-of-the-art algorithms require.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6157637PMC
http://dx.doi.org/10.1109/TASLP.2018.2800280DOI Listing

Publication Analysis

Top Keywords

dictionary learning
8
spectral temporal
8
temporal regularization
8
acoustic denoising
4
denoising dictionary
4
learning spectral
4
regularization method
4
method speech
4
speech enhancement
4
enhancement data
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!