Real-time single-channel deep neural network-based speech enhancement on edge devices.

Interspeech

Department of Electrical and Computer Engineering, The University of Texas at Dallas, Richardson, TX-75080, USA.

Published: October 2020

In this paper, we present a deep neural network architecture comprising of both convolutional neural network (CNN) and recurrent neural network (RNN) layers for real-time single-channel speech enhancement (SE). The proposed neural network model focuses on enhancing the noisy speech magnitude spectrum on a frame-by-frame process. The developed model is implemented on the smartphone (edge device), to demonstrate the real-time usability of the proposed method. Perceptual evaluation of speech quality (PESQ) and short-time objective intelligibility (STOI) test results are used to compare the proposed algorithm to previously published conventional and deep learning-based SE methods. Subjective ratings show the performance improvement of the proposed model over the other baseline SE methods.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8064406PMC
http://dx.doi.org/10.21437/Interspeech.2020-1901DOI Listing

Publication Analysis

Top Keywords

neural network
16
real-time single-channel
8
deep neural
8
speech enhancement
8
neural
5
single-channel deep
4
neural network-based
4
speech
4
network-based speech
4
enhancement edge
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!