Attention-Based Joint Training of Noise Suppression and Sound Event Detection for Noise-Robust Classification.

Sensors (Basel)

Department of Electronic Engineering, Hanyang University, Seoul 04763, Korea.

Published: October 2021

AI Article Synopsis

  • The study focuses on improving sound event detection (SED) performance in noisy environments using a combination of specialized neural networks for noise suppression and classification.
  • The proposed approach utilizes a context codec method-equipped temporal convolutional network for noise suppression and a convolutional recurrent neural network for SED, achieving reduced model complexity without sacrificing performance.
  • Experimental results demonstrate that this method effectively enhances classification performance in various noisy conditions.

Article Abstract

Sound event detection (SED) recognizes the corresponding sound event of an incoming signal and estimates its temporal boundary. Although SED has been recently developed and used in various fields, achieving noise-robust SED in a real environment is typically challenging owing to the performance degradation due to ambient noise. In this paper, we propose combining a pretrained time-domain speech-separation-based noise suppression network (NS) and a pretrained classification network to improve the SED performance in real noisy environments. We use group communication with a context codec method (GC3)-equipped temporal convolutional network (TCN) for the noise suppression model and a convolutional recurrent neural network for the SED model. The former significantly reduce the model complexity while maintaining the same TCN module and performance as a fully convolutional time-domain audio separation network (Conv-TasNet). We also do not update the weights of some layers (i.e., freeze) in the joint fine-tuning process and add an attention module in the SED model to further improve the performance and prevent overfitting. We evaluate our proposed method using both simulation and real recorded datasets. The experimental results show that our method improves the classification performance in a noisy environment under various signal-to-noise-ratio conditions.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8540800PMC
http://dx.doi.org/10.3390/s21206718DOI Listing

Publication Analysis

Top Keywords

noise suppression
12
sound event
12
event detection
8
sed model
8
sed
6
performance
5
network
5
attention-based joint
4
joint training
4
noise
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!