Efficient blind dereverberation and echo cancellation based on independent component analysis for actual acoustic signals.

Neural Comput

Department of Intelligence Science and Technology, Graduate School of Informatics, Kyoto University, Kyoto 606-8501, Japan.

Published: January 2012

This letter presents a new algorithm for blind dereverberation and echo cancellation based on independent component analysis (ICA) for actual acoustic signals. We focus on frequency-domain ICA (FD-ICA) because its computational cost and speed of learning convergence are reasonable for practical applications such as hands-free speech recognition. When conventional FD-ICA is applied as a preprocessing step for automatic speech recognition in noisy environments, one of the most critical problems is how to cope with reverberation. To extract a clean signal from the reverberant observation, we model the separation process in the short-time Fourier transform domain and apply the multiple input/output inverse-filtering theorem (MINT) to the FD-ICA separation model. A naive implementation of this method is computationally expensive because its time complexity is quadratic in the reverberation time. The main issue in dereverberation is therefore to reduce the high computational cost of ICA. In this letter, we reduce the computational complexity to linear in the reverberation time by using two techniques: (1) a separation model based on the independence of delayed observed signals with MINT and (2) spatial sphering for preprocessing. Experiments show that the computational cost grows linearly with the reverberation time and that our method improves the word correctness of automatic speech recognition by 10 to 20 points in a reverberant environment with RT₂₀ = 670 ms.
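The abstract names spatial sphering as a preprocessing step but does not give its formulation. As a rough illustration only, the sketch below shows generic spatial sphering (whitening) of multichannel STFT frames in a single frequency bin, using an eigendecomposition of the sample spatial covariance. The function name `spatial_sphere`, the array layout, and the eigenvalue floor are assumptions made for this example; it is not taken from the letter and may differ from the authors' method.

```python
import numpy as np

def spatial_sphere(X, floor=1e-12):
    """Whiten multichannel observations in one frequency bin (illustrative).

    X: complex array of shape (n_mics, n_frames), assumed layout.
    Returns (Z, V) where Z = V @ (X - mean) has identity sample covariance.
    """
    # Remove the per-channel mean across frames.
    Xc = X - X.mean(axis=1, keepdims=True)
    # Sample spatial covariance (Hermitian, n_mics x n_mics).
    R = (Xc @ Xc.conj().T) / Xc.shape[1]
    # Eigendecomposition of a Hermitian matrix: R = E diag(lam) E^H.
    lam, E = np.linalg.eigh(R)
    # Floor tiny eigenvalues to avoid division by zero (assumed safeguard).
    lam = np.maximum(lam, floor)
    # Sphering matrix V = E diag(lam^{-1/2}) E^H.
    V = (E / np.sqrt(lam)) @ E.conj().T
    return V @ Xc, V
```

After sphering, the channels are decorrelated with unit variance, so a subsequent ICA update only has to estimate a unitary separation matrix, which is the usual reason whitening speeds up convergence.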

Source: http://dx.doi.org/10.1162/NECO_a_00219
