A Physiologically Inspired Model for Solving the Cocktail Party Problem.

J Assoc Res Otolaryngol

Hearing Research Center, Department of Biomedical Engineering, Boston University, 44 Cummington Mall, Room 412, Boston, MA, 02215, USA.

Published: December 2019

At a cocktail party, we can broadly monitor the entire acoustic scene to detect important cues (e.g., our names being called, or the fire alarm going off), or selectively listen to a target sound source (e.g., a conversation partner). It has recently been observed that individual neurons in the avian field L (analog to the mammalian auditory cortex) can display broad spatial tuning to single targets and selective tuning to a target embedded in spatially distributed sound mixtures. Here, we describe a model inspired by these experimental observations and apply it to process mixtures of human speech sentences. This processing is realized in the neural spiking domain. It converts binaural acoustic inputs into cortical spike trains using a multi-stage model composed of a cochlear filter-bank, a midbrain spatial-localization network, and a cortical network. The output spike trains of the cortical network are then converted back into an acoustic waveform, using a stimulus reconstruction technique. The intelligibility of the reconstructed output is quantified using an objective measure of speech intelligibility. We apply the algorithm to single and multi-talker speech to demonstrate that the physiologically inspired algorithm is able to achieve intelligible reconstruction of an "attended" target sentence embedded in two other non-attended masker sentences. The algorithm is also robust to masker level and displays performance trends comparable to humans. The ideas from this work may help improve the performance of hearing assistive devices (e.g., hearing aids and cochlear implants), speech-recognition technology, and computational algorithms for processing natural scenes cluttered with spatially distributed acoustic objects.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6889086PMC
http://dx.doi.org/10.1007/s10162-019-00732-4DOI Listing

Publication Analysis

Top Keywords

physiologically inspired
8
cocktail party
8
spatially distributed
8
spike trains
8
cortical network
8
inspired model
4
model solving
4
solving cocktail
4
party problem
4
problem cocktail
4

Similar Publications

In addition to the known therapeutic indications for cannabidiol, its administration by inhalation appears to be of great interest. Indeed, there is evidence of cannabidiol's efficacy in several physiological pathways, suggesting its potential for a wide range of applications for both local and systemic pulmonary administration like cancers. Significant advances in pulmonary drug delivery have led to innovative strategies to address the challenges of increasing the respirable fraction of drugs and standardizing inhalable products.

View Article and Find Full Text PDF

Negative Pressure Ventilation Ex-Situ Lung Perfusion Preserves Porcine and Human Lungs for 36-Hours.

Clin Transplant

January 2025

Division of Cardiac Surgery, Department of Surgery, Faculty of Medicine, University of Alberta, Edmonton, Canada.

Introduction: Preclinically, 24-hour continuous Ex-Situ Lung Perfusion (ESLP) is the longest duration achieved in large animal models and rejected human lungs. Here, we present our 36-hour Negative Pressure Ventilation (NPV)-ESLP protocol applied to porcine and rejected human lungs.

Methods: Five sets of donor domestic pig lungs (45-55 kg) underwent 36-hour NPV-ESLP.

View Article and Find Full Text PDF

Recently, there has been growing interest in knowing the best hygrometry level during high-flow nasal oxygen and non-invasive ventilation (NIV) and its potential influence on the outcome. Various studies have shown that breathing cold and dry air results in excessive water loss by nasal mucosa, reduced mucociliary clearance, increased airway resistance, reduced epithelial cell function, increased inflammation, sloughing of tracheal epithelium, and submucosal inflammation. With the Coronavirus Disease 2019 pandemic, using high-flow nasal oxygen with a heated humidifier has become an emerging form of non-invasive support among clinicians.

View Article and Find Full Text PDF

Artificial neural networks (ANNs) can help camera-based remote photoplethysmography (rPPG) in measuring cardiac activity and physiological signals from facial videos, such as pulse wave, heart rate and respiration rate with better accuracy. However, most existing ANN-based methods require substantial computing resources, which poses challenges for effective deployment on mobile devices. Spiking neural networks (SNNs), on the other hand, hold immense potential for energy-efficient deep learning owing to their binary and event-driven architecture.

View Article and Find Full Text PDF

The physiological sequelae of pre-term birth might influence the responses of this population to hypoxia. Moreover, identifying variables associated with development of acute mountain sickness (AMS) remains a key practically significant area of altitude research. We investigated the effects of pre-term birth on nocturnal oxygen saturation ( ) dynamics and assessed the predictive potential of nocturnal -related metrics for morning AMS in 12 healthy adults with gestational age < 32 weeks (pre-term) and 12 term-born control participants.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!