Machine learning in acoustics: Theory and applications.

Michael J Bianco Peter Gerstoft James Traer Emma Ozanich Marie A Roch Sharon Gannot Charles-Alban Deledalle

J Acoust Soc Am

Department of Electrical and Computer Engineering, University of California San Diego, La Jolla, California 92093, USA.

Published: November 2019

Acoustic data provide scientific and engineering insights in fields ranging from biology and communications to ocean and Earth science. We survey the recent advances and transformative potential of machine learning (ML), including deep learning, in the field of acoustics. ML is a broad family of techniques, which are often based in statistics, for automatically detecting and utilizing patterns in data. Relative to conventional acoustics and signal processing, ML is data-driven. Given sufficient training data, ML can discover complex relationships between features and desired labels or actions, or between features themselves. With large volumes of training data, ML can discover models describing complex acoustic phenomena such as human speech and reverberation. ML in acoustics is rapidly developing with compelling results and significant future promise. We first introduce ML, then highlight ML developments in four acoustics research areas: source localization in speech processing, source localization in ocean acoustics, bioacoustics, and environmental sounds in everyday scenes.

Download full-text PDF	Source
http://dx.doi.org/10.1121/1.5133944	DOI Listing

Publication Analysis

Top Keywords

machine learning

training data

data discover

source localization

acoustics

learning acoustics

acoustics theory

theory applications

applications acoustic

data

Similar Publications

Diagnosis of lung cancer using salivary miRNAs expression and clinical characteristics.

BMC Pulm Med

January 2025

Universal Scientific Education and Research Network (USERN), Tehran, Iran.

Negar Alizadeh Hoda Zahedi Maryam Koopaie Mahnaz Fatahzadeh Reza Mousavi

Objective: Lung cancer (LC), the primary cause for cancer-related death globally is a diverse illness with various characteristics. Saliva is a readily available biofluid and a rich source of miRNA. It can be collected non-invasively as well as transported and stored easily.

View Article and Find Full Text PDF

Similar Publications

HDN-DDI: a novel framework for predicting drug-drug interactions using hierarchical molecular graphs and enhanced dual-view representation learning.

BMC Bioinformatics

January 2025

School of Computer Science and Technology, University of Science and Technology of China, 443 Huangshan Road, Hefei, 230027, China.

Jinchen Sun Haoran Zheng

Background: Drug-drug interactions (DDIs) especially antagonistic ones present significant risks to patient safety, underscoring the urgent need for reliable prediction methods. Recently, substructure-based DDI prediction has garnered much attention due to the dominant influence of functional groups and substructures on drug properties. However, existing approaches face challenges regarding the insufficient interpretability of identified substructures and the isolation of chemical substructures.

View Article and Find Full Text PDF

Similar Publications

Predicting bullying victimization among adolescents using the risk and protective factor framework: a large-scale machine learning approach.

BMC Public Health

January 2025

Statistics, Brigham Young University, Provo, 84602, Utah, USA.

Ethan Low Joshua Monsen Lindsay Schow Rachel Roberts Lucy Collins

Background: Bullying, encompassing physical, psychological, social, or educational harm, affects approximately 1 in 20 United States teens aged 12-18. The prevalence and impact of bullying, including online bullying, necessitate a deeper understanding of risk and protective factors to enhance prevention efforts. This study investigated the key risk and protective factors most highly associated with adolescent bullying victimization.

View Article and Find Full Text PDF

Similar Publications

Age group classification based on optical measurement of brain pulsation using machine learning.

Sci Rep

January 2025

Research Unit of Health Sciences and Technology, University of Oulu, Oulu, Finland.

Martti Ilvesmäki Hany Ferdinando Kai Noponen Tapio Seppänen Vesa Korhonen

Optical techniques, such as functional near-infrared spectroscopy (fNIRS), contain high potential for the development of non-invasive wearable systems for evaluating cerebral vascular condition in aging, due to their portability and ability to monitor real-time changes in cerebral hemodynamics. In this study, thirty-six healthy adults were measured by single channel fNIRS to explore differences between two age groups using machine learning (ML). The subjects, measured during functional magnetic resonance imaging (fMRI) at Oulu University Hospital, were divided into young (age ≤ 32) and elderly (age ≥ 57) groups.

View Article and Find Full Text PDF

Similar Publications

Explainable vision transformer for automatic visual sleep staging on multimodal PSG signals.

NPJ Digit Med

January 2025

Graduate School of Data Science, Seoul National University, Seoul, Republic of Korea.

Hyojin Lee You Rim Choi Hyun Kyung Lee Jaemin Jeong Joopyo Hong

Polysomnography (PSG) is crucial for diagnosing sleep disorders, but manual scoring of PSG is time-consuming and subjective, leading to high variability. While machine-learning models have improved PSG scoring, their clinical use is hindered by the 'black-box' nature. In this study, we present SleepXViT, an automatic sleep staging system using Vision Transformer (ViT) that provides intuitive, consistent explanations by mimicking human 'visual scoring'.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!