A nonlinear speech feature extraction algorithm was developed by modeling human cochlear functions, and demonstrated as a noise-robust front-end for speech recognition systems. The algorithm was based on a model of the Organ of Corti in the human cochlea with such features as such as basilar membrane (BM), outer hair cells (OHCs), and inner hair cells (IHCs). Frequency-dependent nonlinear compression and amplification of OHCs were modeled by lateral inhibition to enhance spectral contrasts. In particular, the compression coefficients had frequency dependency based on the psychoacoustic evidence. Spectral subtraction and temporal adaptation were applied in the time-frame domain. With long-term and short-term adaptation characteristics, these factors remove stationary or slowly varying components and amplify the temporal changes such as onset or offset. The proposed features were evaluated with a noisy speech database and showed better performance than the baseline methods such as mel-frequency cepstral coefficients (MFCCs) and RASTA-PLP in unknown noisy conditions.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.neunet.2013.02.006 | DOI Listing |
Nat Commun
January 2025
Los Alamos National Laboratory, EES-17 National Security Earth Science, Los Alamos, NM, 87545, USA.
Significant progress has been made in probing the state of an earthquake fault by applying machine learning to continuous seismic waveforms. The breakthroughs were originally obtained from laboratory shear experiments and numerical simulations of fault shear, then successfully extended to slow-slipping faults. Here we apply the Wav2Vec-2.
View Article and Find Full Text PDFInt J Audiol
January 2025
Department of Otolaryngology, Vanderbilt University Medical Center, Nashville, TN, USA.
Objectives: An improvement in speech perception is a major well-documented benefit of cochlear implantation (CI), which is commonly discussed with CI candidates to set expectations. However, a large variability exists in speech perception outcomes. We evaluated the accuracy of clinical predictions of post-CI speech perception scores.
View Article and Find Full Text PDFOtolaryngol Head Neck Surg
January 2025
Department of Otolaryngology-Head and Neck Surgery, Columbia University Vagelos College of Physicians and Surgeons, NewYork-Presbyterian/Columbia University Irving Medical Center, New York, New York, USA.
Objective: Hearing loss (HL) is associated with depression, but existing datasets are limited by the type of data available for both hearing and mental health conditions. The purpose of this study is to determine if there is an association between HL and depressive disorders within a large bi-institutional electronic health record (EHR) system containing more granular diagnostic information.
Study Design: Cross-sectional epidemiologic study.
Sensors (Basel)
January 2025
SHCCIG Yubei Coal Industry Co., Ltd., Xi'an 710900, China.
The coal mining industry in Northern Shaanxi is robust, with a prevalent use of the local dialect, known as "Shapu", characterized by a distinct Northern Shaanxi accent. This study addresses the practical need for speech recognition in this dialect. We propose an end-to-end speech recognition model for the North Shaanxi dialect, leveraging the Conformer architecture.
View Article and Find Full Text PDFJ Clin Med
January 2025
Assistance Publique-Hôpitaux de Paris, Hôpital Bicêtre, Service d'Oto-Rhino-Laryngologie, 78 Rue du Général Leclerc, 94270 Le Kremlin-Bicêtre, France.
Hearing aids (HAs) have been used for standard high-frequency hearing loss and tinnitus, but their effects on speech intelligibility in noise (SIN) in people with normal hearing, including hidden hearing loss (HHL), have been little explored. We included in a prospective cohort study patients who experience poor SIN and have normal pure tone average in quiet conditions or slight HL. We used open-fit HAs.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!