We investigate the use of phonetic motor invariants (MIs), that is, recurring kinematic patterns of the human phonetic articulators, to improve automatic phoneme discrimination. Using a multi-subject database of synchronized speech and lips/tongue trajectories, we first identify MIs commonly associated with bilabial and dental consonants, and use them to simultaneously segment speech and motor signals. We then build a simple neural network-based regression schema (called Audio-Motor Map, AMM) mapping audio features of these segments to the corresponding MIs. Extensive experimental results show that (a) a small set of features extracted from the MIs, as originally gathered from articulatory sensors, are dramatically more effective than a large, state-of-the-art set of audio features, in automatically discriminating bilabials from dentals; (b) the same features, extracted from AMM-reconstructed MIs, are as effective as or better than the audio features, when testing across speakers and coarticulating phonemes; and dramatically better as noise is added to the speech signal. These results seem to support some of the claims of the motor theory of speech perception and add experimental evidence of the actual usefulness of MIs in the more general framework of automated speech recognition.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3164679PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0024055PLOS

Publication Analysis

Top Keywords

audio features
12
phonetic motor
8
motor invariants
8
improve automatic
8
automatic phoneme
8
phoneme discrimination
8
features extracted
8
mis
6
speech
5
features
5

Similar Publications

Integrating visual features has been proven effective for deep learning-based speech quality enhancement, particularly in highly noisy environments. However, these models may suffer from redundant information, resulting in performance deterioration when the signal-to-noise ratio (SNR) is relatively high. Real-world noisy scenarios typically exhibit widely varying noise levels.

View Article and Find Full Text PDF

Background: Chronic obstructive pulmonary disease (COPD) affects breathing, speech production, and coughing. We evaluated a machine learning analysis of speech for classifying the disease severity of COPD.

Methods: In this single centre study, non-consecutive COPD patients were prospectively recruited for comparing their speech characteristics during and after an acute COPD exacerbation.

View Article and Find Full Text PDF

Amplitude compression is an indispensable feature of contemporary audio production and especially relevant in modern hearing aids. The cortical fate of amplitude-compressed speech signals is not well-studied, however, and may yield undesired side effects: We hypothesize that compressing the amplitude envelope of continuous speech reduces neural tracking. Yet, leveraging such a 'compression side effect' on unwanted, distracting sounds could potentially support attentive listening if effectively reducing their neural tracking.

View Article and Find Full Text PDF

Background: Podcasts are an unconventional method of disseminating information through audio to the masses. They are an emerging portable technology and a valuable resource that provides unlimited access for promoting health among participants. Podcasts related to health care have been used as a source of medical education, but there is a dearth of studies on the use of podcasts as a source of health information.

View Article and Find Full Text PDF

Assay for Transposase-Accessible Chromatin with sequencing (ATAC-seq) is a powerful, high-throughput technique for assessing chromatin accessibility and understanding epigenomic regulation. Neutrophils, as a crucial leukocyte type in immune responses, undergo substantial chromatin architectural changes during differentiation and activation, which significantly impact the gene expression necessary for their functions. ATAC-seq has been instrumental in uncovering key transcription factors in neutrophil maturation, revealing pathogen-specific epigenomic signatures, and identifying therapeutic targets for autoimmune diseases.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!