Phonetic acquisition in cortical dynamics, a computational approach.

PLoS One

Universidad de Buenos Aires, Facultad de Ingeniería, Instituto de Ingeniería Biomédica, Ciudad Autónoma de Buenos Aires, Argentina.

Published: February 2020

Many computational theories have been developed to improve artificial phonetic classification performance from linguistic auditory streams. However, less attention has been given to psycholinguistic data and neurophysiological features recently found in cortical tissue. We focus on a context in which basic linguistic units, such as phonemes, are extracted and robustly classified by humans and other animals from complex acoustic streams in speech data. We are especially motivated by the fact that 8-month-old human infants can segment words from fluent audio streams based exclusively on the statistical relationships between neighboring speech sounds, without any kind of supervision. In this paper, we introduce a biologically inspired and fully unsupervised neurocomputational approach that incorporates key neurophysiological and anatomical cortical properties, including columnar organization, spontaneous micro-columnar formation, adaptation to contextual activations, and Sparse Distributed Representations (SDRs) produced by means of partial N-Methyl-D-aspartic acid (NMDA) depolarization. Its feature abstraction capabilities show promising phonetic invariance and generalization attributes. Our model improves the performance of a Support Vector Machine (SVM) classifier for monosyllabic, disyllabic and trisyllabic word classification tasks in the presence of environmental disturbances such as white noise, reverberation, and pitch and voice variations. Furthermore, our approach emphasizes potential self-organizing cortical principles, achieving this improvement without any kind of optimization guidance that could minimize hypothetical loss functions by means of, for example, backpropagation. Thus, our computational model outperforms multiresolution spectro-temporal auditory feature representations using only the statistical sequential structure embedded in the phonotactic rules of the input stream.
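The infant segmentation finding mentioned in the abstract is commonly described in terms of transitional probabilities between adjacent syllables. The sketch below is only an illustration of that idea and is not the cortical model introduced in the paper. The function names, the toy syllable stream of three made-up words, and the 0.9 threshold are all assumptions chosen for the example.

```python
from collections import Counter

def transitional_probabilities(syllables):
    """Estimate P(next | current) for every adjacent syllable pair."""
    pair_counts = Counter(zip(syllables, syllables[1:]))
    # Count each syllable only where it has a successor.
    first_counts = Counter(syllables[:-1])
    return {pair: n / first_counts[pair[0]] for pair, n in pair_counts.items()}

def segment(syllables, threshold):
    """Place a word boundary wherever the transitional probability is below threshold."""
    tp = transitional_probabilities(syllables)
    words, current = [], [syllables[0]]
    for prev, nxt in zip(syllables, syllables[1:]):
        if tp[(prev, nxt)] < threshold:
            words.append("".join(current))
            current = []
        current.append(nxt)
    words.append("".join(current))
    return words

# Hypothetical toy stream: three invented words (bidaku, padoti, golabu)
# concatenated in varying order with no pauses between them.
stream = ("bi da ku pa do ti go la bu pa do ti bi da ku "
          "go la bu bi da ku pa do ti go la bu").split()

print(segment(stream, threshold=0.9))
```

In this toy stream every within-word transition has probability 1, while transitions across word boundaries are less predictable, so a simple threshold recovers the three words without any labels.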

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6555517 (PMC)
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0217966 (PLOS)

Publication Analysis

Top Keywords

phonetic acquisition: 4
cortical: 4
acquisition cortical: 4
cortical dynamics: 4
dynamics computational: 4
computational approach: 4
approach computational: 4
computational theories: 4
theories developed: 4
developed improve: 4

Similar Publications

Language outcomes of children with hearing loss remain heterogeneous despite recent advances in treatment and intervention. Consonants with high frequency, in particular, continue to pose challenges to affected children's speech perception and production. In this review, the authors evaluate findings of how enriched child-directed speech and song might function as a form of early family-centered intervention to remedy the effects of hearing loss on consonant acquisition already during infancy.

This study assessed the neural mechanisms and relative saliency of categorization for speech sounds and comparable graphemes (i.e., visual letters) of the same phonetic label.

Interactions between the context in which a sensorimotor skill is learned and the recall of that memory have been primarily studied in limb movements, but speech production requires movement, and many aspects of speech processing are influenced by task-relevant contextual information. Here, in ecologically valid speech (read sentences), we test whether English-French bilinguals can use the language of production to acquire and recall distinct motor plans for similar speech sounds spanning the production workspace. Participants experienced real-time alterations of auditory feedback while producing interleaved English and French sentences.

Article Synopsis
  • Variability in speech input can help infants learn sound contrasts, as it may highlight individual cues or reveal stable relationships between them.
  • The study focuses on the relationship between Voice Onset Time (VOT) and F1 formant frequency in German, showing that adults use these cues to differentiate voiced and voiceless stops based on a trading relation.
  • Infants, at six months, show a preference for speech in which phonetic cues align with the adult trading relation rather than a reversed relation, indicating their ability to discern these sound patterns.

Language processing encompasses a sophisticated interplay of phonological (sound-based) and semantic (meaning-based) processes. This intricate interaction develops progressively during early language acquisition. It involves not only the addition of new words to the child's vocabulary but also the evolving organization of lexico-semantic networks.
