Decoding and synthesizing tonal language speech from brain activity.

Sci Adv

Department of Neurosurgery, Huashan Hospital, Shanghai Medical College, Fudan University, Shanghai 200040, China.

Published: June 2023

Recent studies have shown that the feasibility of speech brain-computer interfaces (BCIs) as a clinically valid treatment in helping nontonal language patients with communication disorders restore their speech ability. However, tonal language speech BCI is challenging because additional precise control of laryngeal movements to produce lexical tones is required. Thus, the model should emphasize the features from the tonal-related cortex. Here, we designed a modularized multistream neural network that directly synthesizes tonal language speech from intracranial recordings. The network decoded lexical tones and base syllables independently via parallel streams of neural network modules inspired by neuroscience findings. The speech was synthesized by combining tonal syllable labels with nondiscriminant speech neural activity. Compared to commonly used baseline models, our proposed models achieved higher performance with modest training data and computational costs. These findings raise a potential strategy for approaching tonal language speech restoration.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10256166PMC
http://dx.doi.org/10.1126/sciadv.adh0478DOI Listing

Publication Analysis

Top Keywords

tonal language
16
language speech
16
speech
8
lexical tones
8
neural network
8
tonal
5
language
5
decoding synthesizing
4
synthesizing tonal
4
speech brain
4

Similar Publications

Background: People share health-related experiences and treatments, such as for insomnia, in digital communities. Natural language processing tools can be leveraged to understand the terms used in digital spaces to discuss insomnia and insomnia treatments.

Objective: The aim of this study is to summarize and chart trends of insomnia treatment terms on a digital insomnia message board.

View Article and Find Full Text PDF

Purpose: Children with autism spectrum disorder (ASD) often show abnormal speech prosody. Tonal languages can pose more difficulties as speakers need to use acoustic cues to make lexical contrasts while encoding the focal function, but the acquisition of speech prosody of non-native languages, especially tonal languages has rarely been investigated.

Methods: This study aims to fill in the aforementioned gap by studying prosodic focus-marking in Mandarin by native Cantonese-speaking children with ASD (n = 25), in comparison with their typically developing (TD) peers (n = 20) and native Mandarin-speaking children (n = 20).

View Article and Find Full Text PDF

Background/objectives: Previous studies have examined the role of working memory in cognitive tasks such as syntactic, semantic, and phonological processing, thereby contributing to our understanding of linguistic information management and retrieval. However, the real-time processing of phonological information-particularly in relation to suprasegmental features like tone, where its contour represents a time-varying signal-remains a relatively underexplored area within the framework of Information Processing Theory (IPT). This study aimed to address this gap by investigating the real-time processing of similar tonal information by native Cantonese speakers, thereby providing a deeper understanding of how IPT applies to auditory processing.

View Article and Find Full Text PDF

Background: Cochlear implantation is an effective method of auditory rehabilitation. Nevertheless, the results show individual variations depending on several factors.

Aim: To evaluate cochlear implantation results based on the APCEI profile (Acceptance, Perception, Comprehension, Oral Expression and Intelligibility) and audiometric results.

View Article and Find Full Text PDF

In perceptual studies, musicality and pitch aptitude have been implicated in tone learning, while vocabulary size has been implicated in distributional (segment) learning. Moreover, working memory plays a role in the overnight consolidation of explicit-declarative L2 learning. This study examines how these factors uniquely account for individual differences in the distributional learning and consolidation of an L2 tone contrast, where learners are tonal language speakers, and the training is implicit.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!