A decade after speech was first decoded from human brain signals, accuracy and speed remain far below that of natural speech. Here we show how to decode the electrocorticogram with high accuracy and at natural-speech rates. Taking a cue from recent advances in machine translation, we train a recurrent neural network to encode each sentence-length sequence of neural activity into an abstract representation, and then to decode this representation, word by word, into an English sentence. For each participant, data consist of several spoken repeats of a set of 30-50 sentences, along with the contemporaneous signals from ~250 electrodes distributed over peri-Sylvian cortices. Average word error rates across a held-out repeat set are as low as 3%. Finally, we show how decoding with limited data can be improved with transfer learning, by training certain layers of the network under multiple participants' data.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10560395 | PMC |
http://dx.doi.org/10.1038/s41593-020-0608-8 | DOI Listing |
BMJ Health Care Inform
January 2025
King's College London, London, UK
In this perspective article, we consider the use of predictive models in healthcare and associated challenges. We will argue that patients can play a valuable role in supporting the safe and practicable embedding of such tools and provide some examples.
View Article and Find Full Text PDFAnn Transl Med
December 2024
Division of Advanced Gastrointestinal and Bariatric Surgery, Mayo Clinic, Jacksonville, FL, USA.
Background: Addressing language barriers through accurate interpretation is crucial for providing quality care and establishing trust. While the ability of artificial intelligence (AI) to translate medical documentation has been studied, its role for patient-provider communication is less explored. This review evaluates AI's effectiveness in clinical translation by assessing accuracy, usability, satisfaction, and feedback on its use.
View Article and Find Full Text PDFJACC Adv
December 2024
Division of Blood Disorders and Public Health Genomics, National Center on Birth Defects and Developmental Disabilities, Centers for Disease Control and Prevention, Atlanta, Georgia, USA.
Background: Familial hypercholesterolemia (FH) is a common genetic disorder that is strongly associated with premature cardiovascular disease. Effective diagnosis and appropriate treatment of FH can reduce cardiovascular disease risk; however, FH is underdiagnosed. Electronic health record (EHR)-based FH screening tools have been previously described to enhance the detection of FH.
View Article and Find Full Text PDFCreating the Babel Fish, a tool that helps individuals translate speech between any two languages, requires advanced technological innovation and linguistic expertise. Although conventional speech-to-speech translation systems composed of multiple subsystems performing translation in a cascaded fashion exist, scalable and high-performing unified systems remain underexplored. To address this gap, here we introduce SEAMLESSM4T-Massively Multilingual and Multimodal Machine Translation-a single model that supports speech-to-speech translation (101 to 36 languages), speech-to-text translation (from 101 to 96 languages), text-to-speech translation (from 96 to 36 languages), text-to-text translation (96 languages) and automatic speech recognition (96 languages).
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!