This study introduces a unified computational framework connecting acoustic, speech and word-level linguistic structures to study the neural basis of everyday conversations in the human brain. We used electrocorticography to record neural signals across 100 h of speech production and comprehension as participants engaged in open-ended real-life conversations. We extracted low-level acoustic, mid-level speech and contextual word embeddings from a multimodal speech-to-text model (Whisper). We developed encoding models that linearly map these embeddings onto brain activity during speech production and comprehension. Remarkably, this model accurately predicts neural activity at each level of the language processing hierarchy across hours of new conversations not used in training the model. The internal processing hierarchy in the model is aligned with the cortical hierarchy for speech and language processing, where sensory and motor regions better align with the model's speech embeddings, and higher-level language areas better align with the model's language embeddings. The Whisper model captures the temporal sequence of language-to-speech encoding before word articulation (speech production) and speech-to-language encoding post articulation (speech comprehension). The embeddings learned by this model outperform symbolic models in capturing neural activity supporting natural speech and language. These findings support a paradigm shift towards unified computational models that capture the entire processing hierarchy for speech comprehension and production in real-world conversations.
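As a rough illustration of what a linear encoding model evaluated on held-out data looks like, the sketch below fits one linear map from embeddings to several electrodes and scores it by correlating predicted and actual signals on data not used for fitting. Everything in it is an assumption made for illustration: the data are synthetic random numbers standing in for Whisper embeddings and electrocorticography signals, ridge regularization is one common choice for such maps rather than a method stated in the abstract, and the names `fit_ridge` and `columnwise_correlation` are hypothetical helpers.

```python
import numpy as np


def fit_ridge(X, Y, alpha=1.0):
    """Closed-form ridge regression: W = (X^T X + alpha * I)^-1 X^T Y."""
    n_features = X.shape[1]
    gram = X.T @ X + alpha * np.eye(n_features)
    return np.linalg.solve(gram, X.T @ Y)


def columnwise_correlation(Y_true, Y_pred):
    """Pearson r between actual and predicted signal, one value per column."""
    yt = Y_true - Y_true.mean(axis=0)
    yp = Y_pred - Y_pred.mean(axis=0)
    denom = np.linalg.norm(yt, axis=0) * np.linalg.norm(yp, axis=0)
    return (yt * yp).sum(axis=0) / denom


# Synthetic stand-ins (assumption): rows are words, X columns are embedding
# dimensions, Y columns are electrodes. Real data would replace all of this.
rng = np.random.default_rng(0)
n_words, n_dims, n_electrodes = 2000, 50, 8
X = rng.standard_normal((n_words, n_dims))
W_true = rng.standard_normal((n_dims, n_electrodes))
Y = X @ W_true + 5.0 * rng.standard_normal((n_words, n_electrodes))

# Fit on the first block of "conversation" and evaluate on the unseen remainder.
split = 1500
W = fit_ridge(X[:split], Y[:split], alpha=10.0)
r = columnwise_correlation(Y[split:], X[split:] @ W)
print(r.round(2))  # one held-out correlation per simulated electrode
```

Splitting by contiguous blocks rather than shuffling rows mirrors the idea of testing on new conversations; with real recordings the regularization strength would normally be chosen by cross-validation within the training portion.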

Source
http://dx.doi.org/10.1038/s41562-025-02105-9
