We use (multi)modal deep neural networks (DNNs) to probe for sites of multimodal integration in the human brain by predicting stereoencephalography (SEEG) recordings taken while human subjects watched movies. We operationalize sites of multimodal integration as regions where a multimodal vision-language model predicts recordings better than unimodal language, unimodal vision, or linearly integrated language-vision models. Our target DNN models span different architectures (e.g., convolutional networks and transformers) and multimodal training techniques (e.g., cross-attention and contrastive learning). As a key enabling step, we first demonstrate that trained vision and language models systematically outperform their randomly initialized counterparts in their ability to predict SEEG signals. We then compare unimodal and multimodal models against one another. Because our target DNN models often differ in architecture, number of parameters, and training set (which could obscure the differences attributable to integration), we also carry out a controlled comparison of two models (SLIP and SimCLR) that keep all of these attributes the same aside from input modality. Using this approach, we identify a sizable number of neural sites (on average 141 out of 1090 total sites, or 12.94%) and brain regions where multimodal integration seems to occur. Additionally, we find that among the multimodal training techniques we assess, CLIP-style training is the best suited for downstream prediction of the neural activity in these sites.
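The operationalization above can be illustrated with a small sketch. It is not the authors' pipeline: the ridge regression, the contiguous cross-validation folds, the Pearson-correlation score, the synthetic data, and every function and key name below (`ridge_fit_predict`, `cv_score`, `is_multimodal_site`, `"linear_combo"`) are assumptions made for illustration. The abstract does not specify the regression or the statistical tests, and a simple score comparison stands in for them here.

```python
import numpy as np


def ridge_fit_predict(X_train, y_train, X_test, alpha=1.0):
    """Closed-form ridge regression on centered data (illustrative choice)."""
    x_mean = X_train.mean(axis=0)
    y_mean = y_train.mean()
    Xc = X_train - x_mean
    yc = y_train - y_mean
    gram = Xc.T @ Xc + alpha * np.eye(Xc.shape[1])
    weights = np.linalg.solve(gram, Xc.T @ yc)
    return (X_test - x_mean) @ weights + y_mean


def cv_score(X, y, n_folds=5, alpha=1.0):
    """Mean held-out Pearson correlation over contiguous folds."""
    folds = np.array_split(np.arange(len(y)), n_folds)
    fold_scores = []
    for test_idx in folds:
        train_mask = np.ones(len(y), dtype=bool)
        train_mask[test_idx] = False
        pred = ridge_fit_predict(X[train_mask], y[train_mask], X[test_idx], alpha)
        fold_scores.append(np.corrcoef(pred, y[test_idx])[0, 1])
    return float(np.mean(fold_scores))


def is_multimodal_site(scores, margin=0.0):
    """A site counts as multimodal if the multimodal model beats every baseline."""
    baselines = (scores["language"], scores["vision"], scores["linear_combo"])
    return scores["multimodal"] > max(baselines) + margin


# Synthetic stand-in for one recording site: the response depends on an
# interaction between a vision feature and a language feature, which no
# unimodal or linearly combined feature set can express.
rng = np.random.default_rng(0)
n_samples = 400
vision = rng.normal(size=(n_samples, 3))
language = rng.normal(size=(n_samples, 3))
interaction = vision[:, 0] * language[:, 0]
response = interaction + 0.1 * rng.normal(size=n_samples)

features = {
    "vision": vision,
    "language": language,
    # Linear integration approximated as feature concatenation (assumption).
    "linear_combo": np.hstack([vision, language]),
    # Hypothetical multimodal features that encode the interaction term.
    "multimodal": np.hstack([vision, language, interaction[:, None]]),
}
scores = {name: cv_score(X, response) for name, X in features.items()}
site_is_multimodal = is_multimodal_site(scores)
```

In this toy setting only the feature set containing the interaction term predicts the response well, so the site is flagged as multimodal. With real recordings the comparison would need a significance test across sites and subjects rather than a raw score difference.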

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11213144PMC
