Efficient Neural Decoding Based on Multimodal Training.

Brain Sci

Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China.

Published: September 2024

Background/objectives: Neural decoding methods are often limited by the performance of brain encoders, which map complex brain signals into a latent representation space of perceptual information. These brain encoders are constrained by the limited amount of paired brain and stimulus data available for training, which makes it challenging to learn rich neural representations.

Methods: To address this limitation, we present a novel multimodal training approach using paired image and functional magnetic resonance imaging (fMRI) data to establish a brain masked autoencoder that learns the interactions between images and brain activities. Subsequently, we employ a diffusion model conditioned on brain data to decode realistic images.
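The masking step behind a masked autoencoder on paired data can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the tokenization, the masking ratio, or whether images and fMRI are masked jointly, so the function name `mask_paired_tokens`, the 0.75 ratio, and the joint masking scheme below are all assumptions made for illustration.

```python
import random


def mask_paired_tokens(image_tokens, fmri_tokens, mask_ratio=0.75, seed=0):
    """Randomly hide a fraction of a joint image + fMRI token sequence.

    Hypothetical helper: the ratio and the joint masking scheme are
    assumptions, not details taken from the paper.

    Returns (visible, masked_positions). `visible` is a list of
    (position, modality, token) tuples that an encoder would see;
    a decoder would be trained to reconstruct the masked positions.
    """
    if not 0.0 <= mask_ratio < 1.0:
        raise ValueError("mask_ratio must be in [0, 1)")

    # Concatenate both modalities into one sequence, tagging each token.
    joint = [("image", t) for t in image_tokens]
    joint += [("fmri", t) for t in fmri_tokens]

    # Choose which positions to hide, reproducibly for a given seed.
    n_masked = int(len(joint) * mask_ratio)
    rng = random.Random(seed)
    masked_positions = sorted(rng.sample(range(len(joint)), n_masked))
    masked_set = set(masked_positions)

    visible = [
        (i, modality, token)
        for i, (modality, token) in enumerate(joint)
        if i not in masked_set
    ]
    return visible, masked_positions


# Toy example: 8 image-patch tokens and 4 fMRI tokens (placeholder values).
image_tokens = [f"patch_{i}" for i in range(8)]
fmri_tokens = [f"voxel_group_{i}" for i in range(4)]
visible, masked_positions = mask_paired_tokens(image_tokens, fmri_tokens)
print(len(visible), "visible tokens;", len(masked_positions), "masked tokens")
```

Because tokens from both modalities share one sequence in this sketch, reconstructing a masked token can draw on visible tokens from the other modality, which is one plausible way for a model to learn interactions between images and brain activity.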

Results: Our method achieves high-quality decoding of both semantic content and low-level visual attributes, outperforming previous methods qualitatively and quantitatively while remaining computationally efficient. We also apply the method to decode artificial patterns across regions of interest (ROIs) to explore their functional properties. The results not only validate existing knowledge about these ROIs but also reveal new insights, such as synergy between the early visual cortex and higher-level scene ROIs, as well as competition within the higher-level scene ROIs.

Conclusions: These findings provide valuable insights for future directions in the field of neural decoding.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11506634
DOI: http://dx.doi.org/10.3390/brainsci14100988
