Deep Neural Networks and Brain Alignment: Brain Encoding and Decoding (Survey)

Read original: arXiv:2307.10246 - Published 7/9/2024 by Subba Reddy Oota, Zijiao Chen, Manish Gupta, Raju S. Bapi, Gael Jobard, Frederic Alexandre, Xavier Hinaut

🤿

Overview

This paper explores how insights about the brain can be obtained using AI models, and the relationship between deep learning models and brain recordings.
It discusses the use of brain recording techniques, such as functional magnetic resonance imaging (fMRI), to improve AI models.
The neuroscience community has contributed several large datasets related to passive reading, listening, and viewing of stimuli, which have been used to develop encoding and decoding models.
Encoding models aim to generate fMRI brain representations given a stimulus, while decoding models reconstruct the stimulus from fMRI data.
Deep learning-based encoding and decoding models have been recently proposed, inspired by the effectiveness of deep learning in other domains.

Plain English Explanation

Researchers have been investigating whether artificial intelligence (AI) models can provide insights into how the human brain works. They've been exploring the connection between the information stored in deep learning models and the signals recorded from the brain, such as those captured by functional magnetic resonance imaging (fMRI).

The idea is that studying these brain recordings could help improve the AI models themselves. Neuroscientists have put together several large datasets of people's brain activity while they were passively reading, listening to, or viewing different types of content, like words, stories, pictures, and movies.

Researchers have used these datasets to develop two main types of models. Encoding models try to generate the brain's response to a particular stimulus, like a word or image, based on the AI model's understanding. Decoding models try to do the opposite: reconstruct the original stimulus from the brain's response.

These models can be useful for evaluating and diagnosing neurological conditions, and may even help design therapies for brain injuries or disorders. The researchers were inspired by the impressive performance of deep learning models in areas like natural language processing, computer vision, and speech recognition, and have been developing deep learning-based encoding and decoding models.

Technical Explanation

The paper reviews the use of deep learning models to obtain insights about the brain by studying brain recordings like fMRI. It discusses how the representations learned by deep learning models may be related to the signals measured from the brain.

The neuroscience community has contributed several large datasets of brain activity measured during passive reading, listening, and viewing tasks. These datasets have been used to develop encoding models that can generate fMRI brain representations given a stimulus, and decoding models that can reconstruct the stimulus from fMRI data.

Inspired by the success of deep learning in other domains like natural language processing, computer vision, and speech, the authors review deep learning-based encoding and decoding architectures. These include models like MindBridge, Language Reconstruction from the Brain, BrainChat, and Neuro-Vision to Language.

The authors discuss the benefits and limitations of these deep learning approaches, and how they can be used for applications like evaluating neurological conditions and designing brain-computer interfaces.

Critical Analysis

The paper provides a comprehensive review of the use of deep learning models to study brain recordings, but there are a few potential limitations and areas for further research:

The datasets used to train these models are relatively small compared to typical deep learning datasets, which could limit the models' performance and generalization.
The paper does not delve into the interpretability of the deep learning models - it's not always clear how the models are extracting insights from the brain recordings.
The real-world applications of these models, such as for neurological diagnosis or brain-computer interfaces, are still in early stages and require more validation.

Researchers may want to explore ways to collect larger, more diverse brain recording datasets, and develop more interpretable deep learning architectures specifically tailored for this domain. Additionally, further work is needed to demonstrate the clinical utility of these models in practical settings.

Overall, this paper provides a valuable overview of an exciting area of research at the intersection of AI and neuroscience, but there are still many open challenges to address.

Conclusion

This paper highlights the potential for using AI models, particularly deep learning, to obtain insights about the human brain by studying brain recordings like fMRI data. The neuroscience community has contributed large datasets of brain activity during various perceptual and cognitive tasks, which have enabled the development of encoding and decoding models.

These models can generate fMRI brain representations from stimuli, or reconstruct stimuli from brain activity, respectively. Deep learning-based approaches have shown promising results, inspired by the successes of deep learning in other domains like natural language processing and computer vision.

While there are still limitations and areas for further research, this work demonstrates the growing synergy between AI and neuroscience, with the potential to advance our understanding of the brain and develop new applications for brain-computer interfaces and neurological diagnostics.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🤿

Deep Neural Networks and Brain Alignment: Brain Encoding and Decoding (Survey)

Subba Reddy Oota, Zijiao Chen, Manish Gupta, Raju S. Bapi, Gael Jobard, Frederic Alexandre, Xavier Hinaut

Can we obtain insights about the brain using AI models? How is the information in deep learning models related to brain recordings? Can we improve AI models with the help of brain recordings? Such questions can be tackled by studying brain recordings like functional magnetic resonance imaging (fMRI). As a first step, the neuroscience community has contributed several large cognitive neuroscience datasets related to passive reading/listening/viewing of concept words, narratives, pictures, and movies. Encoding and decoding models using these datasets have also been proposed in the past two decades. These models serve as additional tools for basic cognitive science and neuroscience research. Encoding models aim at generating fMRI brain representations given a stimulus automatically. They have several practical applications in evaluating and diagnosing neurological conditions and thus may also help design therapies for brain damage. Decoding models solve the inverse problem of reconstructing the stimuli given the fMRI. They are useful for designing brain-machine or brain-computer interfaces. Inspired by the effectiveness of deep learning models for natural language processing, computer vision, and speech, several neural encoding and decoding models have been recently proposed. In this survey, we will first discuss popular representations of language, vision and speech stimuli, and present a summary of neuroscience datasets. Further, we will review popular deep learning based encoding and decoding architectures and note their benefits and limitations. Finally, we will conclude with a summary and discussion about future trends. Given the large amount of recently published work in the computational cognitive neuroscience (CCN) community, we believe that this survey enables an entry point for DNN researchers to diversify into CCN research.

7/9/2024

Decoding Linguistic Representations of Human Brain

Yu Wang, Heyang Liu, Yuhao Wang, Chuan Xuan, Yixuan Hou, Sheng Feng, Hongcheng Liu, Yusheng Liao, Yanfeng Wang

Language, as an information medium created by advanced organisms, has always been a concern of neuroscience regarding how it is represented in the brain. Decoding linguistic representations in the evoked brain has shown groundbreaking achievements, thanks to the rapid improvement of neuroimaging, medical technology, life sciences and artificial intelligence. In this work, we present a taxonomy of brain-to-language decoding of both textual and speech formats. This work integrates two types of research: neuroscience focusing on language understanding and deep learning-based brain decoding. Generating discernible language information from brain activity could not only help those with limited articulation, especially amyotrophic lateral sclerosis (ALS) patients but also open up a new way for the next generation's brain-computer interface (BCI). This article will help brain scientists and deep-learning researchers to gain a bird's eye view of fine-grained language perception, and thus facilitate their further investigation and research of neural process and language decoding.

7/31/2024

MindBridge: A Cross-Subject Brain Decoding Framework

Shizun Wang, Songhua Liu, Zhenxiong Tan, Xinchao Wang

Brain decoding, a pivotal field in neuroscience, aims to reconstruct stimuli from acquired brain signals, primarily utilizing functional magnetic resonance imaging (fMRI). Currently, brain decoding is confined to a per-subject-per-model paradigm, limiting its applicability to the same individual for whom the decoding model is trained. This constraint stems from three key challenges: 1) the inherent variability in input dimensions across subjects due to differences in brain size; 2) the unique intrinsic neural patterns, influencing how different individuals perceive and process sensory information; 3) limited data availability for new subjects in real-world scenarios hampers the performance of decoding models. In this paper, we present a novel approach, MindBridge, that achieves cross-subject brain decoding by employing only one model. Our proposed framework establishes a generic paradigm capable of addressing these challenges by introducing biological-inspired aggregation function and novel cyclic fMRI reconstruction mechanism for subject-invariant representation learning. Notably, by cycle reconstruction of fMRI, MindBridge can enable novel fMRI synthesis, which also can serve as pseudo data augmentation. Within the framework, we also devise a novel reset-tuning method for adapting a pretrained model to a new subject. Experimental results demonstrate MindBridge's ability to reconstruct images for multiple subjects, which is competitive with dedicated subject-specific models. Furthermore, with limited data for a new subject, we achieve a high level of decoding accuracy, surpassing that of subject-specific models. This advancement in cross-subject brain decoding suggests promising directions for wider applications in neuroscience and indicates potential for more efficient utilization of limited fMRI data in real-world scenarios. Project page: https://littlepure2333.github.io/MindBridge

4/12/2024

Language Reconstruction with Brain Predictive Coding from fMRI Data

Congchi Yin, Ziyi Ye, Piji Li

Many recent studies have shown that the perception of speech can be decoded from brain signals and subsequently reconstructed as continuous language. However, there is a lack of neurological basis for how the semantic information embedded within brain signals can be used more effectively to guide language reconstruction. The theory of predictive coding suggests that human brain naturally engages in continuously predicting future word representations that span multiple timescales. This implies that the decoding of brain signals could potentially be associated with a predictable future. To explore the predictive coding theory within the context of language reconstruction, this paper proposes a novel model textsc{PredFT} for jointly modeling neural decoding and brain prediction. It consists of a main decoding network for language reconstruction and a side network for predictive coding. The side network obtains brain predictive coding representation from related brain regions of interest with a multi-head self-attention module. This representation is fused into the main decoding network with cross-attention to facilitate the language models' generation process. Experiments are conducted on the largest naturalistic language comprehension fMRI dataset Narratives. textsc{PredFT} achieves current state-of-the-art decoding performance with a maximum BLEU-1 score of $27.8%$.

5/21/2024