Differentiable Optimization of Similarity Scores Between Models and Brains

Read original: arXiv:2407.07059 - Published 7/10/2024 by Nathan Cloos, Moufan Li, Markus Siegel, Scott L. Brincat, Earl K. Miller, Guangyu Robert Yang, Christopher J. Cueva

Differentiable Optimization of Similarity Scores Between Models and Brains

Overview

This paper proposes a method for optimizing the similarity between machine learning models and brain activity data.
The goal is to better align the representations learned by AI models with the representations in the human brain.
The authors introduce a differentiable optimization technique that can be used to fine-tune AI models to maximize their similarity to brain data.

Plain English Explanation

The human brain is a complex and fascinating organ, and researchers are very interested in understanding how it works. One way to study the brain is by looking at patterns of neural activity - the electrical signals that neurons in the brain generate when they are active.

Advances in neuroimaging techniques have allowed scientists to measure brain activity with increasing precision, creating large datasets of "brain data" that can be compared to the representations learned by artificial intelligence (AI) models.

The idea behind this paper is that if we can find AI models whose internal representations closely match the representations in the human brain, it could help us better understand how the brain processes information. This could have important implications for a variety of applications, from building more human-like AI systems to developing new treatments for neurological disorders.

The key innovation in this paper is a new optimization technique that allows the authors to "fine-tune" AI models to maximize their similarity to brain data in a mathematically rigorous way. By treating the similarity metric as a differentiable function, the authors can use standard gradient-based optimization methods to adjust the model parameters and improve the alignment between the model and the brain.

This approach builds on prior work in contrastive learning and similarity-based model comparison, but the authors show that their technique can achieve better results in practice.

Technical Explanation

The core of the authors' approach is a differentiable optimization framework for maximizing the similarity between AI models and brain data. They define a similarity metric that can be computed between the activations of an AI model and the measured neural responses in the brain, and then use gradient-based optimization to adjust the model parameters to increase this similarity.

Specifically, the authors propose a novel "brain-model similarity" (BMS) objective function that compares the representations learned by the AI model to the brain data. This objective is differentiable with respect to the model parameters, allowing standard gradient descent techniques to be used to optimize the model.

The authors demonstrate the effectiveness of their approach through a series of experiments on various AI models and brain datasets. They show that by fine-tuning the models using the BMS objective, they are able to significantly improve the alignment between the model representations and the corresponding brain activity patterns.

Importantly, the authors also analyze the limitations of their approach, discussing potential issues like the sensitivity of the similarity metric to specific implementation choices and the challenge of scaling the optimization to larger models and datasets.

Critical Analysis

One key strength of this work is the technical rigor with which the authors develop their optimization framework. By formulating the problem in a differentiable way, they are able to leverage powerful gradient-based optimization techniques that can efficiently search the parameter space of the AI models.

However, a potential limitation is the reliance on specific similarity metrics and the sensitivity of the results to the choice of these metrics. The authors acknowledge that the BMS objective function may not be the only way to quantify the similarity between models and brains, and that alternative metrics could lead to different results.

Additionally, while the authors demonstrate the effectiveness of their approach on several benchmark datasets, it remains to be seen how well the technique will scale to larger and more complex AI models and brain data. Applying this method to state-of-the-art language or vision models, for example, could pose significant computational and technical challenges.

Finally, it's worth noting that while aligning AI models with brain representations is an intriguing goal, the ultimate value of this work will depend on the practical applications and insights that can be derived from these model-brain comparisons. The authors touch on some potential use cases, but further research will be needed to fully realize the benefits of this approach.

Conclusion

This paper presents a novel optimization technique for improving the alignment between artificial intelligence models and brain activity data. By defining a differentiable similarity metric and using gradient-based optimization, the authors demonstrate how AI models can be fine-tuned to better match the representations observed in the human brain.

While the technical details of the approach are complex, the core idea is relatively straightforward: if we can find AI models that closely mimic the information processing in the brain, it could lead to important insights about cognition and open up new avenues for developing more human-like artificial intelligence.

The authors have made a significant contribution to this field, but there is still much work to be done to fully realize the potential of this line of research. Continued collaboration between AI and neuroscience researchers will be crucial for advancing our understanding of the brain and building the next generation of intelligent systems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Differentiable Optimization of Similarity Scores Between Models and Brains

Nathan Cloos, Moufan Li, Markus Siegel, Scott L. Brincat, Earl K. Miller, Guangyu Robert Yang, Christopher J. Cueva

What metrics should guide the development of more realistic models of the brain? One proposal is to quantify the similarity between models and brains using methods such as linear regression, Centered Kernel Alignment (CKA), and angular Procrustes distance. To better understand the limitations of these similarity measures we analyze neural activity recorded in five experiments on nonhuman primates, and optimize synthetic datasets to become more similar to these neural recordings. How similar can these synthetic datasets be to neural activity while failing to encode task relevant variables? We find that some measures like linear regression and CKA, differ from angular Procrustes, and yield high similarity scores even when task relevant variables cannot be linearly decoded from the synthetic datasets. Synthetic datasets optimized to maximize similarity scores initially learn the first principal component of the target dataset, but angular Procrustes captures higher variance dimensions much earlier than methods like linear regression and CKA. We show in both theory and simulations how these scores change when different principal components are perturbed. And finally, we jointly optimize multiple similarity scores to find their allowed ranges, and show that a high angular Procrustes similarity, for example, implies a high CKA score, but not the converse.

7/10/2024

Correcting Biased Centered Kernel Alignment Measures in Biological and Artificial Neural Networks

Alex Murphy, Joel Zylberberg, Alona Fyshe

Centred Kernel Alignment (CKA) has recently emerged as a popular metric to compare activations from biological and artificial neural networks (ANNs) in order to quantify the alignment between internal representations derived from stimuli sets (e.g. images, text, video) that are presented to both systems. In this paper we highlight issues that the community should take into account if using CKA as an alignment metric with neural data. Neural data are in the low-data high-dimensionality domain, which is one of the cases where (biased) CKA results in high similarity scores even for pairs of random matrices. Using fMRI and MEG data from the THINGS project, we show that if biased CKA is applied to representations of different sizes in the low-data high-dimensionality domain, they are not directly comparable due to biased CKA's sensitivity to differing feature-sample ratios and not stimuli-driven responses. This situation can arise both when comparing a pre-selected area of interest (e.g. ROI) to multiple ANN layers, as well as when determining to which ANN layer multiple regions of interest (ROIs) / sensor groups of different dimensionality are most similar. We show that biased CKA can be artificially driven to its maximum value when using independent random data of different sample-feature ratios. We further show that shuffling sample-feature pairs of real neural data does not drastically alter biased CKA similarity in comparison to unshuffled data, indicating an undesirable lack of sensitivity to stimuli-driven neural responses. Positive alignment of true stimuli-driven responses is only achieved by using debiased CKA. Lastly, we report findings that suggest biased CKA is sensitive to the inherent structure of neural data, only differing from shuffled data when debiased CKA detects stimuli-driven alignment.

5/3/2024

Effective Layer Pruning Through Similarity Metric Perspective

Ian Pons, Bruno Yamamoto, Anna H. Reali Costa, Artur Jordao

Deep neural networks have been the predominant paradigm in machine learning for solving cognitive tasks. Such models, however, are restricted by a high computational overhead, limiting their applicability and hindering advancements in the field. Extensive research demonstrated that pruning structures from these models is a straightforward approach to reducing network complexity. In this direction, most efforts focus on removing weights or filters. Studies have also been devoted to layer pruning as it promotes superior computational gains. However, layer pruning often hurts the network predictive ability (i.e., accuracy) at high compression rates. This work introduces an effective layer-pruning strategy that meets all underlying properties pursued by pruning methods. Our method estimates the relative importance of a layer using the Centered Kernel Alignment (CKA) metric, employed to measure the similarity between the representations of the unpruned model and a candidate layer for pruning. We confirm the effectiveness of our method on standard architectures and benchmarks, in which it outperforms existing layer-pruning strategies and other state-of-the-art pruning techniques. Particularly, we remove more than 75% of computation while improving predictive ability. At higher compression regimes, our method exhibits negligible accuracy drop, while other methods notably deteriorate model accuracy. Apart from these benefits, our pruned models exhibit robustness to adversarial and out-of-distribution samples.

5/28/2024

How Aligned are Different Alignment Metrics?

Jannis Ahlert, Thomas Klein, Felix Wichmann, Robert Geirhos

In recent years, various methods and benchmarks have been proposed to empirically evaluate the alignment of artificial neural networks to human neural and behavioral data. But how aligned are different alignment metrics? To answer this question, we analyze visual data from Brain-Score (Schrimpf et al., 2018), including metrics from the model-vs-human toolbox (Geirhos et al., 2021), together with human feature alignment (Linsley et al., 2018; Fel et al., 2022) and human similarity judgements (Muttenthaler et al., 2022). We find that pairwise correlations between neural scores and behavioral scores are quite low and sometimes even negative. For instance, the average correlation between those 80 models on Brain-Score that were fully evaluated on all 69 alignment metrics we considered is only 0.198. Assuming that all of the employed metrics are sound, this implies that alignment with human perception may best be thought of as a multidimensional concept, with different methods measuring fundamentally different aspects. Our results underline the importance of integrative benchmarking, but also raise questions about how to correctly combine and aggregate individual metrics. Aggregating by taking the arithmetic average, as done in Brain-Score, leads to the overall performance currently being dominated by behavior (95.25% explained variance) while the neural predictivity plays a less important role (only 33.33% explained variance). As a first step towards making sure that different alignment metrics all contribute fairly towards an integrative benchmark score, we therefore conclude by comparing three different aggregation options.

7/11/2024