Two Tales of Single-Phase Contrastive Hebbian Learning

2402.08573

Published 6/26/2024 by Rasmus Kj{ae}r H{o}ier, Christopher Zach

Two Tales of Single-Phase Contrastive Hebbian Learning

Abstract

The search for biologically plausible'' learning algorithms has converged on the idea of representing gradients as activity differences. However, most approaches require a high degree of synchronization (distinct phases during learning) and introduce substantial computational overhead, which raises doubts regarding their biological plausibility as well as their potential utility for neuromorphic computing. Furthermore, they commonly rely on applying infinitesimal perturbations (nudges) to output units, which is impractical in noisy environments. Recently it has been shown that by modelling artificial neurons as dyads with two oppositely nudged compartments, it is possible for a fully local learning algorithm named dual propagation'' to bridge the performance gap to backpropagation, without requiring separate learning phases or infinitesimal nudging. However, the algorithm has the drawback that its numerical stability relies on symmetric nudging, which may be restrictive in biological and analog implementations. In this work we first provide a solid foundation for the objective underlying the dual propagation method, which also reveals a surprising connection with adversarial robustness. Second, we demonstrate how dual propagation is related to a particular adjoint state method, which is stable regardless of asymmetric nudging.

Create account to get full access

Overview

This paper presents two alternative approaches to single-phase contrastive Hebbian learning, a biologically-inspired machine learning technique.
The authors explore different ways of implementing contrastive Hebbian learning, which aims to capture the dynamics of neural activity in the brain during learning.
The paper provides a comprehensive comparison of the two proposed approaches, analyzing their performance on various tasks and discussing their biological plausibility.

Plain English Explanation

The human brain is an incredible learning machine, capable of adapting and forming new connections between neurons over time. Contrastive Hebbian learning is a type of machine learning that tries to mimic this process, with the goal of creating more biologically-plausible artificial intelligence systems.

In this paper, the authors explore two different ways of implementing contrastive Hebbian learning in a single-phase, rather than the typical multi-phase, approach. This means the algorithm can learn and adapt in a more continuous and dynamic way, similar to how the brain operates.

The authors compare the performance of these two single-phase contrastive Hebbian learning approaches on various tasks, and discuss their biological plausibility. They find that both methods can be effective, but have different strengths and weaknesses in terms of things like speed of learning, energy efficiency, and ability to capture the complex dynamics of neural activity.

Overall, this research helps us better understand how the brain learns and opens up new avenues for developing more biologically-inspired AI systems that can adapt and learn in a more natural, flexible way.

Technical Explanation

The paper presents two alternative approaches to single-phase contrastive Hebbian learning, a biologically-inspired machine learning technique that aims to capture the dynamics of neural activity during learning.

The first approach, called "Parallel Contrastive Hebbian Learning" (PCHL), involves updating the synaptic weights in parallel based on the difference between the actual and desired neural activations. The second approach, "Sequential Contrastive Hebbian Learning" (SCHL), updates the weights sequentially, using the current state of the network to compute the next update.

The authors compare the performance of PCHL and SCHL on a range of tasks, including image classification, dataset learning, and multi-layer neural networks. They find that both methods can be effective, with PCHL generally performing better in terms of training speed and energy efficiency, while SCHL is better able to capture the complex neohebbian dynamics observed in biological neural networks.

The paper also discusses the biological plausibility of the two approaches, considering factors such as the temporal dynamics of neural activity, the efficiency of the update rules, and the ability to learn in a single, continuous phase.

Critical Analysis

The paper provides a comprehensive comparison of two single-phase contrastive Hebbian learning approaches, offering valuable insights into the tradeoffs between different implementation strategies. However, the authors acknowledge that both PCHL and SCHL have limitations and areas for further research.

For example, the paper notes that the SCHL approach may be more computationally expensive and require more memory, which could be a practical concern for real-world applications. Additionally, the authors suggest that further work is needed to fully capture the complex, neohebbian dynamics observed in biological neural networks.

One potential area for further exploration could be the development of hybrid approaches that combine the strengths of PCHL and SCHL, or the integration of additional biologically-inspired mechanisms to enhance the learning capabilities of these single-phase contrastive Hebbian learning methods.

Conclusion

This paper presents two novel approaches to single-phase contrastive Hebbian learning, a biologically-inspired machine learning technique. The authors provide a thorough comparison of the performance and biological plausibility of these two methods, offering valuable insights for researchers and practitioners interested in developing more biologically-plausible computing systems.

The findings suggest that single-phase contrastive Hebbian learning can be an effective and efficient way to capture the dynamics of neural activity during learning, with the potential to lead to more adaptive and flexible AI systems. However, the paper also highlights areas for further research to address the limitations of the proposed approaches and continue advancing the field of biologically-inspired machine learning.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Towards Biologically Plausible Computing: A Comprehensive Comparison

Changze Lv, Yufei Gu, Zhengkang Guo, Zhibo Xu, Yixin Wu, Feiran Zhang, Tianyuan Shi, Zhenghua Wang, Ruicheng Yin, Yu Shang, Siqi Zhong, Xiaohua Wang, Muling Wu, Wenhao Liu, Tianlong Li, Jianhao Zhu, Cenyuan Zhang, Zixuan Ling, Xiaoqing Zheng

Backpropagation is a cornerstone algorithm in training neural networks for supervised learning, which uses a gradient descent method to update network weights by minimizing the discrepancy between actual and desired outputs. Despite its pivotal role in propelling deep learning advancements, the biological plausibility of backpropagation is questioned due to its requirements for weight symmetry, global error computation, and dual-phase training. To address this long-standing challenge, many studies have endeavored to devise biologically plausible training algorithms. However, a fully biologically plausible algorithm for training multilayer neural networks remains elusive, and interpretations of biological plausibility vary among researchers. In this study, we establish criteria for biological plausibility that a desirable learning algorithm should meet. Using these criteria, we evaluate a range of existing algorithms considered to be biologically plausible, including Hebbian learning, spike-timing-dependent plasticity, feedback alignment, target propagation, predictive coding, forward-forward algorithm, perturbation learning, local losses, and energy-based learning. Additionally, we empirically evaluate these algorithms across diverse network architectures and datasets. We compare the feature representations learned by these algorithms with brain activity recorded by non-invasive devices under identical stimuli, aiming to identify which algorithm can most accurately replicate brain activity patterns. We are hopeful that this study could inspire the development of new biologically plausible algorithms for training multilayer networks, thereby fostering progress in both the fields of neuroscience and machine learning.

6/26/2024

cs.NE

Dataset-learning duality and emergent criticality

Ekaterina Kukleva, Vitaly Vanchurin

In artificial neural networks, the activation dynamics of non-trainable variables is strongly coupled to the learning dynamics of trainable variables. During the activation pass, the boundary neurons (e.g., input neurons) are mapped to the bulk neurons (e.g., hidden neurons), and during the learning pass, both bulk and boundary neurons are mapped to changes in trainable variables (e.g., weights and biases). For example, in feed-forward neural networks, forward propagation is the activation pass and backward propagation is the learning pass. We show that a composition of the two maps establishes a duality map between a subspace of non-trainable boundary variables (e.g., dataset) and a tangent subspace of trainable variables (i.e., learning). In general, the dataset-learning duality is a complex non-linear map between high-dimensional spaces, but in a learning equilibrium, the problem can be linearized and reduced to many weakly coupled one-dimensional problems. We use the duality to study the emergence of criticality, or the power-law distributions of fluctuations of the trainable variables. In particular, we show that criticality can emerge in the learning system even from the dataset in a non-critical state, and that the power-law distribution can be modified by changing either the activation function or the loss function.

5/28/2024

cs.LG cs.NE

🤿

Effective Learning with Node Perturbation in Deep Neural Networks

Sander Dalm, Marcel van Gerven, Nasir Ahmad

Backpropagation (BP) remains the dominant and most successful method for training parameters of deep neural network models. However, BP relies on two computationally distinct phases, does not provide a satisfactory explanation of biological learning, and can be challenging to apply for training of networks with discontinuities or noisy node dynamics. By comparison, node perturbation (NP) proposes learning by the injection of noise into network activations, and subsequent measurement of the induced loss change. NP relies on two forward (inference) passes, does not make use of network derivatives, and has been proposed as a model for learning in biological systems. However, standard NP is highly data inefficient and unstable due to its unguided noise-based search process. In this work, we investigate different formulations of NP and relate it to the concept of directional derivatives as well as combining it with a decorrelating mechanism for layer-wise inputs. We find that a closer alignment with directional derivatives together with input decorrelation at every layer strongly enhances performance of NP learning with large improvements in parameter convergence and much higher performance on the test data, approaching that of BP. Furthermore, our novel formulation allows for application to noisy systems in which the noise process itself is inaccessible.

5/28/2024

cs.LG

📉

Neuron-centric Hebbian Learning

Andrea Ferigo, Elia Cunegatti, Giovanni Iacca

One of the most striking capabilities behind the learning mechanisms of the brain is the adaptation, through structural and functional plasticity, of its synapses. While synapses have the fundamental role of transmitting information across the brain, several studies show that it is the neuron activations that produce changes on synapses. Yet, most plasticity models devised for artificial Neural Networks (NNs), e.g., the ABCD rule, focus on synapses, rather than neurons, therefore optimizing synaptic-specific Hebbian parameters. This approach, however, increases the complexity of the optimization process since each synapse is associated to multiple Hebbian parameters. To overcome this limitation, we propose a novel plasticity model, called Neuron-centric Hebbian Learning (NcHL), where optimization focuses on neuron- rather than synaptic-specific Hebbian parameters. Compared to the ABCD rule, NcHL reduces the parameters from $5W$ to $5N$, being $W$ and $N$ the number of weights and neurons, and usually $N ll W$. We also devise a ``weightless'' NcHL model, which requires less memory by approximating the weights based on a record of neuron activations. Our experiments on two robotic locomotion tasks reveal that NcHL performs comparably to the ABCD rule, despite using up to $sim97$ times less parameters, thus allowing for scalable plasticity

4/17/2024

cs.NE cs.AI cs.LG