A design of magnetic tunnel junctions for the deployment of neuromorphic hardware for edge computing

Read original: arXiv:2409.02528 - Published 9/5/2024 by Davi Rodrigues, Eleonora Raimondo, Riccardo Tomasello, Mario Carpentieri, Giovanni Finocchio

👀

Overview

Magnetic tunnel junctions (MTJs) offer promising opportunities for advancing neuromorphic computing
This work presents an MTJ design that can compute the sigmoidal activation function and its gradient at the device level
This enables both feedforward and backpropagation computations within a single device
The implementation is robust to inter-device variation and noise, and supports transfer learning and knowledge distillation

Plain English Explanation

The paper discusses a new design for magnetic tunnel junctions (MTJs), which are electronic devices that can be used for neuromorphic computing. The key idea is that this new MTJ design can perform two important computational steps for neural networks - the activation function (which determines the output of a neuron) and the backpropagation algorithm (which is used to train the network).

Normally, these two steps would require separate hardware components, but the researchers' MTJ design can do both within a single device. This is important because it can lead to more compact and efficient hardware for neural networks, which are increasingly being used in edge computing applications like smart phones and sensors.

The researchers found that their MTJ design is very robust - it can work well even with small differences from the ideal behavior, and it can handle variations between different devices and noise in the system. This makes it a promising technology for real-world applications. They also show that it supports useful machine learning techniques like transfer learning and knowledge distillation.

Technical Explanation

The key innovation in this work is an MTJ design with a free layer and two polarizers that can compute both the sigmoidal activation function and its gradient at the device level. This enables the MTJ to perform both the feedforward and backpropagation computations required for training neural networks, all within a single component.

The researchers implemented this algorithm and evaluated its performance. They found that the small discrepancies between the MTJ-generated curves and the ideal software-generated curves had a negligible impact on the performance of the backpropagation algorithm. Additionally, the device implementation was highly robust to inter-device variation and noise.

To demonstrate the potential of this approach, the researchers evaluated the performance of an edge computing network using weights from a software-trained model implemented with their MTJ design. The results showed only a minimal loss of accuracy of 0.1% for the Fashion MNIST dataset and 2% for the CIFAR-100 dataset compared to the original software implementation. These results highlight the promise of the researchers' MTJ design for compact, hardware-based neural networks in edge computing applications, particularly for transfer learning.

Critical Analysis

The paper provides a thorough evaluation of the proposed MTJ design and its potential for neuromorphic computing applications. The researchers have clearly demonstrated the device's ability to compute the sigmoidal activation function and its gradient, as well as its robustness to variations and noise.

However, the paper does not address some potential limitations or areas for further research. For example, it would be helpful to understand the scalability of this approach - how well would it work for larger, more complex neural networks? Additionally, the paper does not explore the power consumption or other practical considerations of implementing this MTJ design in real-world systems.

It would also be valuable to see a more detailed comparison to other neuromorphic computing approaches, such as those based on memristors or spintronic devices. This could help contextualize the relative strengths and weaknesses of the researchers' MTJ design.

Conclusion

This work presents a promising new MTJ design that can compute both the sigmoidal activation function and its gradient at the device level, enabling efficient hardware-based neural network computations. The design is highly robust and supports useful machine learning techniques like transfer learning, making it a compelling option for edge computing applications.

While the paper does not address all potential limitations, the researchers have clearly demonstrated the potential of their approach and laid the groundwork for further development and refinement of this technology. As neuromorphic computing continues to advance, designs like this MTJ could play an important role in enabling more efficient and powerful AI systems at the edge.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

👀

A design of magnetic tunnel junctions for the deployment of neuromorphic hardware for edge computing

Davi Rodrigues, Eleonora Raimondo, Riccardo Tomasello, Mario Carpentieri, Giovanni Finocchio

The electrically readable complex dynamics of robust and scalable magnetic tunnel junctions (MTJs) offer promising opportunities for advancing neuromorphic computing. In this work, we present an MTJ design with a free layer and two polarizers capable of computing the sigmoidal activation function and its gradient at the device level. This design enables both feedforward and backpropagation computations within a single device, extending neuromorphic computing frameworks previously explored in the literature by introducing the ability to perform backpropagation directly in hardware. Our algorithm implementation reveals two key findings: (i) the small discrepancies between the MTJ-generated curves and the exact software-generated curves have a negligible impact on the performance of the backpropagation algorithm, (ii) the device implementation is highly robust to inter-device variation and noise, and (iii) the proposed method effectively supports transfer learning and knowledge distillation. To demonstrate this, we evaluated the performance of an edge computing network using weights from a software-trained model implemented with our MTJ design. The results show a minimal loss of accuracy of only 0.1% for the Fashion MNIST dataset and 2% for the CIFAR-100 dataset compared to the original software implementation. These results highlight the potential of our MTJ design for compact, hardware-based neural networks in edge computing applications, particularly for transfer learning.

9/5/2024

Domain wall and Magnetic Tunnel Junction Hybrid for on-chip Learning in UNet architecture

Venkatesh Vadde, Bhaskaran Muralidharan, Abhishek Sharma

We present spintronic devices based hardware implementation of UNet for segmentation tasks. Our approach involves designing hardware for convolution, deconvolution, rectified activation function (ReLU), and max pooling layers of the UNet architecture. We designed the convolution and deconvolution layers of the network using the synaptic behavior of the domain wall MTJ. We also construct the ReLU and max pooling functions of the network utilizing the spin hall driven orthogonal current injected MTJ. To incorporate the diverse physics of spin-transport, magnetization dynamics, and CMOS elements in our UNet design, we employ a hybrid simulation setup that couples micromagnetic simulation, non-equilibrium Green's function, SPICE simulation along with network implementation. We evaluate our UNet design on the CamVid dataset and achieve segmentation accuracies of 83.71$%$ on test data, on par with the software implementation with 821mJ of energy consumption for on-chip training over 150 epochs. We further demonstrate nearly one order $(10times)$ improvement in the energy requirement of the network using unstable ferromagnet ($Delta$=4.58) over the stable ferromagnet ($Delta$=45) based ReLU and max pooling functions while maintaining the similar accuracy. The hybrid architecture comprising domain wall MTJ and unstable FM-based MTJ leads to an on-chip energy consumption of 85.79mJ during training, with a testing energy cost of 1.55 $mu J$.

7/12/2024

Measurement-driven neural-network training for integrated magnetic tunnel junction arrays

William A. Borders, Advait Madhavan, Matthew W. Daniels, Vasileia Georgiou, Martin Lueker-Boden, Tiffany S. Santos, Patrick M. Braganca, Mark D. Stiles, Jabez J. McClelland, Brian D. Hoskins

The increasing scale of neural networks needed to support more complex applications has led to an increasing requirement for area- and energy-efficient hardware. One route to meeting the budget for these applications is to circumvent the von Neumann bottleneck by performing computation in or near memory. An inevitability of transferring neural networks onto hardware is that non-idealities such as device-to-device variations or poor device yield impact performance. Methods such as hardware-aware training, where substrate non-idealities are incorporated during network training, are one way to recover performance at the cost of solution generality. In this work, we demonstrate inference on hardware neural networks consisting of 20,000 magnetic tunnel junction arrays integrated on a complementary metal-oxide-semiconductor chips that closely resembles market-ready spin transfer-torque magnetoresistive random access memory technology. Using 36 dies, each containing a crossbar array with its own non-idealities, we show that even a small number of defects in physically mapped networks significantly degrades the performance of networks trained without defects and show that, at the cost of generality, hardware-aware training accounting for specific defects on each die can recover to comparable performance with ideal networks. We then demonstrate a robust training method that extends hardware-aware training to statistics-aware training, producing network weights that perform well on most defective dies regardless of their specific defect locations. When evaluated on the 36 physical dies, statistics-aware trained solutions can achieve a mean misclassification error on the MNIST dataset that differs from the software-baseline by only 2 %. This statistics-aware training method could be generalized to networks with many layers that are mapped to hardware suited for industry-ready applications.

5/15/2024

🛠️

Superparamagnetic Tunnel Junctions for Reliable True Randomness

Dooyong Koh, Qiuyuan Wang, Brooke C. McGoldrick, Chung-Tao Chou, Luqiao Liu, Marc A. Baldo

Physical devices exhibiting stochastic functions with low energy consumption and high device density have the potential to enable complex probability-based computing algorithms, accelerate machine learning tasks, and enhance hardware security. Recently, superparamagnetic tunnel junctions (sMTJs) have been widely explored for such purposes, leading to the development of sMTJ-based systems; however, the reliance on nanoscale ferromagnets limits scalability and reliability, making sMTJs sensitive to external perturbations and prone to significant device variations. Here, we present an experimental demonstration of closed loop three-terminal sMTJs as reliable and potentially scalable sources of true randomness in the field-free regime. By leveraging dual-current controllability and incorporating feedback, we stabilize the switching operation of superparamagnets and reach cryptographic-quality random bitstreams. The realization of controllable and robust true random sMTJs underpin a general hardware platform for computing schemes exploiting the stochasticity in the physical world, as demonstrated by the generative artificial intelligence example in our experiment.

9/17/2024