A symmetric silicon microring resonator optical crossbar array for accelerated inference and training in deep learning

Read original: arXiv:2401.16072 - Published 6/4/2024 by Rui Tang, Shuhei Ohno, Ken Tanizawa, Kazuhiro Ikeda, Makoto Okano, Kasidit Toprasertpong, Shinichi Takagi, Mitsuru Takenaka


Overview

Photonic integrated circuits are emerging as a promising platform for accelerating matrix multiplications in deep learning
The paper proposes a silicon microring resonator (MRR) optical crossbar array with a symmetric structure that allows for simple on-chip backpropagation, enabling the acceleration of both inference and training phases of deep learning
The researchers demonstrate a 4x4 circuit on a Si-on-insulator (SOI) platform and use it to perform inference tasks of a simple neural network for classifying Iris flowers, achieving 93.3% accuracy
They then train the neural network using simulated on-chip backpropagation and achieve 91.1% accuracy in the same inference task
The paper also simulates a convolutional neural network (CNN) for handwritten digit recognition, using a 9x9 MRR crossbar array to perform the convolution operations

Plain English Explanation

Photonic integrated circuits are chips that process information using light instead of electricity. This can be especially useful for deep learning, because light is inherently parallel and can potentially perform certain mathematical operations, such as matrix multiplication, faster and more efficiently than conventional electronics.

The researchers in this paper have developed a photonic chip design intended to accelerate both inference (running a trained model) and training (learning the model's parameters) of deep neural networks. Their key idea is to arrange photonic components called microring resonators (MRRs) in a symmetric, grid-like "crossbar" pattern.

This MRR crossbar array has a symmetric structure that simplifies backpropagation, the error-correction step that is crucial for training neural networks. Using a fabricated 4x4 chip, the researchers classify Iris flowers with 93.3% accuracy. In simulation, they also train the network with on-chip backpropagation (91.1% accuracy) and apply a larger 9x9 array to handwritten digit recognition.

By enabling both fast inference and efficient training on a photonic chip, this research brings us closer to realizing photonic neural network accelerators that are compact, energy-efficient, and can handle the massive computational demands of modern deep learning.

Technical Explanation

The paper presents a silicon microring resonator (MRR) optical crossbar array with a symmetric structure that allows for simple on-chip backpropagation, potentially enabling the acceleration of both the inference and training phases of deep learning.
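To illustrate why a symmetric crossbar is convenient for backpropagation, the sketch below models the array as an ideal weight matrix. It assumes that driving the array in one direction computes W x and that driving the same array from the opposite side computes W^T δ. The function names, the weight range, and the noise-free model are illustrative assumptions, not the authors' device model.

```python
import numpy as np

def crossbar_forward(W, x):
    """Idealized forward pass: input vector x in, weighted sums W @ x out."""
    return W @ x

def crossbar_backward(W, delta):
    """Idealized backward pass: error vector enters from the output side,
    so the same weights act as the transpose, W.T @ delta."""
    return W.T @ delta

rng = np.random.default_rng(0)
W = rng.uniform(-1.0, 1.0, size=(4, 4))   # hypothetical weights of a 4x4 array
x = rng.uniform(0.0, 1.0, size=4)         # hypothetical input vector
delta = rng.uniform(-1.0, 1.0, size=4)    # hypothetical output-side error

y = crossbar_forward(W, x)
back = crossbar_backward(W, delta)

# Adjoint identity: delta . (W x) equals (W^T delta) . x
print(np.isclose(delta @ y, back @ x))
```

The point of the sketch is that no second weight matrix needs to be programmed for the backward pass: in this idealized picture the transpose comes from reversing the direction of propagation through the same array.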

The researchers demonstrate a 4x4 MRR crossbar circuit fabricated on a Si-on-insulator (SOI) platform and use it to perform inference tasks of a simple neural network for classifying Iris flowers. They achieve a classification accuracy of 93.3% with this photonic inference system.

The team then trains the neural network using simulated on-chip backpropagation and achieves an accuracy of 91.1% in the same inference task after training. This shows the potential of their photonic architecture to accelerate not just inference, but also the crucial training process for deep learning models.
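As a rough picture of what training with backpropagation involves, the following sketch trains a tiny two-layer network in which the backward step reuses the second layer's weights in transposed form. Everything here is an assumption made for illustration: the data are synthetic (not the Iris dataset), the layer sizes, tanh activation, learning rate, and the clipping of weights to [-1, 1] are hypothetical choices, and no hardware noise is modeled.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-in data: 4 features, 3 classes (NOT the Iris dataset).
centers = rng.normal(size=(3, 4))
labels = rng.integers(0, 3, size=150)
X = centers[labels] + 0.3 * rng.normal(size=(150, 4))
Y = np.eye(3)[labels]                      # one-hot targets

W1 = rng.uniform(-0.5, 0.5, size=(4, 4))   # hidden layer, 4x4
W2 = rng.uniform(-0.5, 0.5, size=(3, 4))   # output layer, 3 classes

def forward(X, W1, W2):
    H = np.tanh(X @ W1.T)                  # hidden activations
    logits = H @ W2.T
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    P = np.exp(logits)
    P /= P.sum(axis=1, keepdims=True)      # softmax probabilities
    return H, P

def loss_fn(P, Y):
    return -np.mean(np.sum(Y * np.log(P + 1e-12), axis=1))

lr = 0.1
losses = []
for step in range(500):
    H, P = forward(X, W1, W2)
    losses.append(loss_fn(P, Y))
    delta2 = (P - Y) / len(X)              # output-layer error
    grad_W2 = delta2.T @ H
    # Backward pass through layer 2: each row is W2^T applied to one error vector.
    delta1 = (delta2 @ W2) * (1.0 - H**2)
    grad_W1 = delta1.T @ X
    # Assumed bounded weight range, standing in for a limited tuning range.
    W1 = np.clip(W1 - lr * grad_W1, -1.0, 1.0)
    W2 = np.clip(W2 - lr * grad_W2, -1.0, 1.0)

H, P = forward(X, W1, W2)
accuracy = np.mean(P.argmax(axis=1) == labels)
print(f"loss {losses[0]:.3f} -> {losses[-1]:.3f}, accuracy {accuracy:.3f}")
```

The line computing `delta1` is the step that a symmetric crossbar could, in principle, carry out optically by sending the error signals backward through the array that holds `W2`.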

Furthermore, the researchers simulate a convolutional neural network (CNN) for handwritten digit recognition, using a 9x9 MRR crossbar array to perform the convolution operations. This demonstrates the ability of their photonic approach to handle more complex deep learning architectures like CNNs.
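One common way to run a convolution on a matrix multiplier is to flatten each image patch into a vector. The sketch below assumes 3x3 kernels, so that each patch has 9 values, and nine kernels, so that the weights form a 9x9 matrix. This mapping, the 28x28 input size, and the helper name `im2col_3x3` are illustrative assumptions; the summary does not state how the authors map the convolution onto their 9x9 array.

```python
import numpy as np

def im2col_3x3(image):
    """Stack every 3x3 patch of a 2-D image as a column of 9 values."""
    h, w = image.shape
    cols = [image[i:i + 3, j:j + 3].ravel()
            for i in range(h - 2) for j in range(w - 2)]
    return np.stack(cols, axis=1)          # shape (9, (h-2)*(w-2))

rng = np.random.default_rng(0)
image = rng.uniform(0.0, 1.0, size=(28, 28))      # stand-in for a digit image
kernels = rng.uniform(-1.0, 1.0, size=(9, 3, 3))  # nine hypothetical 3x3 kernels

W = kernels.reshape(9, 9)                  # each row is one flattened kernel
patches = im2col_3x3(image)                # (9, 676)
feature_maps = (W @ patches).reshape(9, 26, 26)

# Spot-check one output against a direct sliding-window computation.
direct = np.sum(kernels[0] * image[5:8, 7:10])
print(np.isclose(feature_maps[0, 5, 7], direct))
```

In this picture each column of `patches` would be presented to the array as one 9-element input vector, and the nine outputs would be one pixel from each of the nine feature maps.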

Overall, this work contributes to the realization of compact and energy-efficient photonic accelerators for deep learning, building on previous research in photonic neuromorphic computing and wavelength-multiplexed reservoir computing.

Critical Analysis

The paper presents a promising approach for realizing photonic accelerators capable of both inference and training of deep neural networks. The symmetric MRR crossbar design is a clever solution to the challenge of implementing backpropagation on a photonic chip, a challenge also discussed in work on architecture-level modeling of photonic deep neural networks.

However, the experimental demonstration is limited to a small 4x4 circuit and simple neural network models. Scaling up to larger, more complex deep learning architectures while maintaining the benefits of the symmetric MRR design remains an open challenge that the authors acknowledge.

Additionally, the simulated backpropagation on the photonic chip may not fully capture the practical challenges and limitations of implementing the training process in a real photonic system. Further experimental validation will be necessary to confirm the feasibility and performance of on-chip training.

Overall, this work represents an important step towards realizing the potential of photonic integrated circuits for accelerating both the inference and training of deep learning models. Continued research and development in this area could lead to significant improvements in the energy efficiency and computational capabilities of AI systems.

Conclusion

This paper proposes a novel photonic integrated circuit design based on a symmetric microring resonator (MRR) crossbar array that enables the acceleration of both the inference and training phases of deep learning. The researchers demonstrate the use of this photonic architecture for simple neural network inference and training tasks, achieving promising results.

By addressing the challenge of implementing backpropagation on a photonic chip, this work contributes to the ongoing efforts to develop compact, energy-efficient photonic accelerators for deep learning. Further scaling and practical validation of the on-chip training capabilities will be crucial for realizing the full potential of this approach and advancing the field of photonic computing for artificial intelligence.



Related Papers


A symmetric silicon microring resonator optical crossbar array for accelerated inference and training in deep learning

Rui Tang, Shuhei Ohno, Ken Tanizawa, Kazuhiro Ikeda, Makoto Okano, Kasidit Toprasertpong, Shinichi Takagi, Mitsuru Takenaka

Photonic integrated circuits are emerging as a promising platform for accelerating matrix multiplications in deep learning, leveraging the inherent parallel nature of light. Although various schemes have been proposed and demonstrated to realize such photonic matrix accelerators, the in-situ training of artificial neural networks using photonic accelerators remains challenging due to the difficulty of direct on-chip backpropagation on a photonic chip. In this work, we propose a silicon microring resonator (MRR) optical crossbar array with a symmetric structure that allows for simple on-chip backpropagation, potentially enabling the acceleration of both the inference and training phases of deep learning. We demonstrate a $4 \times 4$ circuit on a Si-on-insulator (SOI) platform and use it to perform inference tasks of a simple neural network for classifying Iris flowers, achieving a classification accuracy of 93.3%. Subsequently, we train the neural network using simulated on-chip backpropagation and achieve an accuracy of 91.1% in the same inference task after training. Furthermore, we simulate a convolutional neural network (CNN) for handwritten digit recognition, using a $9 \times 9$ MRR crossbar array to perform the convolution operations. This work contributes to the realization of compact and energy-efficient photonic accelerators for deep learning.

6/4/2024

Mirage: An RNS-Based Photonic Accelerator for DNN Training

Cansu Demirkiran, Guowei Yang, Darius Bunandar, Ajay Joshi

Photonic computing is a compelling avenue for performing highly efficient matrix multiplication, a crucial operation in Deep Neural Networks (DNNs). While this method has shown great success in DNN inference, meeting the high precision demands of DNN training proves challenging due to the precision limitations imposed by costly data converters and the analog noise inherent in photonic hardware. This paper proposes Mirage, a photonic DNN training accelerator that overcomes the precision challenges in photonic hardware using the Residue Number System (RNS). RNS is a numeral system based on modular arithmetic, allowing us to perform high-precision operations via multiple low-precision modular operations. In this work, we present a novel micro-architecture and dataflow for an RNS-based photonic tensor core performing modular arithmetic in the analog domain. By combining RNS and photonics, Mirage provides high energy efficiency without compromising precision and can successfully train state-of-the-art DNNs achieving accuracy comparable to FP32 training. Our study shows that on average across several DNNs when compared to systolic arrays, Mirage achieves more than $23.8\times$ faster training and $32.1\times$ lower EDP in an iso-energy scenario and consumes $42.8\times$ lower power with comparable or better EDP in an iso-area scenario.

5/27/2024


Photonic Neuromorphic Accelerator for Convolutional Neural Networks based on an Integrated Reconfigurable Mesh

Aris Tsirigotis, Gerge Sarantoglou, Stavros Deligiannidis, Erica Sanchez, Ana Gutierrez, Adonis Bogris, Jose Capmany, Charis Mesaritakis

In this work, we present and experimentally validate a passive photonic-integrated neuromorphic accelerator that uses a hardware-friendly optical spectrum slicing technique through a reconfigurable silicon photonic mesh. The proposed scheme acts as an analogue convolutional engine, enabling information preprocessing in the optical domain, dimensionality reduction and extraction of spatio-temporal features. Numerical results demonstrate that utilizing only 7 passive photonic nodes, critical modules of a digital convolutional neural network can be replaced. As a result, a 98.6% accuracy on the MNIST dataset was achieved, with a power consumption reduction of at least 26% compared to digital CNNs. Experimental results confirm these findings, achieving 97.7% accuracy with only 3 passive nodes.

5/13/2024

Memory Capacity Analysis of Time-delay Reservoir Computing Based on Silicon Microring Resonator Nonlinearities

Bernard J. Giron Castro, Christophe Peucheret, Francesco Da Ros

Silicon microring resonators (MRRs) have shown strong potential in acting as the nonlinear nodes of photonic reservoir computing (RC) schemes. By using nonlinearities within a silicon MRR, such as the ones caused by free-carrier dispersion (FCD) and thermo-optic (TO) effects, it is possible to map the input data of the RC to a higher dimensional space. Furthermore, by adding an external waveguide between the through and add ports of the MRR, it is possible to implement a time-delay RC (TDRC) with enhanced memory. The input from the through port is fed back into the add port of the ring with the delay applied by the external waveguide effectively adding memory. In a TDRC, the nodes are multiplexed in time, and their respective time evolutions are detected at the drop port. The performance of MRR-based TDRC is highly dependent on the amount of nonlinearity in the MRR. The nonlinear effects, in turn, are dependent on the physical properties of the MRR as they determine the lifetime of the effects. Another factor to take into account is the stability of the MRR response, as strong time-domain discontinuities at the drop port are known to emerge from FCD nonlinearities due to self-pulsing (high nonlinear behaviour). However, quantifying the right amount of nonlinearity that RC needs for a certain task in order to achieve optimum performance is challenging. Therefore, further analysis is required to fully understand the nonlinear dynamics of this TDRC setup. Here, we quantify the nonlinear and linear memory capacity of the previously described microring-based TDRC scheme, as a function of the time constants of the generated carriers and the thermal of the TO effects. We analyze the properties of the TDRC dynamics that generate the parameter space, in terms of input signal power and frequency detuning range, over which conventional RC tasks can be satisfactorily performed by the TDRC scheme.

6/5/2024