RF-Photonic Deep Learning Processor with Shannon-Limited Data Movement

2207.06883

Published 6/10/2024 by Ronald Davis III, Zaijun Chen, Ryan Hamerly, Dirk Englund

🤿

Abstract

Edholm's Law predicts exponential growth in data rate and spectrum bandwidth for communications and is forecasted to remain true for the upcoming deployment of 6G. Compounding this issue is the exponentially increasing demand for deep neural network (DNN) compute, including DNNs for signal processing. However, the slowing of Moore's Law due to the limitations of transistor-based electronics means that completely new paradigms for computing will be required to meet these increasing demands for advanced communications. Optical neural networks (ONNs) are promising DNN accelerators with ultra-low latency and energy consumption. Yet state-of-the-art ONNs struggle with scalability and implementing linear with in-line nonlinear operations. Here we introduce our multiplicative analog frequency transform ONN (MAFT-ONN) that encodes the data in the frequency domain, achieves matrix-vector products in a single shot using photoelectric multiplication, and uses a single electro-optic modulator for the nonlinear activation of all neurons in each layer. We experimentally demonstrate the first hardware accelerator that computes fully-analog deep learning on raw RF signals, performing single-shot modulation classification with 85% accuracy, where a 'majority vote' multi-measurement scheme can boost the accuracy to 95% within 5 consecutive measurements. In addition, we demonstrate frequency-domain finite impulse response (FIR) linear-time-invariant (LTI) operations, enabling a powerful combination of traditional and AI signal processing. We also demonstrate the scalability of our architecture by computing nearly 4 million fully-analog multiplies-and-accumulates for MNIST digit classification. Our latency estimation model shows that due to the Shannon capacity-limited analog data movement, MAFT-ONN is hundreds of times faster than traditional RF receivers operating at their theoretical peak performance.

Create account to get full access

Overview

Exponential growth in data rate and spectrum bandwidth for communications is predicted by Edholm's Law, which is expected to continue with the deployment of 6G.
Compounding this issue is the exponentially increasing demand for deep neural network (DNN) compute, including DNNs for signal processing.
The slowing of Moore's Law due to transistor-based electronics limitations means new computing paradigms will be required to meet these demands.
Optical neural networks (ONNs) are promising DNN accelerators with ultra-low latency and energy consumption, but struggle with scalability and implementing linear with in-line nonlinear operations.

Plain English Explanation

As our demand for faster and more efficient communications grows, the data and processing power required is increasing exponentially. The traditional transistor-based electronics that power our current computing and communications systems are reaching their limits, and new approaches are needed.

One promising solution is optical neural networks (ONNs), which use light instead of electricity to process information. ONNs have the potential to be much faster and more energy-efficient than traditional electronics, but they've faced challenges in scaling up and combining the linear and nonlinear operations needed for advanced signal processing and machine learning tasks.

The researchers in this paper have developed a new type of ONN called the "multiplicative analog frequency transform ONN" (MAFT-ONN) that addresses these challenges. By encoding data in the frequency domain and using a novel approach to performing matrix-vector multiplication and nonlinear activation, the MAFT-ONN can perform deep learning and signal processing tasks on raw RF signals with very low latency and high accuracy.

Technical Explanation

The key innovation in the MAFT-ONN is its ability to encode data in the frequency domain and perform matrix-vector multiplication and nonlinear activation in a single optical step using photoelectric multiplication. This allows the network to compute fully-analog deep learning models on raw RF signals, without the need for digital-to-analog conversion or other preprocessing steps.

The researchers experimentally demonstrated the MAFT-ONN performing single-shot modulation classification with 85% accuracy, which could be boosted to 95% accuracy within 5 consecutive measurements using a majority vote scheme. They also showed the MAFT-ONN could perform frequency-domain finite impulse response (FIR) linear-time-invariant (LTI) operations, combining traditional and AI-based signal processing.

Additionally, the researchers demonstrated the scalability of the MAFT-ONN architecture by computing nearly 4 million fully-analog multiplies-and-accumulates for MNIST digit classification. They also provided a latency estimation model showing that due to the Shannon capacity-limited analog data movement, the MAFT-ONN is hundreds of times faster than traditional RF receivers operating at their theoretical peak performance.

Critical Analysis

While the MAFT-ONN shows impressive results in terms of latency, accuracy, and scalability, the researchers acknowledge that there are still some limitations to the approach. For example, the current implementation relies on a single electro-optic modulator for the nonlinear activation, which could limit the network's depth and complexity.

Additionally, the researchers do not address the potential challenges of manufacturing and integrating the MAFT-ONN in real-world systems, such as the need for precise optical alignment, thermal management, and robustness to environmental factors.

Further research would also be needed to explore the MAFT-ONN's performance on more complex machine learning tasks, as the experiments in this paper focused on relatively simple classification and signal processing problems.

Conclusion

The MAFT-ONN represents a promising new approach to optical neural networks that could help address the growing demands for high-speed, low-latency communications and signal processing. By encoding data in the frequency domain and using novel optical computing techniques, the MAFT-ONN can perform deep learning and traditional signal processing tasks on raw RF signals with exceptional efficiency and speed.

While there are still some limitations and challenges to overcome, the research presented in this paper highlights the potential of optical computing to revolutionize the field of advanced communications and pave the way for future 6G and beyond technologies.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

🤿

Deep Learning for Low-Latency, Quantum-Ready RF Sensing

Pranav Gokhale, Caitlin Carnahan, William Clark, Frederic T. Chong

Recent work has shown the promise of applying deep learning to enhance software processing of radio frequency (RF) signals. In parallel, hardware developments with quantum RF sensors based on Rydberg atoms are breaking longstanding barriers in frequency range, resolution, and sensitivity. In this paper, we describe our implementations of quantum-ready machine learning approaches for RF signal classification. Our primary objective is latency: while deep learning offers a more powerful computational paradigm, it also traditionally incurs latency overheads that hinder wider scale deployment. Our work spans three axes. (1) A novel continuous wavelet transform (CWT) based recurrent neural network (RNN) architecture that enables flexible online classification of RF signals on-the-fly with reduced sampling time. (2) Low-latency inference techniques for both GPU and CPU that span over 100x reductions in inference time, enabling real-time operation with sub-millisecond inference. (3) Quantum-readiness validated through application of our models to physics-based simulation of Rydberg atom QRF sensors. Altogether, our work bridges towards next-generation RF sensors that use quantum technology to surpass previous physical limits, paired with latency-optimized AI/ML software that is suitable for real-time deployment.

4/30/2024

cs.AI cs.LG cs.PF cs.SY eess.SY

🧠

1-bit Quantized On-chip Hybrid Diffraction Neural Network Enabled by Authentic All-optical Fully-connected Architecture

Yu Shao, Haiqi Gao, Yipeng Chen, Yujie liu, Junren Wen, Haidong He, Yuchuan Shao, Yueguang Zhang, Weidong Shen, Chenying Yang

Optical Diffraction Neural Networks (DNNs), a subset of Optical Neural Networks (ONNs), show promise in mirroring the prowess of electronic networks. This study introduces the Hybrid Diffraction Neural Network (HDNN), a novel architecture that incorporates matrix multiplication into DNNs, synergizing the benefits of conventional ONNs with those of DNNs to surmount the modulation limitations inherent in optical diffraction neural networks. Utilizing a singular phase modulation layer and an amplitude modulation layer, the trained neural network demonstrated remarkable accuracies of 96.39% and 89% in digit recognition tasks in simulation and experiment, respectively. Additionally, we develop the Binning Design (BD) method, which effectively mitigates the constraints imposed by sampling intervals on diffraction units, substantially streamlining experimental procedures. Furthermore, we propose an on-chip HDNN that not only employs a beam-splitting phase modulation layer for enhanced integration level but also significantly relaxes device fabrication requirements, replacing metasurfaces with relief surfaces designed by 1-bit quantization. Besides, we conceptualized an all-optical HDNN-assisted lesion detection network, achieving detection outcomes that were 100% aligned with simulation predictions. This work not only advances the performance of DNNs but also streamlines the path towards industrial optical neural network production.

4/12/2024

cs.ET cs.LG

Deep-Learning-Based Channel Estimation for Distributed MIMO with 1-bit Radio-Over-Fiber Fronthaul

Alireza Bordbar, Lise Aabel, Christian Hager, Christian Fager, Giuseppe Durisi

We consider the problem of pilot-aided, uplink channel estimation in a distributed massive multiple-input multiple-output (MIMO) architecture, in which the access points are connected to a central processing unit via fiber-optical fronthaul links, carrying a two-level-quantized version of the received analog radio-frequency signal. We adapt to this architecture the deep-learning-based channel-estimation algorithm recently proposed by Nguyen et al. (2023), and explore its robustness to the additional signal distortions (beyond 1-bit quantization) introduced in the considered architecture by the automatic gain controllers (AGCs) and by the comparators. These components are used at the access points to generate the two-level analog waveform from the received signal. Via simulation results, we illustrate that the proposed channel-estimation method outperforms significantly the Bussgang linear minimum mean-square error channel estimator, and it is robust against the additional impairments introduced by the AGCs and the comparators.

6/18/2024

eess.SP cs.LG

🤿

Deep Neural Operator Driven Real Time Inference for Nuclear Systems to Enable Digital Twin Solutions

Kazuma Kobayashi, Syed Bahauddin Alam

This paper focuses on the feasibility of Deep Neural Operator (DeepONet) as a robust surrogate modeling method within the context of digital twin (DT) for nuclear energy systems. Through benchmarking and evaluation, this study showcases the generalizability and computational efficiency of DeepONet in solving a challenging particle transport problem. DeepONet also exhibits remarkable prediction accuracy and speed, outperforming traditional ML methods, making it a suitable algorithm for real-time DT inference. However, the application of DeepONet also reveals challenges related to optimal sensor placement and model evaluation, critical aspects of real-world implementation. Addressing these challenges will further enhance the method's practicality and reliability. Overall, DeepONet presents a promising and transformative nuclear engineering research and applications tool. Its accurate prediction and computational efficiency capabilities can revolutionize DT systems, advancing nuclear engineering research. This study marks an important step towards harnessing the power of surrogate modeling techniques in critical engineering domains.

4/30/2024

stat.ML cs.LG