SCATTER: Algorithm-Circuit Co-Sparse Photonic Accelerator with Thermal-Tolerant, Power-Efficient In-situ Light Redistribution

Read original: arXiv:2407.05510 - Published 7/9/2024 by Ziang Yin, Nicholas Gangi, Meng Zhang, Jeff Zhang, Rena Huang, Jiaqi Gu

SCATTER: Algorithm-Circuit Co-Sparse Photonic Accelerator with Thermal-Tolerant, Power-Efficient In-situ Light Redistribution

Overview

• The paper "DarkLight: Dynamic Sparse Topology-Enabled Energy-Efficient, Robust Photonic Accelerator via in-situ Light Redistribution" presents a new photonic accelerator design that aims to be energy-efficient and robust.

• The key ideas include a dynamic sparse topology that can reconfigure the connections between photonic components to optimize energy usage, and an in-situ light redistribution mechanism to handle hardware variations and faults.

Plain English Explanation

The researchers have developed a new type of photonic chip that can be used to speed up certain types of computations, like those used in machine learning. Photonic chips use light instead of electricity to process information, which can be faster and more energy-efficient than traditional electronic chips.

The main innovation in this paper is a system that can dynamically adjust the connections between the different components on the photonic chip. This allows the chip to adapt its architecture to the specific computation being performed, using only the components that are needed and avoiding waste. The system also has a way to automatically redistribute the light signals across the chip to compensate for any variations or defects in the hardware, making the system more robust and reliable.

These features help make the photonic accelerator more energy-efficient and resilient compared to previous designs. This could be useful for applications that require a lot of compute power, like Scaling Analog Photonic Accelerators to Byte-Size Integer, while also being able to handle hardware issues that can occur over time, like Doctor: Dynamic Chip Temporal Variation Remediation Toward efficient and reliable operation.

Technical Explanation

The key innovation in "DarkLight" is a dynamic sparse topology that can reconfigure the connections between photonic components on the chip. This allows the system to only use the components that are needed for a given computation, reducing energy consumption.

The system also includes an in-situ light redistribution mechanism that can detect and compensate for variations or defects in the hardware, such as Photonic Neuromorphic Accelerator for Convolutional Neural Networks-Based waveguide imperfections or component misalignments. This helps maintain the performance and reliability of the system over time, unlike static photonic architectures that can degrade due to Architecture-Level Modeling of Photonic Deep Neural Network hardware variations.

The researchers evaluate their design using simulations and demonstrate significant improvements in energy efficiency and robustness compared to previous photonic accelerator architectures, including FLAASH: Flexible Accelerator Architecture for Sparse High-Order computations.

Critical Analysis

The paper provides a comprehensive evaluation of the DarkLight architecture, including comparisons to prior work and analysis of the energy savings and fault tolerance capabilities. However, the authors do not discuss any potential limitations or challenges that may arise in the practical implementation of this system.

For example, the dynamic reconfiguration of the photonic topology may introduce additional complexity and overhead that could offset some of the energy benefits. Additionally, the in-situ light redistribution mechanism relies on the ability to accurately detect and characterize hardware variations, which may be difficult to achieve in a real-world setting.

Further research and experimentation would be needed to fully understand the tradeoffs and practical challenges of deploying a system like DarkLight in a production environment. Nonetheless, the core ideas presented in this paper represent an important advancement in the field of energy-efficient and resilient photonic computing.

Conclusion

The "DarkLight" paper introduces a new photonic accelerator design that aims to be both energy-efficient and robust to hardware variations and faults. The key innovations are a dynamic sparse topology that can reconfigure the connections between components to optimize energy usage, and an in-situ light redistribution mechanism to maintain performance in the face of hardware issues.

These features could make photonic accelerators more practical and widely deployable for applications that require high-performance and energy-efficient computing, such as Scaling Analog Photonic Accelerators to Byte-Size Integer, Doctor: Dynamic Chip Temporal Variation Remediation Toward efficient operation, and Photonic Neuromorphic Accelerator for Convolutional Neural Networks-Based applications. While the paper provides a thorough technical evaluation, further research is needed to understand the practical challenges and tradeoffs of implementing a system like DarkLight in the real world.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

SCATTER: Algorithm-Circuit Co-Sparse Photonic Accelerator with Thermal-Tolerant, Power-Efficient In-situ Light Redistribution

Ziang Yin, Nicholas Gangi, Meng Zhang, Jeff Zhang, Rena Huang, Jiaqi Gu

Photonic computing has emerged as a promising solution for accelerating computation-intensive artificial intelligence (AI) workloads. However, limited reconfigurability, high electrical-optical conversion cost, and thermal sensitivity limit the deployment of current optical analog computing engines to support power-restricted, performance-sensitive AI workloads at scale. Sparsity provides a great opportunity for hardware-efficient AI accelerators. However, current dense photonic accelerators fail to fully exploit the power-saving potential of algorithmic sparsity. It requires sparsity-aware hardware specialization with a fundamental re-design of photonic tensor core topology and cross-layer device-circuit-architecture-algorithm co-optimization aware of hardware non-ideality and power bottleneck. To trim down the redundant power consumption while maximizing robustness to thermal variations, we propose SCATTER, a novel algorithm-circuit co-sparse photonic accelerator featuring dynamically reconfigurable signal path via thermal-tolerant, power-efficient in-situ light redistribution and power gating. A power-optimized, crosstalk-aware dynamic sparse training framework is introduced to explore row-column structured sparsity and ensure marginal accuracy loss and maximum power efficiency. The extensive evaluation shows that our cross-stacked optimized accelerator SCATTER achieves a 511X area reduction and 12.4X power saving with superior crosstalk tolerance that enables unprecedented circuit layout compactness and on-chip power efficiency.

7/9/2024

DOCTOR: Dynamic On-Chip Temporal Variation Remediation Toward Self-Corrected Photonic Tensor Accelerators

Haotian Lu, Sanmitra Banerjee, Jiaqi Gu

Photonic computing has emerged as a promising solution for accelerating computation-intensive artificial intelligence (AI) workloads, offering unparalleled speed and energy efficiency, especially in resource-limited, latency-sensitive edge computing environments. However, the deployment of analog photonic tensor accelerators encounters reliability challenges due to hardware noise and environmental variations. While off-chip noise-aware training and on-chip training have been proposed to enhance the variation tolerance of optical neural accelerators with moderate, static noise, we observe a notable performance degradation over time due to temporally drifting variations, which requires a real-time, in-situ calibration mechanism. To tackle this challenging reliability issues, for the first time, we propose a lightweight dynamic on-chip remediation framework, dubbed DOCTOR, providing adaptive, in-situ accuracy recovery against temporally drifting noise. The DOCTOR framework intelligently monitors the chip status using adaptive probing and performs fast in-situ training-free calibration to restore accuracy when necessary. Recognizing nonuniform spatial variation distributions across devices and tensor cores, we also propose a variation-aware architectural remapping strategy to avoid executing critical tasks on noisy devices. Extensive experiments show that our proposed framework can guarantee sustained performance under drifting variations with 34% higher accuracy and 2-3 orders-of-magnitude lower overhead compared to state-of-the-art on-chip training methods. Our code is open-sourced at https://github.com/ScopeX-ASU/DOCTOR.

6/4/2024

Scaling Analog Photonic Accelerators for Byte-Size, Integer General Matrix Multiply (GEMM) Kernels

Oluwaseun Adewunmi Alo, Sairam Sri Vatsavai, Ishan Thakkar

Deep Neural Networks (DNNs) predominantly rely on General Matrix Multiply (GEMM) kernels, which are often accelerated using specialized hardware architectures. Recently, analog photonic GEMM accelerators have emerged as a promising alternative, offering vastly superior speed and energy efficiency compared to traditional electronic accelerators. However, these photonic cannot support wider than 4-bit integer operands due to their inherent trade-offs between analog dynamic range and parallelism. This is often inadequate for DNN training as at least 8-bit wide operands are deemed necessary to prevent significant accuracy drops. To address these limitations, we introduce a scalable photonic GEMM accelerator named SPOGA. SPOGA utilizes enhanced features such as analog summation of homodyne optical signals and in-transduction positional weighting of operands. By employing an extended optical-analog dataflow that minimizes overheads associated with bit-sliced integer arithmetic, SPOGA supports byte-size integer GEMM kernels, achieving significant improvements in throughput, latency, and energy efficiency. Specifically, SPOGA demonstrates up to 14.4$times$, 2$times$, and 28.5$times$ improvements in frames-per-second (FPS), FPS/Watt, and FPS/Watt/mm$^2$ respectively, compared to existing state-of-the-art photonic solutions.

7/9/2024

🧠

Photonic Neuromorphic Accelerator for Convolutional Neural Networks based on an Integrated Reconfigurable Mesh

Aris Tsirigotis, Gerge Sarantoglou, Stavros Deligiannidis, Erica Sanchez, Ana Gutierrez, Adonis Bogris, Jose Capmany, Charis Mesaritakis

In this work, we present and experimentally validate a passive photonic-integrated neuromorphic accelerator that uses a hardware-friendly optical spectrum slicing technique through a reconfigurable silicon photonic mesh. The proposed scheme acts as an analogue convolutional engine, enabling information preprocessing in the optical domain, dimensionality reduction and extraction of spatio-temporal features. Numerical results demonstrate that utilizing only 7 passive photonic nodes, critical modules of a digital convolutional neural network can be replaced. As a result, a 98.6% accuracy on the MNIST dataset was achieved, with a power consumption reduction of at least 26% compared to digital CNNs. Experimental results confirm these findings, achieving 97.7% accuracy with only 3 passive nodes.

5/13/2024