Revisiting DNN Training for Intermittently Powered Energy Harvesting Micro Computers

Read original: arXiv:2408.13696 - Published 8/27/2024 by Cyan Subhra Mishra, Deeksha Chaudhary, Jack Sampson, Mahmut Taylan Kandemir, Chita Das

Overview

Revisits how deep neural networks (DNNs) are trained for micro-computers powered by intermittent energy harvesting.
Examines the challenges of training DNNs in this resource-constrained environment and the techniques proposed to address them.
Offers strategies for making DNN training effective on energy-harvesting micro-computers.

Plain English Explanation

The paper discusses the challenges of training deep neural networks (DNNs) on micro-computers that rely on intermittent energy harvesting, such as those powered by solar or vibration energy. These micro-computers have limited resources, including processing power, memory, and energy, which can make it difficult to effectively train complex machine learning models like DNNs.

The researchers explore several strategies to enable efficient DNN training in this resource-constrained environment. These include optimizing the training process to reduce its compute and memory requirements, adapting the DNN architecture to better fit the capabilities of the micro-computer, and using intermittent execution to make the most of the available energy.

By addressing these challenges, the researchers aim to make it more feasible to deploy powerful AI models on small, energy-harvesting devices, opening up new possibilities for on-device machine learning in a variety of applications.

Technical Explanation

The paper begins by outlining the challenges of training DNNs on intermittently-powered, energy-harvesting micro-computers. These devices have limited computational resources and unpredictable power availability, which can disrupt the training process and lead to poor model performance.

To address these challenges, the researchers propose several techniques to optimize the DNN training process. These include reducing the memory footprint of training and improving its energy efficiency through low-precision computation and intermittent execution.
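
The low-precision idea can be illustrated with symmetric 8-bit quantization. This is a minimal sketch and not necessarily the scheme used in the paper: the function names quantize_int8 and dequantize and the choice of a single per-tensor scale are assumptions made for illustration.

```python
def quantize_int8(values):
    """Symmetric per-tensor quantization of floats to int8 codes (hypothetical helper)."""
    max_abs = max((abs(v) for v in values), default=0.0)
    # An all-zero tensor has no range, so any positive scale works.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest code and clamp into the symmetric int8 range.
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes, scale):
    """Map int8 codes back to approximate float values."""
    return [c * scale for c in codes]


weights = [0.5, -1.0, 0.25]
codes, scale = quantize_int8(weights)
restored = dequantize(codes, scale)
```

Each weight is stored in one byte instead of four, and the reconstruction error per weight is at most half of the scale. Whether that error is acceptable for a given model is exactly the accuracy trade-off that has to be measured case by case.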

The paper also explores adapting the DNN architecture to better fit the constraints of the micro-computer. This includes designing more compact and efficient DNN models and incorporating energy-awareness into the model design to improve performance under intermittent power conditions.
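
The paper's abstract describes a dynamic dropout technique whose rate follows the predicted energy availability. The sketch below shows one way such a mapping could look. The linear mapping, the function names dropout_rate and dropout_mask, and the parameters min_rate and max_rate are all assumptions for illustration; the paper's device model is not reproduced here.

```python
import random


def dropout_rate(predicted_energy, energy_for_full_step, min_rate=0.1, max_rate=0.8):
    """Map a predicted energy budget to a dropout rate (assumed linear mapping).

    Less available energy gives a higher rate, so fewer units are computed.
    """
    if energy_for_full_step <= 0:
        raise ValueError("energy_for_full_step must be positive")
    if not 0.0 <= min_rate <= max_rate < 1.0:
        raise ValueError("need 0 <= min_rate <= max_rate < 1")
    # Fraction of a full training step that the budget can pay for, clamped to [0, 1].
    fraction = max(0.0, min(1.0, predicted_energy / energy_for_full_step))
    return max_rate - fraction * (max_rate - min_rate)


def dropout_mask(num_units, rate, rng):
    """Inverted-dropout mask: kept units are scaled by 1 / (1 - rate), dropped units are 0."""
    keep = 1.0 - rate
    return [1.0 / keep if rng.random() < keep else 0.0 for _ in range(num_units)]


rng = random.Random(0)
rate = dropout_rate(predicted_energy=5.0, energy_for_full_step=10.0)
mask = dropout_mask(8, rate, rng)
```

Units whose mask entry is zero can be skipped in both the forward and backward pass, which is where the energy saving would come from in a scheme like this.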

Finally, the researchers investigate the use of intermittent execution during the training process, optimizing the checkpoint and recovery mechanisms to enable the training to seamlessly continue despite power interruptions.
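
Checkpoint and recovery can be sketched as a training loop that persists its state after every step and resumes from the last saved state after a power failure. This is a toy illustration under stated assumptions: a JSON file stands in for the device's non-volatile memory, the fail_at parameter simulates a power loss, and the names save_checkpoint, load_checkpoint, and train are hypothetical.

```python
import json
import os


def save_checkpoint(path, state):
    """Write the state to a temporary file, then rename it over the old checkpoint.

    If power is lost during the write, the previous checkpoint is still intact.
    """
    tmp = path + ".tmp"
    with open(tmp, "w") as f:
        json.dump(state, f)
        f.flush()
        os.fsync(f.fileno())
    os.replace(tmp, path)  # the rename is atomic on POSIX systems


def load_checkpoint(path):
    """Return the last saved state, or a fresh state if no checkpoint exists."""
    if not os.path.exists(path):
        return {"step": 0, "weight": 0.0}
    with open(path) as f:
        return json.load(f)


def train(path, total_steps, fail_at=None):
    """Toy training loop; fail_at simulates losing power before that step runs."""
    state = load_checkpoint(path)
    while state["step"] < total_steps:
        if fail_at is not None and state["step"] == fail_at:
            return state  # power lost: only the checkpointed state survives
        state["weight"] += 0.5  # stand-in for a real gradient update
        state["step"] += 1
        save_checkpoint(path, state)
    return state
```

Calling train with fail_at=4 stops after four completed steps, and a second call without fail_at picks up at step 4 instead of starting over. Checkpointing after every step minimizes lost work but maximizes write overhead, so a real system would tune the checkpoint interval against its energy budget.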

Critical Analysis

The paper provides a comprehensive and thoughtful approach to the challenge of training DNNs on intermittently-powered, energy-harvesting micro-computers. The researchers have identified key bottlenecks and constraints, and have proposed a range of techniques to address them.

One potential limitation of the research is the reliance on simulations and emulations to evaluate the proposed techniques. While this allows for controlled experimentation, it may not fully capture the complexities and nuances of real-world deployment scenarios. Further validation on physical hardware platforms would strengthen the findings.

Additionally, the paper does not delve deeply into the trade-offs and potential drawbacks of some of the proposed techniques, such as the impact of low-precision computations on model accuracy or the overhead of the checkpoint/recovery mechanisms. A more thorough discussion of these aspects would help readers better understand the practical considerations and limitations of the approaches.

Overall, the paper presents a valuable contribution to the field of energy-constrained machine learning, offering insights and strategies that could enable more widespread deployment of AI models on small, battery-less devices. Further research and real-world validation could build upon this work to unlock the full potential of on-device machine learning in a variety of applications.

Conclusion

This paper tackles the challenging problem of training deep neural networks (DNNs) on intermittently-powered, energy-harvesting micro-computers. By addressing the key constraints of limited resources and unpredictable power availability, the researchers have developed a range of techniques to optimize the DNN training process and enable more efficient deployment of AI models on small, battery-less devices.

The proposed solutions, including memory and energy-aware optimizations, adaptive DNN architectures, and intermittent execution strategies, hold the potential to unlock new opportunities for on-device machine learning in a variety of applications, from environmental sensing to wearable devices. As the research continues to evolve and be validated in real-world settings, these advancements could pave the way for a future where powerful AI models can reliably operate on the edge, even in resource-constrained and energy-harvesting environments.

Related Papers

Revisiting DNN Training for Intermittently Powered Energy Harvesting Micro Computers

Cyan Subhra Mishra, Deeksha Chaudhary, Jack Sampson, Mahmut Taylan Kandemir, Chita Das

The deployment of Deep Neural Networks in energy-constrained environments, such as Energy Harvesting Wireless Sensor Networks, presents unique challenges, primarily due to the intermittent nature of power availability. To address these challenges, this study introduces and evaluates a novel training methodology tailored for DNNs operating within such contexts. In particular, we propose a dynamic dropout technique that adapts to both the architecture of the device and the variability in energy availability inherent in energy harvesting scenarios. Our proposed approach leverages a device model that incorporates specific parameters of the network architecture and the energy harvesting profile to optimize dropout rates dynamically during the training phase. By modulating the network's training process based on predicted energy availability, our method not only conserves energy but also ensures sustained learning and inference capabilities under power constraints. Our preliminary results demonstrate that this strategy provides 6 to 22 percent accuracy improvements compared to the state of the art with less than 5 percent additional compute. This paper details the development of the device model, describes the integration of energy profiles with intermittency aware dropout and quantization algorithms, and presents a comprehensive evaluation of the proposed approach using real-world energy harvesting data.

8/27/2024

Accelerate Intermittent Deep Inference

Ziliang Zhang

Emerging research in edge devices and micro-controller units (MCU) enables on-device computation of Deep Learning Training and Inferencing tasks. More recently, contemporary trends focus on making the Deep Neural Net (DNN) Models runnable on battery-less intermittent devices. One of the approaches is to shrink the DNN models by enabling weight sharing, pruning, and conducted Neural Architecture Search (NAS) with optimized search space to target specific edge devices [Cai2019OnceFA, Lin2020MCUNetTD, Lin2021MCUNetV2MP, Lin2022OnDeviceTU]. Another approach analyzes the intermittent execution and designs the corresponding system by performing NAS that is aware of intermittent execution cycles and resource constraints [iNAS, HW-NAS, iLearn]. However, the optimized NAS was only considering consecutive execution with no power loss, and intermittent execution designs only focused on balancing data reuse and costs related to intermittent inference and often with low accuracy. We proposed Accelerated Intermittent Deep Inference to harness the power of optimized inferencing DNN models specifically targeting SRAM under 256KB and make it schedulable and runnable within intermittent power. Our main contribution is: (1) Schedule tasks performed by on-device inferencing into intermittent execution cycles and optimize for latency; (2) Develop a system that can satisfy the end-to-end latency while achieving a much higher accuracy compared to baseline [iNAS, HW-NAS].

7/23/2024

Decentralized LLM Inference over Edge Networks with Energy Harvesting

Aria Khoshsirat, Giovanni Perin, Michele Rossi

Large language models have significantly transformed multiple fields with their exceptional performance in natural language tasks, but their deployment in resource-constrained environments like edge networks presents an ongoing challenge. Decentralized techniques for inference have emerged, distributing the model blocks among multiple devices to improve flexibility and cost effectiveness. However, energy limitations remain a significant concern for edge devices. We propose a sustainable model for collaborative inference on interconnected, battery-powered edge devices with energy harvesting. A semi-Markov model is developed to describe the states of the devices, considering processing parameters and average green energy arrivals. This informs the design of scheduling algorithms that aim to minimize device downtimes and maximize network throughput. Through empirical evaluations and simulated runs, we validate the effectiveness of our approach, paving the way for energy-efficient decentralized inference over edge networks.

8/29/2024

Memory-efficient Energy-adaptive Inference of Pre-Trained Models on Batteryless Embedded Systems

Pietro Farina, Subrata Biswas, Eren Yıldız, Khakim Akhunov, Saad Ahmed, Bashima Islam, Kasım Sinan Yıldırım

Batteryless systems frequently face power failures, requiring extra runtime buffers to maintain inference progress and leaving only a memory space for storing ultra-tiny deep neural networks (DNNs). Besides, making these models responsive to stochastic energy harvesting dynamics during inference requires a balance between inference accuracy, latency, and energy overhead. Recent works on compression mostly focus on time and memory, but often ignore energy dynamics or significantly reduce the accuracy of pre-trained DNNs. Existing energy-adaptive inference works modify the architecture of pre-trained models and have significant memory overhead. Thus, energy-adaptive and accurate inference of pre-trained DNNs on batteryless devices with extreme memory constraints is more challenging than traditional microcontrollers. We combat these issues by proposing FreeML, a framework to optimize pre-trained DNN models for memory-efficient and energy-adaptive inference on batteryless systems. FreeML comprises (1) a novel compression technique to reduce the model footprint and runtime memory requirements simultaneously, making them executable on extremely memory-constrained batteryless platforms; and (2) the first early exit mechanism that uses a single exit branch for all exit points to terminate inference at any time, making models energy-adaptive with minimal memory overhead. Our experiments showed that FreeML reduces the model sizes by up to 95×, supports adaptive inference with 2.03-19.65× less memory overhead, and provides significant time and energy benefits with only a negligible accuracy drop compared to the state-of-the-art.

5/20/2024