Contingency-constrained economic dispatch with safe reinforcement learning

Read original: arXiv:2205.06212 - Published 7/17/2024 by Michael Eichelbeck, Hannah Markgraf, Matthias Althoff

🏅

Overview

Future power systems will rely heavily on microgrids with a high share of decentralized renewable energy sources and energy storage systems.
The complexity and uncertainty in this context may make traditional power dispatch strategies impractical.
Reinforcement learning (RL) controllers can address this challenge, but cannot themselves provide safety guarantees, preventing their deployment in practice.
The researchers propose a formally validated RL controller for economic dispatch that addresses this limitation.

Plain English Explanation

The paper discusses how future power grids will increasingly rely on smaller, localized power systems called microgrids that use a lot of decentralized renewable energy sources like solar and wind, as well as energy storage. This creates a very complex and uncertain environment, making traditional ways of managing the power grid infeasible.

The researchers say reinforcement learning-based controllers could be a solution, as they can adapt to these dynamic conditions. However, these RL controllers can't guarantee safety, which prevents them from being used in real power systems.

To overcome this, the researchers developed an RL controller that has been formally verified to be safe. They added a special time-dependent constraint to the controller that accounts for the possibility of the microgrid becoming isolated (or "islanded") from the main grid. This constraint is calculated using a mathematical technique called set-based backwards reachability analysis. The RL agent's actions are then checked against this safety constraint, and any unsafe actions are adjusted to be safe before being implemented.

The researchers tested this approach on a residential microgrid case study using real-world data, demonstrating how it can provide safe and economical power dispatch in these complex future grid scenarios.

Technical Explanation

The researchers propose a formally validated reinforcement learning-based economic dispatch controller for microgrids with high penetration of decentralized renewable energy sources and energy storage.

They extend the conventional dispatch constraints with a time-dependent constraint that encodes the islanding contingency. This contingency constraint is computed using set-based backwards reachability analysis, which allows the controller to anticipate and avoid unsafe states.

The RL agent's actions are then verified through a safety layer that projects any unsafe actions into the safe action space. This leverages constrained zonotope set representations for computational efficiency.

The developed approach is demonstrated on a residential microgrid use case using real-world measurement data. The results show that the formally validated RL controller can provide safe and economically optimal power dispatch in these complex future grid scenarios.

Critical Analysis

The paper provides a promising solution to the challenge of enabling safe RL-based control in future power systems with high penetration of renewables and distributed energy resources. By incorporating formal verification techniques, the researchers are able to address the key limitation of standard RL controllers - the inability to provide safety guarantees.

However, the paper does not extensively discuss the potential computational complexity and scalability challenges of the proposed approach, especially as grid size and the number of distributed energy resources increase. There may also be practical implementation challenges in terms of obtaining accurate models and measurements required for the set-based reachability analysis.

Additionally, the paper focuses on economic dispatch optimization, but does not address other important aspects of microgrid control like voltage and frequency regulation, which would need to be considered for real-world deployment. Further research may be needed to extend this framework to handle a more comprehensive set of control objectives and constraints.

Overall, the work represents an important step towards enabling safe RL-based control in future power systems, but additional research and development will be required to fully realize this vision.

Conclusion

This paper presents a formally validated reinforcement learning-based controller for economic dispatch in microgrids with high penetration of decentralized renewable energy sources and energy storage. By incorporating a time-dependent constraint that encodes islanding contingencies, computed through set-based backwards reachability analysis, the approach is able to provide safety guarantees lacking in standard RL controllers.

The demonstrated residential microgrid case study shows the potential of this approach to enable safe and economically optimal power dispatch in complex future grid scenarios. While further research is needed to address scalability and broader control objectives, this work represents a significant advance towards the practical application of RL techniques in mission-critical power system applications.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🏅

Contingency-constrained economic dispatch with safe reinforcement learning

Michael Eichelbeck, Hannah Markgraf, Matthias Althoff

Future power systems will rely heavily on micro grids with a high share of decentralised renewable energy sources and energy storage systems. The high complexity and uncertainty in this context might make conventional power dispatch strategies infeasible. Reinforcement-learning based (RL) controllers can address this challenge, however, cannot themselves provide safety guarantees, preventing their deployment in practice. To overcome this limitation, we propose a formally validated RL controller for economic dispatch. We extend conventional constraints by a time-dependent constraint encoding the islanding contingency. The contingency constraint is computed using set-based backwards reachability analysis and actions of the RL agent are verified through a safety layer. Unsafe actions are projected into the safe action space while leveraging constrained zonotope set representations for computational efficiency. The developed approach is demonstrated on a residential use case using real-world measurements.

7/17/2024

🏅

Control Policy Correction Framework for Reinforcement Learning-based Energy Arbitrage Strategies

Seyed Soroush Karimi Madahi, Gargya Gokhale, Marie-Sophie Verwee, Bert Claessens, Chris Develder

A continuous rise in the penetration of renewable energy sources, along with the use of the single imbalance pricing, provides a new opportunity for balance responsible parties to reduce their cost through energy arbitrage in the imbalance settlement mechanism. Model-free reinforcement learning (RL) methods are an appropriate choice for solving the energy arbitrage problem due to their outstanding performance in solving complex stochastic sequential problems. However, RL is rarely deployed in real-world applications since its learned policy does not necessarily guarantee safety during the execution phase. In this paper, we propose a new RL-based control framework for batteries to obtain a safe energy arbitrage strategy in the imbalance settlement mechanism. In our proposed control framework, the agent initially aims to optimize the arbitrage revenue. Subsequently, in the post-processing step, we correct (constrain) the learned policy following a knowledge distillation process based on properties that follow human intuition. Our post-processing step is a generic method and is not restricted to the energy arbitrage domain. We use the Belgian imbalance price of 2023 to evaluate the performance of our proposed framework. Furthermore, we deploy our proposed control framework on a real battery to show its capability in the real world.

5/1/2024

🏅

End-to-End Reinforcement Learning of Curative Curtailment with Partial Measurement Availability

Hinrikus Wolf, Luis Bottcher, Sarra Bouchkati, Philipp Lutat, Jens Breitung, Bastian Jung, Tina Mollemann, Viktor Todosijevi'c, Jan Schiefelbein-Lach, Oliver Pohl, Andreas Ulbig, Martin Grohe

In the course of the energy transition, the expansion of generation and consumption will change, and many of these technologies, such as PV systems, electric cars and heat pumps, will influence the power flow, especially in the distribution grids. Scalable methods that can make decisions for each grid connection are needed to enable congestion-free grid operation in the distribution grids. This paper presents a novel end-to-end approach to resolving congestion in distribution grids with deep reinforcement learning. Our architecture learns to curtail power and set appropriate reactive power to determine a non-congested and, thus, feasible grid state. State-of-the-art methods such as the optimal power flow (OPF) demand high computational costs and detailed measurements of every bus in a grid. In contrast, the presented method enables decisions under sparse information with just some buses observable in the grid. Distribution grids are generally not yet fully digitized and observable, so this method can be used for decision-making on the majority of low-voltage grids. On a real low-voltage grid the approach resolves 100% of violations in the voltage band and 98.8% of asset overloads. The results show that decisions can also be made on real grids that guarantee sufficient quality for congestion-free grid operation.

6/21/2024

✅

Real-world validation of safe reinforcement learning, model predictive control and decision tree-based home energy management systems

Julian Ruddick, Glenn Ceusters, Gilles Van Kriekinge, Evgenii Genov, Thierry Coosemans, Maarten Messagie

Recent advancements in machine learning based energy management approaches, specifically reinforcement learning with a safety layer (OptLayerPolicy) and a metaheuristic algorithm generating a decision tree control policy (TreeC), have shown promise. However, their effectiveness has only been demonstrated in computer simulations. This paper presents the real-world validation of these methods, comparing against model predictive control and simple rule-based control benchmark. The experiments were conducted on the electrical installation of 4 reproductions of residential houses, which all have their own battery, photovoltaic and dynamic load system emulating a non-controllable electrical load and a controllable electric vehicle charger. The results show that the simple rules, TreeC, and model predictive control-based methods achieved similar costs, with a difference of only 0.6%. The reinforcement learning based method, still in its training phase, obtained a cost 25.5% higher to the other methods. Additional simulations show that the costs can be further reduced by using a more representative training dataset for TreeC and addressing errors in the model predictive control implementation caused by its reliance on accurate data from various sources. The OptLayerPolicy safety layer allows safe online training of a reinforcement learning agent in the real-world, given an accurate constraint function formulation. The proposed safety layer method remains error-prone, nonetheless, it is found beneficial for all investigated methods. The TreeC method, which does require building a realistic simulation for training, exhibits the safest operational performance, exceeding the grid limit by only 27.1 Wh compared to 593.9 Wh for reinforcement learning.

8/15/2024