Real-world validation of safe reinforcement learning, model predictive control and decision tree-based home energy management systems

Read original: arXiv:2408.07435 - Published 8/15/2024 by Julian Ruddick, Glenn Ceusters, Gilles Van Kriekinge, Evgenii Genov, Thierry Coosemans, Maarten Messagie


Overview

This paper presents the real-world validation of two machine learning-based energy management approaches: reinforcement learning with a safety layer (OptLayerPolicy) and a metaheuristic algorithm generating a decision tree control policy (TreeC).
These methods were tested and compared against model predictive control and simple rule-based control in experiments on the electrical installations of 4 reproductions of residential houses, each with its own battery, photovoltaic, and dynamic load system.

Plain English Explanation

The paper explores two innovative machine learning-based approaches for managing energy use in homes. The first approach uses reinforcement learning with a safety layer to keep the system within safe operating limits. The second approach uses a metaheuristic algorithm to generate a decision tree-based control policy.

These new methods were tested in real-world experiments on the electrical installations of 4 reproductions of residential houses, each with a battery, solar panels, and a dynamic load system that emulates a non-controllable electrical load and a controllable electric vehicle charger. The performance of the machine learning approaches was compared to traditional model predictive control and simple rule-based control.

The results showed that the simple rules, the decision tree method (TreeC), and model predictive control achieved nearly identical costs, within 0.6% of each other. The reinforcement learning method, which was still in its training phase, cost 25.5% more. Additional simulations suggested that costs could be reduced further by training TreeC on more representative data and by fixing errors in the model predictive control implementation, which depends on accurate data from several sources.

The key advantages of the machine learning approaches are that the safety layer allows safe real-world training, provided the constraints are formulated accurately, and that the decision tree control policy operated most safely, exceeding the grid limit by only 27.1 Wh compared with 593.9 Wh for reinforcement learning. Overall, the paper demonstrates the potential of these new machine learning techniques for improving energy management in residential settings.

Technical Explanation

The paper evaluates the real-world performance of two machine learning-based energy management methods:

OptLayerPolicy: A reinforcement learning approach with a safety layer to ensure the system remains within safe operating limits during the training process.
TreeC: A metaheuristic algorithm that generates a decision tree-based control policy.
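
To make the idea of a safety layer concrete, here is a minimal Python sketch. It is not the authors' OptLayerPolicy implementation: it assumes a single scalar action (battery power), a single constraint (a symmetric grid import/export limit), and a sign convention in which positive battery power means charging. The function and parameter names are hypothetical. Under these assumptions, finding the safe action closest to the one the agent proposed reduces to clipping.

```python
def project_to_safe_action(proposed_battery_kw, load_kw, pv_kw,
                           grid_limit_kw, battery_max_kw):
    """Return the battery power closest to the proposed one that keeps
    grid power within [-grid_limit_kw, +grid_limit_kw].

    Assumed sign convention: positive battery power = charging,
    positive grid power = import from the grid.
    """
    # Grid power if the battery did nothing.
    net_load_kw = load_kw - pv_kw

    # grid = net_load + battery, so the grid limit bounds the battery power.
    lower = max(-battery_max_kw, -grid_limit_kw - net_load_kw)
    upper = min(battery_max_kw, grid_limit_kw - net_load_kw)

    if lower > upper:
        # No battery action can satisfy the grid limit: reduce the
        # violation as much as possible instead.
        return -battery_max_kw if net_load_kw > 0 else battery_max_kw

    # Closest point in [lower, upper] to the proposed action.
    return min(max(proposed_battery_kw, lower), upper)


# The agent wants to charge at 5 kW, but only 2 kW of headroom remains.
print(project_to_safe_action(5.0, load_kw=4.0, pv_kw=0.0,
                             grid_limit_kw=6.0, battery_max_kw=5.0))  # 2.0
```

A real safety layer would handle several actions and constraints at once, typically by solving a small optimization problem, and its safety depends on the constraint function being accurate.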

These methods were tested and compared against:

Model Predictive Control (MPC): An optimization-based control approach that uses a model of the system to predict future behavior.
Simple Rule-Based Control: A basic set of predefined rules for managing the energy system.
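
A decision tree control policy of the kind TreeC produces can be pictured as a small tree of threshold tests that ends in an action. The Python sketch below is illustrative only: the features (`price_eur_per_kwh`, `soc`), thresholds, and actions are invented, and the tree is written by hand, whereas TreeC searches for the tree with a metaheuristic. A hand-written tree like this one is also, in effect, a simple rule-based controller.

```python
from dataclasses import dataclass
from typing import Union


@dataclass
class Leaf:
    battery_kw: float  # action returned when this leaf is reached


@dataclass
class Node:
    feature: str       # name of the observation to test
    threshold: float   # go left if observation[feature] <= threshold
    left: "Tree"
    right: "Tree"


Tree = Union[Leaf, Node]


def act(tree, observation):
    """Walk the tree from the root until a leaf is reached."""
    while isinstance(tree, Node):
        if observation[tree.feature] <= tree.threshold:
            tree = tree.left
        else:
            tree = tree.right
    return tree.battery_kw


# Hypothetical policy: charge when electricity is cheap; otherwise
# discharge, unless the battery state of charge (soc) is too low.
policy = Node("price_eur_per_kwh", 0.10,
              Leaf(3.0),
              Node("soc", 0.2, Leaf(0.0), Leaf(-3.0)))

print(act(policy, {"price_eur_per_kwh": 0.05, "soc": 0.5}))  # 3.0
print(act(policy, {"price_eur_per_kwh": 0.30, "soc": 0.8}))  # -3.0
```

Because every decision is a readable threshold test, such a policy can be inspected before it is deployed on real hardware.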

The experiments were conducted on the electrical installations of 4 reproductions of residential houses, each with its own battery, photovoltaic system, and a mix of controllable and uncontrollable electrical loads (including an electric vehicle charger).

The results showed that the simple rules, TreeC, and MPC-based methods achieved similar costs, with a difference of only 0.6%. The reinforcement learning-based OptLayerPolicy method, still in its training phase, obtained a cost 25.5% higher than the other methods.
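
The percentages above are relative cost differences. The short Python sketch below shows the arithmetic, assuming the comparison is made against a reference cost such as that of the best of the other methods; the summary does not state the exact baseline, and the cost values used here are placeholders, not figures from the paper.

```python
def percent_higher(cost, reference_cost):
    """Return how much higher `cost` is than `reference_cost`, in percent."""
    return (cost - reference_cost) / reference_cost * 100.0


# Placeholder costs chosen only to illustrate the calculation.
reference_cost = 100.0
rl_cost = 125.5

print(round(percent_higher(rl_cost, reference_cost), 1))  # 25.5
```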

Additional simulations suggested the costs could be further reduced by:

Using a more representative training dataset for TreeC
Addressing errors in the MPC implementation caused by its reliance on accurate data from various sources

The paper also found that the OptLayerPolicy safety layer was beneficial for all investigated methods, although it remains error-prone. The TreeC method, which requires building a realistic simulation for training, exhibited the safest operational performance, exceeding the grid limit by only 27.1 Wh compared to 593.9 Wh for reinforcement learning.
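
A grid-limit exceedance expressed in Wh can be read as the energy drawn or injected beyond the limit, summed over time. The Python sketch below shows one way to compute such a figure from sampled grid power. It assumes a fixed sampling interval and a symmetric limit on import and export; the paper's exact metric definition is not given in this summary, and the sample values are invented.

```python
def exceedance_wh(grid_power_w, grid_limit_w, interval_s):
    """Energy (Wh) by which grid power exceeds the limit.

    grid_power_w: grid power samples in W (sign = import or export)
    grid_limit_w: allowed magnitude of grid power in W (assumed symmetric)
    interval_s:   time between samples in seconds (assumed constant)
    """
    interval_h = interval_s / 3600.0
    # Only the part of each sample above the limit counts.
    excess_w = sum(max(0.0, abs(p) - grid_limit_w) for p in grid_power_w)
    return excess_w * interval_h


# Invented one-minute samples against a 6 kW limit:
# excesses are 0, 500, 1000 and 200 W, about 28.3 Wh in total.
samples_w = [5000.0, 6500.0, 7000.0, -6200.0]
print(exceedance_wh(samples_w, grid_limit_w=6000.0, interval_s=60.0))
```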

Critical Analysis

The paper presents a real-world validation of two promising machine learning-based energy management approaches, which is a significant contribution to the field. However, the results also highlight some limitations and areas for further research:

Reinforcement Learning Performance: The reinforcement learning-based OptLayerPolicy method performed worse than the other approaches in the current experiments, likely due to being in an early training phase. More research is needed to optimize the training process and improve its performance.
Simulation Fidelity: TreeC requires building a realistic simulation for training, and additional simulations suggested that its costs could be reduced further by using a more representative training dataset. This points to the need for accurate and comprehensive simulation models and data to effectively train machine learning-based control policies.
MPC Limitations: The paper identified limitations in the MPC implementation caused by its reliance on accurate data from various sources. This highlights the importance of robust model-based control approaches and the need to address data quality issues.
Safety Layer Reliability: While the safety layer was found to be beneficial, the paper notes that it remains error-prone. Improving the reliability and accuracy of the safety constraint formulation is an important area for further research.

Overall, the paper demonstrates the potential of machine learning-based energy management approaches, while also identifying key challenges and areas for improvement. Continued research and real-world testing will be crucial for advancing these techniques and unlocking their full benefits in residential energy systems.

Conclusion

This paper presents the real-world validation of two innovative machine learning-based energy management approaches: reinforcement learning with a safety layer (OptLayerPolicy) and a metaheuristic algorithm generating a decision tree control policy (TreeC). The experiments conducted on residential electrical systems showed that TreeC achieved costs within 0.6% of model predictive control and simple rule-based control, while the reinforcement learning approach, still in its training phase, cost 25.5% more.

The key advantages of the machine learning approaches are the safety layer, which allows for safe real-world training, and the robust decision tree control policy. While the results are promising, the paper also identifies areas for further research, such as optimizing the reinforcement learning training process, improving simulation fidelity, and enhancing the reliability of the safety constraint formulation. Continued advancement in these areas will be crucial for unlocking the full potential of machine learning-based energy management in residential settings.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!


Related Papers


Real-world validation of safe reinforcement learning, model predictive control and decision tree-based home energy management systems

Julian Ruddick, Glenn Ceusters, Gilles Van Kriekinge, Evgenii Genov, Thierry Coosemans, Maarten Messagie

Recent advancements in machine learning based energy management approaches, specifically reinforcement learning with a safety layer (OptLayerPolicy) and a metaheuristic algorithm generating a decision tree control policy (TreeC), have shown promise. However, their effectiveness has only been demonstrated in computer simulations. This paper presents the real-world validation of these methods, comparing against model predictive control and simple rule-based control benchmarks. The experiments were conducted on the electrical installation of 4 reproductions of residential houses, which all have their own battery, photovoltaic and dynamic load system emulating a non-controllable electrical load and a controllable electric vehicle charger. The results show that the simple rules, TreeC, and model predictive control-based methods achieved similar costs, with a difference of only 0.6%. The reinforcement learning based method, still in its training phase, obtained a cost 25.5% higher than the other methods. Additional simulations show that the costs can be further reduced by using a more representative training dataset for TreeC and addressing errors in the model predictive control implementation caused by its reliance on accurate data from various sources. The OptLayerPolicy safety layer allows safe online training of a reinforcement learning agent in the real world, given an accurate constraint function formulation. The proposed safety layer method remains error-prone; nonetheless, it is found beneficial for all investigated methods. The TreeC method, which does require building a realistic simulation for training, exhibits the safest operational performance, exceeding the grid limit by only 27.1 Wh compared to 593.9 Wh for reinforcement learning.

8/15/2024

Reinforcement Learning for Efficient Design and Control Co-optimisation of Energy Systems

Marine Cauz, Adrien Bolland, Nicolas Wyrsch, Christophe Ballif

The ongoing energy transition drives the development of decentralised renewable energy sources, which are heterogeneous and weather-dependent, complicating their integration into energy systems. This study tackles this issue by introducing a novel reinforcement learning (RL) framework tailored for the co-optimisation of design and control in energy systems. Traditionally, the integration of renewable sources in the energy sector has relied on complex mathematical modelling and sequential processes. By leveraging RL's model-free capabilities, the framework eliminates the need for explicit system modelling. By optimising both control and design policies jointly, the framework enhances the integration of renewable sources and improves system efficiency. This contribution paves the way for advanced RL applications in energy management, leading to more efficient and effective use of renewable energy sources.

7/1/2024


Data-driven modeling and supervisory control system optimization for plug-in hybrid electric vehicles

Hao Zhang, Nuo Lei, Boli Chen, Bingbing Li, Rulong Li, Zhi Wang

Learning-based intelligent energy management systems for plug-in hybrid electric vehicles (PHEVs) are crucial for achieving efficient energy utilization. However, their application faces system reliability challenges in the real world, which prevents widespread acceptance by original equipment manufacturers (OEMs). This paper begins by establishing a PHEV model based on physical and data-driven models, focusing on the high-fidelity training environment. It then proposes a real-vehicle application-oriented control framework, combining horizon-extended reinforcement learning (RL)-based energy management with the equivalent consumption minimization strategy (ECMS) to enhance practical applicability, and improves the flawed method of equivalent factor evaluation based on instantaneous driving cycle and powertrain states found in existing research. Finally, comprehensive simulation and hardware-in-the-loop validation are carried out, demonstrating the advantages of the proposed control framework in fuel economy over adaptive-ECMS and rule-based strategies. Compared to conventional RL architectures that directly control powertrain components, the proposed control method not only achieves similar optimality but also significantly enhances the disturbance resistance of the energy management system, providing an effective control framework for RL-based energy management strategies aimed at real-vehicle applications by OEMs.

6/14/2024


Control Policy Correction Framework for Reinforcement Learning-based Energy Arbitrage Strategies

Seyed Soroush Karimi Madahi, Gargya Gokhale, Marie-Sophie Verwee, Bert Claessens, Chris Develder

A continuous rise in the penetration of renewable energy sources, along with the use of the single imbalance pricing, provides a new opportunity for balance responsible parties to reduce their cost through energy arbitrage in the imbalance settlement mechanism. Model-free reinforcement learning (RL) methods are an appropriate choice for solving the energy arbitrage problem due to their outstanding performance in solving complex stochastic sequential problems. However, RL is rarely deployed in real-world applications since its learned policy does not necessarily guarantee safety during the execution phase. In this paper, we propose a new RL-based control framework for batteries to obtain a safe energy arbitrage strategy in the imbalance settlement mechanism. In our proposed control framework, the agent initially aims to optimize the arbitrage revenue. Subsequently, in the post-processing step, we correct (constrain) the learned policy following a knowledge distillation process based on properties that follow human intuition. Our post-processing step is a generic method and is not restricted to the energy arbitrage domain. We use the Belgian imbalance price of 2023 to evaluate the performance of our proposed framework. Furthermore, we deploy our proposed control framework on a real battery to show its capability in the real world.

5/1/2024