Fair Reinforcement Learning Algorithm for PV Active Control in LV Distribution Networks

Read original: arXiv:2409.09074 - Published 9/17/2024 by Maurizio Vassallo, Amina Benzerga, Alireza Bahmanyar, Damien Ernst

Overview

  • This paper proposes a fair reinforcement learning algorithm for active control of photovoltaic (PV) systems in low-voltage (LV) distribution networks.
  • The goal is to maintain voltage levels within acceptable limits while ensuring fair curtailment of PV generation across different customers.
  • The algorithm learns an optimal control policy through interactions with the power grid, aiming to balance voltage regulation and fair PV curtailment.

Plain English Explanation

The paper focuses on managing the power generated by rooftop solar panels (photovoltaic or PV systems) in low-voltage electricity distribution networks. As more homes and businesses install solar panels, the power they feed in can sometimes exceed what the local grid can absorb, leading to voltage problems such as overvoltage.

To address this, the researchers developed a reinforcement learning algorithm that can automatically control the output of PV systems. The algorithm learns an optimal control policy through interacting with the power grid, with the goal of maintaining voltage levels within acceptable limits while also ensuring that the burden of reducing solar power output (known as "curtailment") is distributed fairly among different customers.

The key innovation is the "fairness" aspect, which means ensuring that no single customer or group of customers bears a disproportionate share of the curtailment. This helps to avoid unfairly penalizing those who have invested in solar panels.

Technical Explanation

The paper presents a fair reinforcement learning algorithm for active control of PV systems in LV distribution networks. The algorithm learns an optimal control policy through interactions with the power grid, aiming to balance two objectives:

  1. Voltage regulation: Maintaining voltage levels within acceptable limits to ensure grid stability and power quality.
  2. Fair PV curtailment: Distributing the burden of reducing PV generation fairly across different customers to avoid disproportionately impacting those who have invested in solar.
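The two objectives above can be folded into a single scalar reward. The following is a minimal sketch of one way to do that, not the reward used in the paper: the voltage band of 0.95 to 1.05 p.u., the use of the standard deviation of curtailment ratios as the unfairness measure, the extra energy term, and the weights `w_voltage`, `w_fair`, and `w_energy` are all illustrative assumptions.

```python
import numpy as np


def voltage_penalty(v_pu, v_min=0.95, v_max=1.05):
    """Sum of per-bus voltage excursions outside [v_min, v_max], in p.u.

    The band limits are assumed values, not taken from the paper.
    """
    v = np.asarray(v_pu, dtype=float)
    over = np.clip(v - v_max, 0.0, None)
    under = np.clip(v_min - v, 0.0, None)
    return float(np.sum(over + under))


def curtailment_ratios(p_available, p_injected):
    """Fraction of available PV power each customer gives up (0 = none, 1 = all)."""
    avail = np.asarray(p_available, dtype=float)
    inj = np.asarray(p_injected, dtype=float)
    ratios = np.zeros_like(avail)
    producing = avail > 0  # customers with no available power cannot be curtailed
    ratios[producing] = (avail[producing] - inj[producing]) / avail[producing]
    return ratios


def unfairness(ratios):
    """Standard deviation of curtailment ratios; 0 means everyone gives up the same share.

    This is one possible fairness proxy, chosen here for simplicity.
    """
    return float(np.std(ratios))


def reward(v_pu, p_available, p_injected, w_voltage=10.0, w_fair=1.0, w_energy=0.1):
    """Negative weighted sum of voltage violations, unfairness, and mean curtailment.

    The weights are hypothetical tuning parameters.
    """
    ratios = curtailment_ratios(p_available, p_injected)
    cost = (
        w_voltage * voltage_penalty(v_pu)
        + w_fair * unfairness(ratios)
        + w_energy * float(np.mean(ratios))
    )
    return -cost
```

With voltages inside the band and the same total curtailment, this reward is higher when two customers each give up 25% than when one gives up 50% and the other nothing, which is the behavior the fairness objective is meant to encourage.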

The algorithm uses a multi-agent reinforcement learning framework, where each PV system is modeled as an agent that learns its own control policy. The agents interact with the power grid and receive rewards that reflect both how well voltages are regulated and how fairly curtailment is shared. Through training, the agents are meant to arrive at a control policy that balances these two objectives.
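To make the agent-grid interaction concrete, here is a toy environment step, offered as a sketch under strong assumptions rather than the simulation used in the paper. It models a radial feeder with a linearized voltage approximation in which the voltage rise at each customer is proportional to the power flowing through the line segments it shares with other customers. The function names, the 1.05 p.u. limit, the segment sensitivity `r_per_segment`, and the 5 kW example values are all hypothetical.

```python
import numpy as np

V_MAX = 1.05  # assumed upper voltage limit in p.u.


def sensitivity_matrix(n_customers, r_per_segment=0.002):
    """Voltage sensitivity (p.u. per kW) for customers spaced along a radial feeder.

    Entry [i, j] is proportional to the number of line segments shared by the
    paths from the source to customers i and j. Values are illustrative only.
    """
    idx = np.arange(1, n_customers + 1)
    return r_per_segment * np.minimum.outer(idx, idx)


def env_step(p_available_kw, curtail_fraction, v_source=1.0, r_per_segment=0.002):
    """Apply each agent's action and return (injected power, bus voltages).

    curtail_fraction[i] in [0, 1] is the share of available PV power
    that agent i chooses to curtail.
    """
    avail = np.asarray(p_available_kw, dtype=float)
    frac = np.clip(np.asarray(curtail_fraction, dtype=float), 0.0, 1.0)
    p_injected = avail * (1.0 - frac)
    v = v_source + sensitivity_matrix(len(avail), r_per_segment) @ p_injected
    return p_injected, v


available = [5.0, 5.0, 5.0]  # kW available at three customers, nearest to farthest

# No curtailment: the customer at the end of the feeder sees the highest voltage.
_, v_none = env_step(available, [0.0, 0.0, 0.0])

# Equal shares: every customer curtails 20% of its available power.
p_uniform, v_uniform = env_step(available, [0.2, 0.2, 0.2])

# Unequal shares: only the end-of-feeder customer curtails, by 40%.
p_last, v_last = env_step(available, [0.0, 0.0, 0.4])
```

In this toy setup both curtailment choices bring the end-of-feeder voltage back under the limit, but curtailing only the last customer gives up 2 kW in total while equal shares give up 3 kW. That gap illustrates why a purely voltage-driven controller tends to load the burden onto customers far from the transformer, and why fairness has to be made an explicit objective.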

The paper includes experiments on a realistic LV distribution network model, demonstrating the algorithm's ability to maintain voltage levels within limits while ensuring fair PV curtailment compared to other approaches.

Critical Analysis

The paper provides a comprehensive and well-designed solution to the problem of active voltage control in LV distribution networks with increasing PV penetration. The inclusion of fairness as a key objective is a notable contribution, as it addresses an important practical concern for PV owners.

However, the paper does not explore the potential impact of the algorithm on the financial incentives and return on investment for PV system owners. Excessively frequent or severe curtailment could reduce the overall benefits of solar adoption, which would be an important consideration for policymakers and grid operators.

Additionally, the paper focuses on a single LV distribution network model and does not discuss the scalability of the algorithm to larger, more complex power systems. Further research may be needed to understand how the algorithm would perform in more diverse grid topologies and with different penetration levels of distributed energy resources.

Conclusion

This paper presents a fair reinforcement learning algorithm for active control of PV systems in LV distribution networks. The algorithm aims to maintain voltage levels within acceptable limits while ensuring that the burden of PV curtailment is distributed fairly across customers. The experimental results demonstrate the effectiveness of the proposed approach in balancing these two important objectives.

The inclusion of fairness as a key design consideration is a notable contribution, as it addresses a practical concern that is crucial for encouraging broader adoption of distributed solar energy. Further research may be needed to fully understand the algorithm's performance and implications in more diverse power system contexts.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Fair Reinforcement Learning Algorithm for PV Active Control in LV Distribution Networks

Maurizio Vassallo, Amina Benzerga, Alireza Bahmanyar, Damien Ernst

The increasing adoption of distributed energy resources, particularly photovoltaic (PV) panels, has presented new and complex challenges for power network control. With the significant energy production from PV panels, voltage issues in the network have become a problem. Currently, PV smart inverters (SIs) are used to mitigate the voltage problems by controlling their active power generation and reactive power injection or absorption. However, reducing the active power output of PV panels can be perceived as unfair to some customers, discouraging future installations. To solve this issue, in this paper, a reinforcement learning technique is proposed to address voltage issues in a distribution network, while considering fairness in active power curtailment among customers. The feasibility of the proposed approach is explored through experiments, demonstrating its ability to effectively control voltage in a fair and efficient manner.

9/17/2024

Robust Deep Reinforcement Learning for Inverter-based Volt-Var Control in Partially Observable Distribution Networks

Qiong Liu, Ye Guo, Tong Xu

Inverter-based volt-var control is studied in this paper. One key issue in DRL-based approaches is the limited measurement deployment in active distribution networks, which leads to problems of a partially observable state and unknown reward. To address those problems, this paper proposes a robust DRL approach with a conservative critic and a surrogate reward. The conservative critic utilizes the quantile regression technology to estimate conservative state-action value function based on the partially observable state, which helps to train a robust policy; the surrogate rewards of power loss and voltage violation are designed that can be calculated from the limited measurements. The proposed approach optimizes the power loss of the whole network and the voltage profile of buses with measurable voltages while indirectly improving the voltage profile of other buses. Extensive simulations verify the effectiveness of the robust DRL approach in different limited measurement conditions, even when only the active power injection of the root bus and less than 10% of bus voltages are measurable.

8/14/2024

Safety Constrained Multi-Agent Reinforcement Learning for Active Voltage Control

Yang Qu, Jinming Ma, Feng Wu

Active voltage control presents a promising avenue for relieving power congestion and enhancing voltage quality, taking advantage of the distributed controllable generators in the power network, such as roof-top photovoltaics. While Multi-Agent Reinforcement Learning (MARL) has emerged as a compelling approach to address this challenge, existing MARL approaches tend to overlook the constrained optimization nature of this problem, failing to guarantee safety constraints. In this paper, we formalize the active voltage control problem as a constrained Markov game and propose a safety-constrained MARL algorithm. We expand the primal-dual optimization RL method to multi-agent settings, and augment it with a novel approach of double safety estimation to learn the policy and to update the Lagrange multiplier. In addition, we propose different cost functions and investigate their influence on the behavior of our constrained MARL method. We evaluate our approach in the power distribution network simulation environment with real-world scale scenarios. Experimental results demonstrate the effectiveness of the proposed method compared with the state-of-the-art MARL methods. This paper is published at https://www.ijcai.org/Proceedings/2024/.

9/4/2024

End-to-End Reinforcement Learning of Curative Curtailment with Partial Measurement Availability

Hinrikus Wolf, Luis Bottcher, Sarra Bouchkati, Philipp Lutat, Jens Breitung, Bastian Jung, Tina Mollemann, Viktor Todosijević, Jan Schiefelbein-Lach, Oliver Pohl, Andreas Ulbig, Martin Grohe

In the course of the energy transition, the expansion of generation and consumption will change, and many of these technologies, such as PV systems, electric cars and heat pumps, will influence the power flow, especially in the distribution grids. Scalable methods that can make decisions for each grid connection are needed to enable congestion-free grid operation in the distribution grids. This paper presents a novel end-to-end approach to resolving congestion in distribution grids with deep reinforcement learning. Our architecture learns to curtail power and set appropriate reactive power to determine a non-congested and, thus, feasible grid state. State-of-the-art methods such as the optimal power flow (OPF) demand high computational costs and detailed measurements of every bus in a grid. In contrast, the presented method enables decisions under sparse information with just some buses observable in the grid. Distribution grids are generally not yet fully digitized and observable, so this method can be used for decision-making on the majority of low-voltage grids. On a real low-voltage grid the approach resolves 100% of violations in the voltage band and 98.8% of asset overloads. The results show that decisions can also be made on real grids that guarantee sufficient quality for congestion-free grid operation.

6/21/2024