Deep Reinforcement Learning Based Volt-VAR Optimization in Smart Distribution Systems
IEEE Transactions on Smart Grid · 2021 · Vol 12 (1) · pp. 361-371
Author(s): Ying Zhang, Xinan Wang, Jianhui Wang, Yingchen Zhang
Energies · 2021 · Vol 14 (12) · pp. 3540
Author(s): Jing Zhang, Yiqi Li, Zhi Wu, Chunyan Rong, Tao Wang, ...

Because of the high penetration of renewable energy and the installation of new control devices, modern distribution networks face voltage regulation challenges. Recently, the rapid development of artificial intelligence has introduced new solutions for high-dimensional, dynamic optimal control problems. In this paper, a deep reinforcement learning method is proposed to solve the two-timescale optimal voltage control problem. The control variables are assigned to different agents: discrete variables are handled by a deep Q-network (DQN) agent, while continuous variables are handled by a deep deterministic policy gradient (DDPG) agent. All agents are trained simultaneously with a specially designed reward aimed at minimizing the long-term average voltage deviation. A case study on a modified IEEE 123-bus system demonstrates that the proposed algorithm performs comparably to, or better than, a model-based optimal control scheme, with high computational efficiency and strong potential for online application.
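To make the hybrid agent split concrete, the sketch below shows one way it could look in code: a DQN head scoring discrete device actions and a DDPG actor emitting continuous setpoints, both driven by a shared voltage-deviation reward. This is a minimal sketch, not the authors' implementation; the state layout, network sizes, and action counts are illustrative assumptions.

```python
import torch
import torch.nn as nn

STATE_DIM = 16     # assumed: bus voltage magnitudes and power injections
N_DISCRETE = 4     # assumed: joint tap/capacitor switching combinations
N_CONTINUOUS = 2   # assumed: inverter reactive-power setpoints

class QNet(nn.Module):
    """DQN head: one Q-value per discrete action (switched devices)."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(STATE_DIM, 64), nn.ReLU(),
                                 nn.Linear(64, N_DISCRETE))
    def forward(self, s):
        return self.net(s)

class Actor(nn.Module):
    """DDPG actor: deterministic continuous setpoints in [-1, 1]."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(STATE_DIM, 64), nn.ReLU(),
                                 nn.Linear(64, N_CONTINUOUS), nn.Tanh())
    def forward(self, s):
        return self.net(s)

def reward(v_pu: torch.Tensor) -> torch.Tensor:
    """Shared reward: negative mean deviation from 1.0 p.u., pushing both
    agents toward minimizing long-term average voltage deviation."""
    return -(v_pu - 1.0).abs().mean()

state = torch.randn(STATE_DIM)             # placeholder grid measurement
a_disc = QNet()(state).argmax().item()     # epsilon-greedy during training
a_cont = Actor()(state).detach()           # plus exploration noise in training
print(a_disc, a_cont, reward(torch.tensor([0.98, 1.01, 1.03])))
```

In a typical two-timescale split, the switched devices (DQN) would be dispatched on the slower timescale and the inverter setpoints (DDPG) on the faster one.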


2022
Author(s): Yufan Zhang, Honglin Wen, Qiuwei Wu, Qian Ai

Prediction intervals (PIs) offer an effective tool for quantifying the uncertainty of loads in distribution systems. Traditional central PIs cannot adapt well to skewed distributions, and their offline training makes them vulnerable to unforeseen changes in future load patterns. We therefore propose an optimal PI estimation approach that is online and adaptive to different data distributions, adaptively determining symmetric or asymmetric probability proportion pairs for the quantiles that form the PI bounds. It relies on the online learning ability of reinforcement learning (RL) to integrate two online tasks, namely the adaptive selection of probability proportion pairs and the quantile predictions, both of which are modeled by neural networks. The quality of the resulting PIs thus guides the selection of optimal probability proportion pairs, forming a closed loop that improves PI quality. Furthermore, to improve the learning efficiency of the quantile forecasts, a prioritized experience replay (PER) strategy is proposed for the online quantile regression process. Case studies on both load and net load demonstrate that the proposed method adapts to the data distribution better than an online central-PI method. Compared with offline-trained methods, it obtains higher-quality PIs and is more robust against concept drift.
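As a concrete illustration of the probability proportion pairs, the sketch below trains lower and upper quantile models with the standard pinball loss for an asymmetric pair (tau_lo, tau_hi). The pair is fixed here for clarity, whereas the paper's RL agent would select it online; the data, network size, and the specific pair are assumptions.

```python
import torch
import torch.nn as nn

def pinball_loss(pred: torch.Tensor, y: torch.Tensor, tau: float) -> torch.Tensor:
    """Standard quantile-regression (pinball) loss at quantile level tau."""
    err = y - pred
    return torch.maximum(tau * err, (tau - 1) * err).mean()

# For a nominal 90% PI the symmetric pair is (0.05, 0.95); a skewed load
# distribution may be better served by e.g. (0.02, 0.92): same nominal
# coverage, shifted bounds. The paper's RL agent picks this pair adaptively.
tau_lo, tau_hi = 0.02, 0.92

net = nn.Sequential(nn.Linear(8, 32), nn.ReLU(), nn.Linear(32, 2))  # lower, upper
opt = torch.optim.Adam(net.parameters(), lr=1e-3)

x = torch.randn(256, 8)        # placeholder load features
y = torch.randn(256, 1).exp()  # placeholder skewed load target

for _ in range(200):           # an online variant would update on a stream
    lo, hi = net(x).chunk(2, dim=1)
    loss = pinball_loss(lo, y, tau_lo) + pinball_loss(hi, y, tau_hi)
    opt.zero_grad()
    loss.backward()
    opt.step()
```

Note that this sketch leaves the two bounds unconstrained; a practical implementation would also penalize or prevent quantile crossing.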
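The PER component could be sketched as follows: replay samples are drawn with probability proportional to their last observed pinball loss, so poorly fit observations are revisited more often. The buffer size, priority exponent, and eviction rule are assumptions, not the paper's design.

```python
import numpy as np

class PrioritizedReplay:
    """Replay buffer whose sampling probability grows with each sample's
    last observed loss (hypothetical priority rule)."""
    def __init__(self, capacity: int = 1000, alpha: float = 0.6):
        self.capacity, self.alpha = capacity, alpha
        self.data, self.prio = [], []

    def add(self, sample, loss: float) -> None:
        if len(self.data) >= self.capacity:  # evict the oldest entry
            self.data.pop(0)
            self.prio.pop(0)
        self.data.append(sample)
        self.prio.append((abs(loss) + 1e-6) ** self.alpha)

    def sample(self, k: int):
        p = np.asarray(self.prio)
        p = p / p.sum()
        idx = np.random.choice(len(self.data), size=k, p=p)
        return [self.data[i] for i in idx]

buf = PrioritizedReplay()
for t in range(50):                          # placeholder observation stream
    buf.add((t, np.random.rand()), loss=np.random.rand())
batch = buf.sample(8)                        # high-loss samples favored
```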

