Voltage Control-Based Ancillary Service Using Deep Reinforcement Learning

Energies, 2021, Vol. 14 (8), pp. 2274
Author(s): Oleh Lukianykhin, Tetiana Bogodorova

Ancillary services rely on operating reserves to support an uninterrupted electricity supply that meets demand. One of the hidden reserves of the grid lies in thermostatically controlled loads (TCLs). To exploit these reserves efficiently, a new voltage control scheme is proposed that adjusts the voltage within the allowable range so that the aggregate power consumption follows a set power reference. The proposed approach is based on a deep reinforcement learning (RL) algorithm. The double deep Q-network (DDQN) is chosen for its proven state-of-the-art performance in complex control tasks, its native handling of continuous environment state variables, and the possibility of applying the trained network to the real grid in a model-free manner. To evaluate the deep RL control performance, the proposed method was compared with classic proportional control of the voltage change according to the power reference. The solution was validated in setups with different numbers of TCLs in a feeder to demonstrate its generalization capabilities. This article discusses the particularities of applying deep reinforcement learning in the power system domain, along with the results achieved by the RL-powered demand response solution. The hyperparameters of the DDQN algorithm were tuned to achieve the best performance; in particular, the influence of the learning rate, the target network update step, the hidden layer size, the batch size, and the replay buffer size was assessed. The achieved performance is roughly two times better than that of the competing approach of optimal control selection within the considered simulation time interval. A decrease in the deviation of the actual power consumption from the reference power profile is demonstrated, and the cost benefit of the presented voltage control-based ancillary service is estimated to show its potential impact.
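For readers unfamiliar with the algorithm, the following is a minimal sketch of a double-DQN update in PyTorch. It is not the authors' implementation: the environment interface, network dimensions, and the specific hyperparameter values are placeholders; only the hyperparameter names (learning rate, target network update step, hidden layer size, batch size, replay buffer size) come from the abstract.

```python
import random
from collections import deque

import torch
import torch.nn as nn

# Hyperparameters named in the abstract; the values below are placeholders.
LEARNING_RATE = 1e-3
TARGET_UPDATE_STEP = 500     # how often the target network is synchronized
HIDDEN_SIZE = 128            # hidden layer size
BATCH_SIZE = 64
REPLAY_BUFFER_SIZE = 50_000
GAMMA = 0.99

def make_q_network(state_dim: int, n_actions: int) -> nn.Module:
    # Continuous state variables feed straight into the network.
    return nn.Sequential(
        nn.Linear(state_dim, HIDDEN_SIZE),
        nn.ReLU(),
        nn.Linear(HIDDEN_SIZE, n_actions),  # one Q-value per discrete voltage action
    )

q_net = make_q_network(state_dim=4, n_actions=5)      # dimensions are placeholders
target_net = make_q_network(state_dim=4, n_actions=5)
target_net.load_state_dict(q_net.state_dict())
optimizer = torch.optim.Adam(q_net.parameters(), lr=LEARNING_RATE)
replay_buffer: deque = deque(maxlen=REPLAY_BUFFER_SIZE)  # (s, a, r, s', done) tuples

def ddqn_update(step: int) -> None:
    """One double-DQN gradient step on a sampled minibatch."""
    if len(replay_buffer) < BATCH_SIZE:
        return
    batch = random.sample(replay_buffer, BATCH_SIZE)
    s, a, r, s2, done = (torch.as_tensor(x, dtype=torch.float32)
                         for x in zip(*batch))
    # Double DQN: the online network picks the greedy next action,
    # while the target network evaluates it.
    next_a = q_net(s2).argmax(dim=1, keepdim=True)
    target_q = target_net(s2).gather(1, next_a).squeeze(1)
    y = r + GAMMA * (1.0 - done) * target_q
    q = q_net(s).gather(1, a.long().unsqueeze(1)).squeeze(1)
    loss = nn.functional.mse_loss(q, y.detach())
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    if step % TARGET_UPDATE_STEP == 0:  # periodic hard target update
        target_net.load_state_dict(q_net.state_dict())
```

The decoupling of action selection (online network) from action evaluation (target network) is what distinguishes double DQN from vanilla DQN and reduces the overestimation of Q-values.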

2012, Vol. 2012, pp. 1-17
Author(s): Andrzej Chydzinski, Blazej Adamczyk

We present an analysis of the number of losses caused by buffer overflows in a finite-buffer queue with batch arrivals and autocorrelated interarrival times. Using the batch Markovian arrival process (BMAP), we derive formulas for the average number of losses in a finite time interval and for the stationary loss ratio. In addition, several numerical examples are presented, illustrating the dependence of the number of losses on the average batch size, buffer size, system load, autocorrelation structure, and time.
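The paper's results are analytical, obtained via the BMAP framework, and reproducing them is beyond a short example. As a rough, assumed illustration of how overflow losses behave, the sketch below instead Monte Carlo simulates a much simpler M[X]/M/1/N queue (Poisson batch arrivals with geometric batch sizes, exponential service, finite system capacity) and estimates the loss ratio as lost customers divided by arrived customers.

```python
import math
import random

def geometric(rng: random.Random, mean: float) -> int:
    """Batch size on {1, 2, ...} with the given mean (success prob. 1/mean)."""
    p, k = 1.0 / mean, 1
    while rng.random() > p:
        k += 1
    return k

def loss_ratio(buffer_size: int, mean_batch: float, load: float,
               n_batches: int = 200_000, seed: int = 1) -> float:
    """Monte Carlo estimate of the stationary loss ratio (lost / arrived)
    for an M[X]/M/1/N queue with system capacity `buffer_size`."""
    rng = random.Random(seed)
    arrival_rate = 1.0
    service_rate = arrival_rate * mean_batch / load   # so that rho = load
    queue = arrived = lost = batches = 0
    t_arr, t_srv = rng.expovariate(arrival_rate), math.inf
    while batches < n_batches:
        if t_arr < t_srv:                        # next event: batch arrival
            t, batches = t_arr, batches + 1
            batch = geometric(rng, mean_batch)
            arrived += batch
            accepted = min(batch, buffer_size - queue)
            lost += batch - accepted             # overflow: excess is dropped
            queue += accepted
            if t_srv == math.inf and queue > 0:  # server was idle: start service
                t_srv = t + rng.expovariate(service_rate)
            t_arr = t + rng.expovariate(arrival_rate)
        else:                                    # next event: service completion
            t, queue = t_srv, queue - 1
            t_srv = t + rng.expovariate(service_rate) if queue else math.inf
    return lost / arrived

# Losses grow quickly as the buffer shrinks or the mean batch size grows.
for n in (5, 10, 20):
    print(n, loss_ratio(buffer_size=n, mean_batch=2.0, load=0.8))
```

Unlike this memoryless toy model, the BMAP used in the paper captures autocorrelated interarrival times, which is precisely the effect the authors' formulas quantify.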


Energies, 2020, Vol. 13 (15), pp. 3928
Author(s): Jean-François Toubeau, Bashir Bakhshideh Zad, Martin Hupez, Zacharie De Grève, François Vallée

This paper addresses the voltage control problem in medium-voltage distribution networks. The objective is to cost-efficiently maintain the voltage profile within a safe range in the presence of uncertainties in both the future operating conditions and the physical parameters of the system. Indeed, the voltage profile depends not only on the fluctuating renewable-based power generation and load demand, but also on the physical parameters of the system components. In reality, the characteristics of loads, lines, and transformers are subject to complex and dynamic dependencies that are difficult to model. In such a context, the quality of the control strategy depends on the accuracy of the power flow representation, which requires capturing the non-linear behavior of the power network. Relying on detailed analytical models (which are still subject to uncertainties) entails a computational burden that does not comply with the real-time constraints of the voltage control task. To address this issue while avoiding arbitrary modeling approximations, we leverage a deep reinforcement learning model to enable autonomous operational control of the grid. Results show that the proposed model-free approach offers a promising alternative for finding a compromise between computation time, conservativeness, and economic performance.
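The abstract does not spell out the learning environment; the sketch below shows one plausible Gym-style interface for such a voltage control task. The observation, action, bounds, and reward weights are all assumptions made for illustration, not the authors' specification; the reward simply penalizes voltage-band violations plus control effort, mirroring the stated cost-efficiency objective.

```python
import numpy as np

class VoltageControlEnv:
    """Hypothetical Gym-style interface for the voltage control task;
    the state, action, and reward definitions are illustrative assumptions,
    not the authors' specification."""

    V_MIN, V_MAX = 0.95, 1.05   # safe voltage band, per unit

    def reset(self) -> np.ndarray:
        # Observation: per-bus voltages (a real setup would add loads,
        # renewable injections, tap positions, etc.).
        self.v = np.random.uniform(0.98, 1.02, size=8)
        return self.v.copy()

    def step(self, action: np.ndarray):
        # Action: control adjustments (e.g., OLTC taps, reactive setpoints),
        # modeled here as a direct perturbation of voltages plus noise.
        self.v = self.v + 0.01 * action + np.random.normal(0.0, 0.002, self.v.shape)
        # Penalty: distance outside the safe band, plus control effort,
        # mirroring the paper's cost-efficiency objective.
        violation = (np.clip(self.V_MIN - self.v, 0.0, None).sum()
                     + np.clip(self.v - self.V_MAX, 0.0, None).sum())
        reward = -(100.0 * float(violation) + 0.01 * float(np.abs(action).sum()))
        return self.v.copy(), reward, False, {}
```

In the model-free setting described by the paper, the agent learns from such interaction transitions alone, so no explicit power flow model has to be solved at decision time.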


2019, Vol. 139 (2), pp. 122-129
Author(s): Ryuichiro Takenaka, Satoshi Takayama, Atsushi Ishigame

2019
Author(s): Leor M Hackel, Jeffrey Jordan Berg, Björn Lindström, David Amodio

Do habits play a role in our social impressions? To investigate the contribution of habits to the formation of social attitudes, we examined the roles of model-free and model-based reinforcement learning in social interactions—computations linked in past work to habit and planning, respectively. Participants in this study learned about novel individuals in a sequential reinforcement learning paradigm, choosing financial advisors who led them to high- or low-paying stocks. Results indicated that participants relied on both model-based and model-free learning, such that each independently predicted choice during the learning task and self-reported liking in a post-task assessment. Specifically, participants liked advisors who could provide large future rewards as well as advisors who had provided them with large rewards in the past. Moreover, participants varied in their use of model-based and model-free learning strategies, and this individual difference shaped how learning related to self-reported attitudes: the more a participant relied on model-free learning, the more strongly model-free social learning predicted their post-task attitudes. We discuss implications for attitudes, trait impressions, and social behavior, as well as the role of habits in a memory systems model of social cognition.
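In this literature, the standard computational account is a hybrid learner that mixes model-free (habit-like) and model-based (planning) action values with a weighting parameter w, as in two-step task analyses. The sketch below illustrates that idea under those assumptions; it is not the authors' exact model, and all names and values are illustrative.

```python
import numpy as np

def hybrid_values(q_mf: np.ndarray, q_mb: np.ndarray, w: float) -> np.ndarray:
    """Weighted mixture: w = 0 is purely model-free (habit-like),
    w = 1 purely model-based (planning)."""
    return w * q_mb + (1.0 - w) * q_mf

def softmax_choice(q: np.ndarray, beta: float,
                   rng: np.random.Generator) -> int:
    """Softmax (logit) choice rule with inverse temperature beta."""
    p = np.exp(beta * (q - q.max()))
    p /= p.sum()
    return int(rng.choice(len(q), p=p))

def mf_update(q_mf: np.ndarray, action: int, reward: float,
              alpha: float) -> None:
    """Model-free TD update: nudge the chosen advisor's value
    toward the reward actually received."""
    q_mf[action] += alpha * (reward - q_mf[action])

def mb_values(T: np.ndarray, v_stage2: np.ndarray) -> np.ndarray:
    """Model-based values: expected second-stage value under a learned
    transition model, T[a, s] = P(reach stock s | choose advisor a)."""
    return T @ v_stage2

# Illustration: two advisors, two stocks.
rng = np.random.default_rng(0)
q_mf = np.zeros(2)
T = np.array([[0.7, 0.3], [0.3, 0.7]])   # assumed transition structure
v_stage2 = np.array([1.0, 0.0])          # assumed learned stock values
q = hybrid_values(q_mf, mb_values(T, v_stage2), w=0.5)
a = softmax_choice(q, beta=3.0, rng=rng)
```

Fitting w per participant is what allows the individual-difference analysis reported above: low-w (model-free-dominant) participants are the ones whose attitudes track past rewards most strongly.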

