Approximate Value Iteration in the Reinforcement Learning Context. Application to Electrical Power System Control.

Author(s):  
Damien Ernst ◽  
Mevludin Glavic ◽  
Pierre Geurts ◽  
Louis Wehenkel

In this paper we explain how to design intelligent agents able to process the information acquired from interaction with a system in order to learn a good control policy, and we show how the methodology can be applied to control devices aimed at damping electrical power oscillations. The control problem is formalized as a discrete-time optimal control problem, and the information acquired from interaction with the system is a set of samples, where each sample is composed of four elements: a state, the action taken in this state, the instantaneous reward observed, and the successor state of the system. To process this information we consider reinforcement learning algorithms that determine an approximation of the so-called Q-function by mimicking the behavior of the value iteration algorithm. Simulations are first carried out on a benchmark power system modeled with two state variables. Then we present a more complex case study on a four-machine power system where the reinforcement learning algorithm controls a Thyristor Controlled Series Capacitor (TCSC) aimed at damping power system oscillations.
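The idea of approximating the Q-function from four-tuples by mimicking value iteration can be illustrated with a fitted-Q-iteration style sketch. The tree-based regressor, discount factor, and discrete action set below are illustrative assumptions, not the authors' exact experimental setup.

```python
# Minimal sketch of fitted Q iteration on a batch of (s, a, r, s') samples.
# The regressor choice, gamma and the action set are assumptions.
import numpy as np
from sklearn.ensemble import ExtraTreesRegressor

def fitted_q_iteration(samples, actions, gamma=0.95, n_iterations=50):
    """samples: list of (state, action, reward, next_state) tuples."""
    S = np.array([np.append(s, a) for s, a, r, s_next in samples])
    R = np.array([r for s, a, r, s_next in samples])
    next_states = [s_next for s, a, r, s_next in samples]

    q_model = None
    for _ in range(n_iterations):
        if q_model is None:
            targets = R                        # Q_1(s, a) = r
        else:
            # Q_{k+1}(s, a) = r + gamma * max_a' Q_k(s', a')
            q_next = np.array([
                max(q_model.predict(np.append(s_next, a).reshape(1, -1))[0]
                    for a in actions)
                for s_next in next_states
            ])
            targets = R + gamma * q_next
        q_model = ExtraTreesRegressor(n_estimators=50).fit(S, targets)
    return q_model

def greedy_action(q_model, state, actions):
    # Control policy: pick the action with the highest approximate Q-value.
    return max(actions,
               key=lambda a: q_model.predict(np.append(state, a).reshape(1, -1))[0])
```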

2014 ◽  
Vol 513-517 ◽  
pp. 1092-1095
Author(s):  
Bo Wu ◽  
Yan Peng Feng ◽  
Hong Yan Zheng

Bayesian reinforcement learning has turned out to be an effective solution to the optimal tradeoff between exploration and exploitation. In practical applications, however, the exponential growth in the number of learning parameters is the main impediment to online planning and learning. To overcome this problem, we bring factored representations, model-based learning, and Bayesian reinforcement learning together in a new approach. First, we exploit a factored representation of the states to reduce the number of learning parameters, and adopt a Bayesian inference method to learn the unknown structure and parameters simultaneously. Then, we use an online point-based value iteration algorithm to plan and learn. The experimental results show that the proposed approach effectively improves learning efficiency in large-scale state spaces.
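A minimal, hypothetical sketch of the factored Bayesian model-learning step: each state factor keeps a Dirichlet posterior over its next value, conditioned on its parent factors. The fixed parent structure and uniform priors are assumptions for illustration; the approach described above additionally learns the structure online and plans with point-based value iteration.

```python
# Illustrative Dirichlet-multinomial posterior for a factored transition model.
import numpy as np
from collections import defaultdict

class FactoredBayesModel:
    def __init__(self, parents, n_values, prior=1.0):
        # parents[f] lists the indices of the (assumed known) parent factors of factor f.
        self.parents = parents
        self.counts = defaultdict(lambda: np.full(n_values, prior))

    def update(self, state, action, next_state):
        # Bayesian update: add one count per factor for the observed transition.
        for f, pa in enumerate(self.parents):
            key = (f, action, tuple(state[i] for i in pa))
            self.counts[key][next_state[f]] += 1

    def sample_transition_probs(self, state, action):
        # Draw one plausible factored transition model from the posterior,
        # e.g. to be used inside an online planning step.
        return [np.random.dirichlet(self.counts[(f, action, tuple(state[i] for i in pa))])
                for f, pa in enumerate(self.parents)]
```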


Author(s):  
Pasala Gopi ◽  
P. Linga Reddy

The load frequency control problem in a multi-area interconnected electrical power system becomes much more complex with increasing size, changing structure and increasing load. This paper deals with load frequency control of a three-area interconnected power system incorporating reheat, non-reheat and reheat turbines in the three areas, respectively. The response of the load frequency control problem is improved by designing PID controllers using different tuning techniques, and it is shown that the PID controller designed with the Simulink Design Optimization (SDO) software gives superior performance compared with the other controllers for step perturbations. Finally, the robustness of the controller is verified against system parameter variations.
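For reference, a minimal discrete-time PID control law of the kind tuned for each area is sketched below; the gains and sampling time are placeholders, not the values obtained with Simulink Design Optimization.

```python
# Minimal discrete-time PID controller sketch (placeholder gains).
class PID:
    def __init__(self, kp, ki, kd, dt):
        self.kp, self.ki, self.kd, self.dt = kp, ki, kd, dt
        self.integral = 0.0
        self.prev_error = 0.0

    def step(self, error):
        # error: e.g. the area control error (ACE) of one area at this sample
        self.integral += error * self.dt
        derivative = (error - self.prev_error) / self.dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative
```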


2016 ◽  
Vol 15 (14) ◽  
pp. 7416-7422
Author(s):  
M.Kamel EL-Sayed

In this paper, we introduce an approach for analyzing information concerning an electrical power system. The suggested method results from hybridizing rough set concepts with a nano topology constructed on the set of all data, using the boundary of uncertain decision sets and their lower approximations. Bases of nano topologies are used as indicators for selecting effective features in the information system of a power control. The method is applied to experimental data, which keeps the suggested model close to real-life information.
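The rough-set quantities the method builds on can be sketched as follows; the attribute-based indiscernibility partition and the toy data layout are illustrative assumptions only.

```python
# Illustrative lower approximation and boundary region of a decision set
# with respect to an indiscernibility partition over chosen attributes.
from collections import defaultdict

def partition(objects, attributes):
    """Group objects that are indiscernible on the chosen attributes."""
    blocks = defaultdict(list)
    for name, row in objects.items():
        blocks[tuple(row[a] for a in attributes)].append(name)
    return list(blocks.values())

def lower_approx(blocks, decision_set):
    return {x for b in blocks if set(b) <= decision_set for x in b}

def upper_approx(blocks, decision_set):
    return {x for b in blocks if set(b) & decision_set for x in b}

def boundary(blocks, decision_set):
    # Boundary region = upper approximation minus lower approximation;
    # a basis of the associated nano topology can be read off from the
    # universe, the lower approximation and this boundary.
    return upper_approx(blocks, decision_set) - lower_approx(blocks, decision_set)
```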


1985 ◽  
Vol 1 (CONFERENCE) ◽  
pp. 1-10
Author(s):  
F. Bendary ◽  
M. Drouin ◽  
M. El-Metwally

Complexity ◽  
2021 ◽  
Vol 2021 ◽  
pp. 1-17
Author(s):  
Rui Wang ◽  
Xianghua Gan ◽  
Qing Li ◽  
Xiao Yan

We study a joint pricing and inventory control problem for perishables with positive lead time in a finite-horizon periodic-review system. Unlike most studies, which consider a continuous demand density, in our paper the customer demand depends on the price of the current period and arrives according to a homogeneous Poisson process. We consider both backlogging and lost-sales cases, and our goal is to find a simultaneous ordering and pricing policy that maximizes the expected discounted profit over the planning horizon. When there is no fixed ordering cost involved, we design a deep reinforcement learning algorithm to obtain a near-optimal ordering policy and show that the learned policy exhibits some monotonicity properties. We also show that our deep reinforcement learning algorithm achieves better performance than tabular Q-learning algorithms. When a fixed ordering cost is involved, we show that our deep reinforcement learning algorithm is effective and efficient, and that it circumvents the "curse of dimensionality".
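One period of such an inventory and pricing environment might be sketched as follows, with Poisson demand whose rate depends on the posted price. The linear demand-rate model, the cost parameters, and the simplified treatment of lead time are illustrative assumptions, not the paper's exact formulation.

```python
# Hypothetical one-period transition for the backlogging case:
# negative inventory carries over as backorders.
import numpy as np

def demand_rate(price, a=20.0, b=2.0):
    # Assumed linear price response; demand ~ Poisson(a - b * price).
    return max(a - b * price, 0.0)

def step(inventory, order_qty, price, holding_cost=1.0, penalty=5.0,
         unit_cost=3.0, rng=None):
    rng = rng or np.random.default_rng()
    demand = rng.poisson(demand_rate(price))
    sales = demand                             # backlogged demand is eventually served
    next_inventory = inventory + order_qty - demand   # lead time ignored in this sketch
    reward = (price * sales
              - unit_cost * order_qty
              - holding_cost * max(next_inventory, 0)
              - penalty * max(-next_inventory, 0))
    return next_inventory, reward
```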


Author(s):  
Aviv Tamar ◽  
Yi Wu ◽  
Garrett Thomas ◽  
Sergey Levine ◽  
Pieter Abbeel

We introduce the value iteration network (VIN): a fully differentiable neural network with a `planning module' embedded within. VINs can learn to plan, and are suitable for predicting outcomes that involve planning-based reasoning, such as policies for reinforcement learning. Key to our approach is a novel differentiable approximation of the value-iteration algorithm, which can be represented as a convolutional neural network, and trained end-to-end using standard backpropagation. We evaluate VIN-based policies on discrete and continuous path-planning domains, and on a natural-language based search task. We show that by learning an explicit planning computation, VIN policies generalize better to new, unseen domains. This paper is a significantly abridged, IJCAI-audience-targeted version of the original NIPS 2016 paper with the same title, available here: https://arxiv.org/abs/1602.02867
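The differentiable value-iteration module can be sketched in PyTorch as a convolution over the stacked reward and value maps followed by a max over the action channels; the layer sizes and number of iterations below are assumptions, not the paper's exact architecture.

```python
# Sketch of a value-iteration module: Q = Conv([R; V]), V = max_a Q, repeated k times.
import torch
import torch.nn as nn

class VIModule(nn.Module):
    def __init__(self, n_actions=8, k=20):
        super().__init__()
        self.k = k                                # number of value-iteration steps
        # Input channels: reward map + current value map; one Q channel per action.
        self.q_conv = nn.Conv2d(2, n_actions, kernel_size=3, padding=1, bias=False)

    def forward(self, reward_map):
        # reward_map: (batch, 1, H, W)
        value = torch.zeros_like(reward_map)
        for _ in range(self.k):
            q = self.q_conv(torch.cat([reward_map, value], dim=1))
            value, _ = q.max(dim=1, keepdim=True)  # V(s) = max_a Q(s, a)
        return value
```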

