Reinforcement Learning of the Prediction Horizon in Model Predictive Control

Stable Model Predictive Control Design: Sequential Approach The paper addresses the problem of output feedback stable model predictive control design with guaranteed cost. The proposed design method pursues the idea of sequential design for N prediction horizon using one-step ahead model predictive control design approach. Numerical examples are given to illustrate the effectiveness of the proposed method.

Download Full-text

Model Predictive Control for DC–DC Boost Converters With Reduced-Prediction Horizon and Constant Switching Frequency

IEEE Transactions on Power Electronics ◽

10.1109/tpel.2017.2785255 ◽

2018 ◽

Vol 33 (10) ◽

pp. 9064-9075 ◽

Cited By ~ 26

Author(s):

Long Cheng ◽

Pablo Acuna ◽

Ricardo P. Aguilera ◽

Jiuchun Jiang ◽

Shaoyuan Wei ◽

...

Keyword(s):

Model Predictive Control ◽

Predictive Control ◽

Switching Frequency ◽

Boost Converters ◽

Prediction Horizon

Download Full-text

MODEL PREDICTIVE CONTROL TOOLBOX DESIGN FOR NONSTATIONARY PROCESS

KPI Science News ◽

10.20535/kpisn.2021.1.217992 ◽

2021 ◽

Author(s):

Oleksandr V. Stepanets ◽

Yurii I. Mariiash

Keyword(s):

Optimal Control ◽

Control System ◽

Model Predictive Control ◽

Predictive Control ◽

Complex Structure ◽

Nonstationary Process ◽

Actual State ◽

Quadratic Functional ◽

Linear Quadratic ◽

Prediction Horizon

Background. Model predictive control (MPC) approach is the basic feedback scheme, combined with high adaptive properties, which determines its successful use in the practice of design and operation of control systems. These advantages allow managing multidimensional objects with a complex structure, including nonlinearity, optimizing processes in real time within the constraints on controlled and managed variables, taking into account uncertainties in the task of objects and perturbations. Objective. The purpose of the paper is to design and analyse control system of carbon monoxide oxidation in the convector cavity based on MPC with linear-quadratic cost functional with constraint. Methods. The design of MPC is based on mathematical model of an object (relatively simple). At the current step, the prediction of object dynamic response on some final period of time (prediction horizon) is carried out; control optimization is performed, the purpose of which is to approximate the control variables of the prediction model to the corresponding setpoint on the predict horizon. The found optimal control is applied and measurement of an actual state of object at the end of a step is carried out. The prediction horizon is shifted one step further, and this algorithm are repeated. Results. The results of modeling the automatic control system show that the MPC approach provides maintenance of carbon dioxide content when changing oxygen consumption and overshoot caused by introduction bulk does not exceed 0.6 % that meets the technological requirements of the process. Conclusions. A fuse of the MPC and the quadratic functional given the constraints on the input signals is proposed. The problems of control degree of carbon oxidation in the convector cavity include non-stationarity, so the use of classical control methods is difficult. The MPC approach minimizes the cost function that characterizes the quality of the process. The predicted behaviour of a dynamic system will usually differ from its actual motion. The obtained quadratic functional is optimized to find the optimal control of degree of CO oxidation to CO2.

Download Full-text

Reinforcement Learning for Weight Tuning of Model Predictive Control Matrices

10.14264/35b22f2 ◽

2020 ◽

Author(s):

◽

Steven Klotz

Keyword(s):

Reinforcement Learning ◽

Model Predictive Control ◽

Predictive Control

Download Full-text

Model Predictive Control Tuning by Inverse Matching for a Wave Energy Converter

Energies ◽

10.3390/en12214158 ◽

2019 ◽

Vol 12 (21) ◽

pp. 4158 ◽

Cited By ~ 2

Author(s):

Hancheol Cho ◽

Giorgio Bacelli ◽

Ryan G. Coe

Keyword(s):

Model Predictive Control ◽

Predictive Control ◽

Wave Energy ◽

Wave Energy Converter ◽

Energy Converter ◽

Feedback Form ◽

Linear Controller ◽

Prediction Horizon ◽

The Cost ◽

Input And State Constraints

This paper investigates the application of a method to find the cost function or the weight matrices to be used in model predictive control (MPC) such that the MPC has the same performance as a predesigned linear controller in state-feedback form when constraints are not active. This is potentially useful when a successful linear controller already exists and it is necessary to incorporate the constraint-handling capabilities of MPC. This is the case for a wave energy converter (WEC), where the maximum power transfer law is well-understood. In addition to solutions based on numerical optimization, a simple analytical solution is also derived for cases with a short prediction horizon. These methods are applied for the control of an empirically-based WEC model. The results show that the MPC can be successfully tuned to follow an existing linear control law and to comply with both input and state constraints, such as actuator force and actuator stroke.

Download Full-text

Optimally designed Lyapunov–Krasovskii terminal costs for robust stable–feasible model predictive control of uncertain time-delay nonlinear dynamical systems

Proceedings of the Institution of Mechanical Engineers Part I Journal of Systems and Control Engineering ◽

10.1177/0959651820952193 ◽

2020 ◽

pp. 095965182095219

Author(s):

S Yaqubi ◽

MR Homaeinezhad

Keyword(s):

Time Delay ◽

Dynamical Systems ◽

Model Predictive Control ◽

Predictive Control ◽

Robust Stability ◽

Control Algorithm ◽

Nonlinear Dynamical Systems ◽

Prediction Horizon ◽

Delay Effects ◽

Uncertain Time

This article details a new Model Predictive Control algorithm ensuring robust stability and control feasibility for uncertain nonlinear multi-input multi-output dynamical systems considering uncertain time-delay effects. The proposed control algorithm is based on construction of a Lyapunov–Krasovskii functional as terminal cost. Incorporation of this terminal cost into the Model Predictive Control optimization problem and calculation of the associated admissible set result in robust feasibility and robust stability of closed-loop system in presence of uncertain time-delay effects and bounded disturbance signals. The Lyapunov–Krasovskii functional term is constructed with respect to predicted sliding functions over the prediction horizon and considers the effects of dynamical variations over the prediction horizon in generation of control inputs. As dynamical variations are investigated in a sample-to-sample basis, feasible sliding regions are updated at each sample as well. Finally, based on expression of sliding functions as a combination of dynamical variations and input-based terms, required control inputs are calculated in the admissible bound by the optimization algorithm. Construction of control scheme on this basis permits straightforward calculation of robust stability and feasibility conditions for a general class of uncertain nonlinear system in finite prediction horizon whereas in the previous works, often-restrictive conditions were considered for the investigated dynamical systems. Numerical illustrations indicate precision and efficiency of control algorithm and improved stability and convergence rate for multivariable nonlinear dynamical systems considering uncertain time-delay effects. Finally, hardware-in-the-loop implementation indicates applicability of the proposed scheme in real-time control applications particularly in case appropriate compromises between optimality and calculation speed are considered.

Download Full-text

Towards Real-Time Reinforcement Learning Control of a Wave Energy Converter

Journal of Marine Science and Engineering ◽

10.3390/jmse8110845 ◽

2020 ◽

Vol 8 (11) ◽

pp. 845

Author(s):

Enrico Anderlini ◽

Salman Husain ◽

Gordon G. Parker ◽

Mohammad Abusara ◽

Giles Thomas

Keyword(s):

Reinforcement Learning ◽

Model Predictive Control ◽

System Dynamics ◽

Real Time ◽

Predictive Control ◽

Wave Energy ◽

Economies Of Scale ◽

Computational Cost ◽

Effective Control ◽

Deployment Time

The levellised cost of energy of wave energy converters (WECs) is not competitive with fossil fuel-powered stations yet. To improve the feasibility of wave energy, it is necessary to develop effective control strategies that maximise energy absorption in mild sea states, whilst limiting motions in high waves. Due to their model-based nature, state-of-the-art control schemes struggle to deal with model uncertainties, adapt to changes in the system dynamics with time, and provide real-time centralised control for large arrays of WECs. Here, an alternative solution is introduced to address these challenges, applying deep reinforcement learning (DRL) to the control of WECs for the first time. A DRL agent is initialised from data collected in multiple sea states under linear model predictive control in a linear simulation environment. The agent outperforms model predictive control for high wave heights and periods, but suffers close to the resonant period of the WEC. The computational cost at deployment time of DRL is also much lower by diverting the computational effort from deployment time to training. This provides confidence in the application of DRL to large arrays of WECs, enabling economies of scale. Additionally, model-free reinforcement learning can autonomously adapt to changes in the system dynamics, enabling fault-tolerant control.

Download Full-text