scholarly journals Reinforcement Learning of the Prediction Horizon in Model Predictive Control

2021 ◽  
Vol 54 (6) ◽  
pp. 314-320
Author(s):  
Eivind Bøhn ◽  
Sebastien Gros ◽  
Signe Moe ◽  
Tor Arne Johansen
2021 ◽  
Vol 303 ◽  
pp. 117634
Author(s):  
Glenn Ceusters ◽  
Román Cantú Rodríguez ◽  
Alberte Bouso García ◽  
Rüdiger Franke ◽  
Geert Deconinck ◽  
...  

2011 ◽  
Vol 62 (2) ◽  
pp. 99-103
Author(s):  
Vojtech Veselý

Stable Model Predictive Control Design: Sequential Approach The paper addresses the problem of output feedback stable model predictive control design with guaranteed cost. The proposed design method pursues the idea of sequential design for N prediction horizon using one-step ahead model predictive control design approach. Numerical examples are given to illustrate the effectiveness of the proposed method.


2018 ◽  
Vol 33 (10) ◽  
pp. 9064-9075 ◽  
Author(s):  
Long Cheng ◽  
Pablo Acuna ◽  
Ricardo P. Aguilera ◽  
Jiuchun Jiang ◽  
Shaoyuan Wei ◽  
...  

Author(s):  
Oleksandr V. Stepanets ◽  
Yurii I. Mariiash

Background. Model predictive control (MPC) approach is the basic feedback scheme, combined with high adaptive properties, which determines its successful use in the practice of design and operation of control systems. These advantages allow managing multidimensional objects with a complex structure, including nonlinearity, optimizing processes in real time within the constraints on controlled and managed variables, taking into account uncertainties in the task of objects and perturbations. Objective. The purpose of the paper is to design and analyse control system of carbon monoxide oxidation in the convector cavity based on MPC with linear-quadratic cost functional with constraint. Methods. The design of MPC is based on mathematical model of an object (relatively simple). At the current step, the prediction of object dynamic response on some final period of time (prediction horizon) is carried out; control optimization is performed, the purpose of which is to approximate the control variables of the prediction model to the corresponding setpoint on the predict horizon. The found optimal control is applied and measurement of an actual state of object at the end of a step is carried out. The prediction horizon is shifted one step further, and this algorithm are repeated. Results. The results of modeling the automatic control system show that the MPC approach provides maintenance of carbon dioxide content when changing oxygen consumption and overshoot caused by introduction bulk does not exceed 0.6 % that meets the technological requirements of the process. Conclusions. A fuse of the MPC and the quadratic functional given the constraints on the input signals is proposed. The problems of control degree of carbon oxidation in the convector cavity include non-stationarity, so the use of classical control methods is difficult. The MPC approach minimizes the cost function that characterizes the quality of the process. The predicted behaviour of a dynamic system will usually differ from its actual motion. The obtained quadratic functional is optimized to find the optimal control of degree of CO oxidation to CO2.


Energies ◽  
2019 ◽  
Vol 12 (21) ◽  
pp. 4158 ◽  
Author(s):  
Hancheol Cho ◽  
Giorgio Bacelli ◽  
Ryan G. Coe

This paper investigates the application of a method to find the cost function or the weight matrices to be used in model predictive control (MPC) such that the MPC has the same performance as a predesigned linear controller in state-feedback form when constraints are not active. This is potentially useful when a successful linear controller already exists and it is necessary to incorporate the constraint-handling capabilities of MPC. This is the case for a wave energy converter (WEC), where the maximum power transfer law is well-understood. In addition to solutions based on numerical optimization, a simple analytical solution is also derived for cases with a short prediction horizon. These methods are applied for the control of an empirically-based WEC model. The results show that the MPC can be successfully tuned to follow an existing linear control law and to comply with both input and state constraints, such as actuator force and actuator stroke.


Author(s):  
S Yaqubi ◽  
MR Homaeinezhad

This article details a new Model Predictive Control algorithm ensuring robust stability and control feasibility for uncertain nonlinear multi-input multi-output dynamical systems considering uncertain time-delay effects. The proposed control algorithm is based on construction of a Lyapunov–Krasovskii functional as terminal cost. Incorporation of this terminal cost into the Model Predictive Control optimization problem and calculation of the associated admissible set result in robust feasibility and robust stability of closed-loop system in presence of uncertain time-delay effects and bounded disturbance signals. The Lyapunov–Krasovskii functional term is constructed with respect to predicted sliding functions over the prediction horizon and considers the effects of dynamical variations over the prediction horizon in generation of control inputs. As dynamical variations are investigated in a sample-to-sample basis, feasible sliding regions are updated at each sample as well. Finally, based on expression of sliding functions as a combination of dynamical variations and input-based terms, required control inputs are calculated in the admissible bound by the optimization algorithm. Construction of control scheme on this basis permits straightforward calculation of robust stability and feasibility conditions for a general class of uncertain nonlinear system in finite prediction horizon whereas in the previous works, often-restrictive conditions were considered for the investigated dynamical systems. Numerical illustrations indicate precision and efficiency of control algorithm and improved stability and convergence rate for multivariable nonlinear dynamical systems considering uncertain time-delay effects. Finally, hardware-in-the-loop implementation indicates applicability of the proposed scheme in real-time control applications particularly in case appropriate compromises between optimality and calculation speed are considered.


2020 ◽  
Vol 8 (11) ◽  
pp. 845
Author(s):  
Enrico Anderlini ◽  
Salman Husain ◽  
Gordon G. Parker ◽  
Mohammad Abusara ◽  
Giles Thomas

The levellised cost of energy of wave energy converters (WECs) is not competitive with fossil fuel-powered stations yet. To improve the feasibility of wave energy, it is necessary to develop effective control strategies that maximise energy absorption in mild sea states, whilst limiting motions in high waves. Due to their model-based nature, state-of-the-art control schemes struggle to deal with model uncertainties, adapt to changes in the system dynamics with time, and provide real-time centralised control for large arrays of WECs. Here, an alternative solution is introduced to address these challenges, applying deep reinforcement learning (DRL) to the control of WECs for the first time. A DRL agent is initialised from data collected in multiple sea states under linear model predictive control in a linear simulation environment. The agent outperforms model predictive control for high wave heights and periods, but suffers close to the resonant period of the WEC. The computational cost at deployment time of DRL is also much lower by diverting the computational effort from deployment time to training. This provides confidence in the application of DRL to large arrays of WECs, enabling economies of scale. Additionally, model-free reinforcement learning can autonomously adapt to changes in the system dynamics, enabling fault-tolerant control.


Sign in / Sign up

Export Citation Format

Share Document