Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles

10.36227/techrxiv.17205740 ◽

2021 ◽

Author(s):

Xinglong Zhang ◽

Yaoqian Peng ◽

Biao Luo ◽

Wei Pan ◽

Xin Xu ◽

...

Keyword(s):

Optimal Control ◽

Reinforcement Learning ◽

Control Policy ◽

Intelligent Vehicles ◽

Time Varying ◽

Control Constraints ◽

Model Based ◽

Safety Constraints ◽

And Control ◽

State And Control Constraints

<div>Recently, barrier function-based safe reinforcement learning (RL) with the actor-critic structure for continuous control tasks has received increasing attention. It is still challenging to learn a near-optimal control policy with safety and convergence guarantees. Also, few works have addressed the safe RL algorithm design under time-varying safety constraints. This paper proposes a model-based safe RL algorithm for optimal control of nonlinear systems with time-varying state and control constraints. In the proposed approach, we construct a novel barrier-based control policy structure that can guarantee control safety. A multi-step policy evaluation mechanism is proposed to predict the policy's safety risk under time-varying safety constraints and guide the policy to update safely. Theoretical results on stability and robustness are proven. Also, the convergence of the actor-critic learning algorithm is analyzed. The performance of the proposed algorithm outperforms several state-of-the-art RL algorithms in the simulated Safety Gym environment. Furthermore, the approach is applied to the integrated path following and collision avoidance problem for two real-world intelligent vehicles. A differential-drive vehicle and an Ackermann-drive one are used to verify the offline deployment performance and the online learning performance, respectively. Our approach shows an impressive sim-to-real transfer capability and a satisfactory online control performance in the experiment.</div>

Download Full-text

Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles

10.36227/techrxiv.17205740.v1 ◽

2021 ◽

Author(s):

Xinglong Zhang ◽

Yaoqian Peng ◽

Biao Luo ◽

Wei Pan ◽

Xin Xu ◽

...

Keyword(s):

Optimal Control ◽

Reinforcement Learning ◽

Control Policy ◽

Intelligent Vehicles ◽

Time Varying ◽

Control Constraints ◽

Model Based ◽

Safety Constraints ◽

And Control ◽

State And Control Constraints

<div>Recently, barrier function-based safe reinforcement learning (RL) with the actor-critic structure for continuous control tasks has received increasing attention. It is still challenging to learn a near-optimal control policy with safety and convergence guarantees. Also, few works have addressed the safe RL algorithm design under time-varying safety constraints. This paper proposes a model-based safe RL algorithm for optimal control of nonlinear systems with time-varying state and control constraints. In the proposed approach, we construct a novel barrier-based control policy structure that can guarantee control safety. A multi-step policy evaluation mechanism is proposed to predict the policy's safety risk under time-varying safety constraints and guide the policy to update safely. Theoretical results on stability and robustness are proven. Also, the convergence of the actor-critic learning algorithm is analyzed. The performance of the proposed algorithm outperforms several state-of-the-art RL algorithms in the simulated Safety Gym environment. Furthermore, the approach is applied to the integrated path following and collision avoidance problem for two real-world intelligent vehicles. A differential-drive vehicle and an Ackermann-drive one are used to verify the offline deployment performance and the online learning performance, respectively. Our approach shows an impressive sim-to-real transfer capability and a satisfactory online control performance in the experiment.</div>

Download Full-text

Optimal Control Problems with State and Control Constraints

Applied and Computational Optimal Control - Springer Optimization and Its Applications ◽

10.1007/978-3-030-69913-0_9 ◽

2021 ◽

pp. 315-369

Author(s):

Kok Lay Teo ◽

Bin Li ◽

Changjun Yu ◽

Volker Rehbock

Keyword(s):

Optimal Control ◽

Optimal Control Problems ◽

Control Problems ◽

Control Constraints ◽

And Control ◽

State And Control Constraints

Download Full-text

Time-varying, saturating feedback control of linear systems under state and control constraints

10.1109/cdc.1990.203513 ◽

1990 ◽

Cited By ~ 4

Author(s):

G.A. Pajunen ◽

N. Erdol

Keyword(s):

Feedback Control ◽

Linear Systems ◽

Time Varying ◽

Control Constraints ◽

Control Of Linear Systems ◽

And Control ◽

State And Control Constraints

Download Full-text

A-posteriori error estimates for optimal control problems with state and control constraints

Numerische Mathematik ◽

10.1007/s00211-011-0422-z ◽

2011 ◽

Vol 120 (4) ◽

pp. 733-762 ◽

Cited By ~ 25

Author(s):

Arnd Rösch ◽

Daniel Wachsmuth

Keyword(s):

Optimal Control ◽

Optimal Control Problems ◽

A Posteriori Error Estimates ◽

Posteriori Error ◽

Control Problems ◽

Control Constraints ◽

A Posteriori ◽

A Posteriori Error ◽

And Control ◽

State And Control Constraints

Download Full-text

Sufficient conditions for optimal control with state and control constraints

Journal of Optimization Theory and Applications ◽

10.1007/bf00945421 ◽

1971 ◽

Vol 7 (2) ◽

pp. 118-135 ◽

Cited By ~ 32

Author(s):

Harold Stalford

Keyword(s):

Optimal Control ◽

Sufficient Conditions ◽

Control Constraints ◽

And Control ◽

State And Control Constraints

Download Full-text

A Feasible Directions Algorithm for Optimal Control Problems with State and Control Constraints: Convergence Analysis

SIAM Journal on Control and Optimization ◽

10.1137/s0363012996297649 ◽

1998 ◽

Vol 36 (6) ◽

pp. 1999-2019 ◽

Cited By ~ 24

Author(s):

R. Pytlak ◽

R. B. Vinter

Keyword(s):

Optimal Control ◽

Convergence Analysis ◽

Optimal Control Problems ◽

Control Problems ◽

Control Constraints ◽

Feasible Directions ◽

And Control ◽

State And Control Constraints

Download Full-text

Decentralized Reinforcement Learning Robust Optimal Tracking Control for Time Varying Constrained Reconfigurable Modular Robot Based on ACI andQ-Function

Mathematical Problems in Engineering ◽

10.1155/2013/387817 ◽

2013 ◽

Vol 2013 ◽

pp. 1-16 ◽

Cited By ~ 9

Author(s):

Bo Dong ◽

Yuanchun Li

Keyword(s):

Optimal Control ◽

Reinforcement Learning ◽

Control Theory ◽

Continuous Time ◽

Tracking Control ◽

Control Policy ◽

Time Varying ◽

Optimal Tracking ◽

Tracking Controller ◽

Global Uncertainty

A novel decentralized reinforcement learning robust optimal tracking control theory for time varying constrained reconfigurable modular robots based on action-critic-identifier (ACI) and state-action value function (Q-function) has been presented to solve the problem of the continuous time nonlinear optimal control policy for strongly coupled uncertainty robotic system. The dynamics of time varying constrained reconfigurable modular robot is described as a synthesis of interconnected subsystem, and continuous time state equation andQ-function have been designed in this paper. Combining with ACI and RBF network, the global uncertainty of the subsystem and the HJB (Hamilton-Jacobi-Bellman) equation have been estimated, where critic-NN and action-NN are used to approximate the optimalQ-function and the optimal control policy, and the identifier is adopted to identify the global uncertainty as well as RBF-NN which is used to update the weights of ACI-NN. On this basis, a novel decentralized robust optimal tracking controller of the subsystem is proposed, so that the subsystem can track the desired trajectory and the tracking error can converge to zero in a finite time. The stability of ACI and the robust optimal tracking controller are confirmed by Lyapunov theory. Finally, comparative simulation examples are presented to illustrate the effectiveness of the proposed ACI and decentralized control theory.

Download Full-text

Feasible Direction Algorithm for Optimal Control Problems with State and Control Constraints: Implementation

Journal of Optimization Theory and Applications ◽

10.1023/a:1021742204850 ◽

1999 ◽

Vol 101 (3) ◽

pp. 623-649 ◽

Cited By ~ 15

Author(s):

R. Pytlak ◽

R. B. Vinter

Keyword(s):

Optimal Control ◽

Optimal Control Problems ◽

Control Problems ◽

Control Constraints ◽

Feasible Direction ◽

Feasible Direction Algorithm ◽

And Control ◽

State And Control Constraints

Download Full-text

A morley finite element method for an elliptic distributed optimal control problem with pointwise state and control constraints

ESAIM Control Optimisation and Calculus of Variations ◽

10.1051/cocv/2017031 ◽

2018 ◽

Vol 24 (3) ◽

pp. 1181-1206 ◽

Cited By ~ 1

Author(s):

Susanne C. Brenner ◽

Thirupathi Gudi ◽

Kamana Porwal ◽

Li-yeng Sung

Keyword(s):

Finite Element Method ◽

Optimal Control ◽

Finite Element ◽

Optimal Control Problem ◽

Control Problem ◽

Control Constraints ◽

Distributed Optimal Control ◽

And Control ◽

State And Control Constraints ◽

Element Method

We design and analyze a Morley finite element method for an elliptic distributed optimal control problem with pointwise state and control constraints on convex polygonal domains. It is based on the formulation of the optimal control problem as a fourth order variational inequality. Numerical results that illustrate the performance of the method are also presented.

Download Full-text