Online Synchronous Policy Iteration Method for Optimal Control

The policy iteration method is a classical algorithm for solving optimal control problems. In this paper, we introduce a policy iteration method for Mean Field Games systems, and we study the convergence of this procedure to a solution of the problem. We also introduce suitable discretizations to numerically solve both stationary and evolutive problems. We show the convergence of the policy iteration method for the discrete problem and we study the performance of the proposed algorithm on some examples in dimension one and two.

Download Full-text

Data-based approximate policy iteration for affine nonlinear continuous-time optimal control design

Automatica ◽

10.1016/j.automatica.2014.10.056 ◽

2014 ◽

Vol 50 (12) ◽

pp. 3281-3290 ◽

Cited By ~ 143

Author(s):

Biao Luo ◽

Huai-Ning Wu ◽

Tingwen Huang ◽

Derong Liu

Keyword(s):

Optimal Control ◽

Continuous Time ◽

Control Design ◽

Policy Iteration ◽

Time Optimal Control ◽

Time Optimal ◽

Approximate Policy Iteration

Download Full-text

Adaptive Optimal Robust Control for Uncertain Nonlinear Systems Using Neural Network Approximation in Policy Iteration

Applied Sciences ◽

10.3390/app11052312 ◽

2021 ◽

Vol 11 (5) ◽

pp. 2312

Author(s):

Dengguo Xu ◽

Qinglin Wang ◽

Yuan Li

Keyword(s):

Neural Network ◽

Optimal Control ◽

Robust Control ◽

Nonlinear Systems ◽

Policy Iteration ◽

Control Law ◽

Globally Asymptotically Stable ◽

Control Approach ◽

Input Uncertainties ◽

Theoretical Results

In this study, based on the policy iteration (PI) in reinforcement learning (RL), an optimal adaptive control approach is established to solve robust control problems of nonlinear systems with internal and input uncertainties. First, the robust control is converted into solving an optimal control containing a nominal or auxiliary system with a predefined performance index. It is demonstrated that the optimal control law enables the considered system globally asymptotically stable for all admissible uncertainties. Second, based on the Bellman optimality principle, the online PI algorithms are proposed to calculate robust controllers for the matched and the mismatched uncertain systems. The approximate structure of the robust control law is obtained by approximating the optimal cost function with neural network in PI algorithms. Finally, in order to illustrate the availability of the proposed algorithm and theoretical results, some numerical examples are provided.

Download Full-text

The RSS-like iteration method for block two-by-two linear systems from time-periodic parabolic optimal control problems

Applied Mathematics and Computation ◽

10.1016/j.amc.2021.126477 ◽

2021 ◽

Vol 410 ◽

pp. 126477

Author(s):

Min-Li Zeng

Keyword(s):

Optimal Control ◽

Linear Systems ◽

Iteration Method ◽

Optimal Control Problems ◽

Control Problems ◽

Parabolic Optimal Control ◽

Time Periodic

Download Full-text

Discrete-Time Nonlinear Generalized Policy Iteration for Optimal Control Using Neural Networks

Neural Information Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-319-12637-1_49 ◽

2014 ◽

pp. 389-396

Author(s):

Qinglai Wei ◽

Derong Liu ◽

Xiong Yang

Keyword(s):

Neural Networks ◽

Optimal Control ◽

Discrete Time ◽

Policy Iteration

Download Full-text

The policy iteration method for the optimal stopping of a markov chain with an application

Lecture Notes in Computer Science - Optimization Techniques Modeling and Optimization in the Service of Man Part 2 ◽

10.1007/3-540-07623-9_277 ◽

1976 ◽

pp. 22-36

Author(s):

K. M. Hee

Keyword(s):

Markov Chain ◽

Optimal Stopping ◽

Iteration Method ◽

Policy Iteration

Download Full-text

Finding the optimal control of linear systems via He's variational iteration method

International Journal of Computer Mathematics ◽

10.1080/00207160903019480 ◽

2010 ◽

Vol 87 (5) ◽

pp. 1042-1050 ◽

Cited By ~ 29

Author(s):

S. A. Yousefi ◽

Mehdi Dehghan ◽

A. Lotfi

Keyword(s):

Optimal Control ◽

Linear Systems ◽

Iteration Method ◽

Variational Iteration Method ◽

Variational Iteration ◽

Control Of Linear Systems ◽

He’S Variational Iteration Method

Download Full-text

Motion planning of a quadrotor robot game using a simulation-based projected policy iteration method

Frontiers of Information Technology & Electronic Engineering ◽

10.1631/fitee.1800571 ◽

2019 ◽

Vol 20 (4) ◽

pp. 525-537

Author(s):

Li-dong Zhang ◽

Ban Wang ◽

Zhi-xiang Liu ◽

You-min Zhang ◽

Jian-liang Ai

Keyword(s):

Motion Planning ◽

Iteration Method ◽

Policy Iteration ◽

Simulation Based ◽

Quadrotor Robot

Download Full-text

Adaptive Optimal Control for a Class of Nonlinear Systems: The Online Policy Iteration Approach

IEEE Transactions on Neural Networks and Learning Systems ◽

10.1109/tnnls.2019.2905715 ◽

2020 ◽

Vol 31 (2) ◽

pp. 549-558 ◽

Cited By ~ 38

Author(s):

Shuping He ◽

Haiyang Fang ◽

Maoguang Zhang ◽

Fei Liu ◽

Zhengtao Ding

Keyword(s):

Optimal Control ◽

Nonlinear Systems ◽

Policy Iteration ◽

Adaptive Optimal Control

Download Full-text

Neuro-Optimal Control for Discrete Stochastic Processes via a Novel Policy Iteration Algorithm

IEEE Transactions on Systems Man and Cybernetics Systems ◽

10.1109/tsmc.2019.2907991 ◽

2020 ◽

Vol 50 (11) ◽

pp. 3972-3985 ◽

Cited By ~ 1

Author(s):

Mingming Liang ◽

Ding Wang ◽

Derong Liu

Keyword(s):

Optimal Control ◽

Stochastic Processes ◽

Policy Iteration ◽

Iteration Algorithm ◽

Policy Iteration Algorithm

Download Full-text