CertRL: formalizing convergence proofs for value and policy iteration in Coq

Author(s): Koundinya Vajjha, Avraham Shinnar, Barry Trager, Vasily Pestun, Nathan Fulton

2021, Vol. 11 (5), pp. 2312

Author(s): Dengguo Xu, Qinglin Wang, Yuan Li

In this study, based on policy iteration (PI) in reinforcement learning (RL), an optimal adaptive control approach is established to solve robust control problems for nonlinear systems with internal and input uncertainties. First, the robust control problem is converted into an optimal control problem for a nominal or auxiliary system with a predefined performance index. It is demonstrated that the optimal control law renders the considered system globally asymptotically stable for all admissible uncertainties. Second, based on the Bellman optimality principle, online PI algorithms are proposed to compute robust controllers for both the matched and the mismatched uncertain systems. An approximate structure of the robust control law is obtained by approximating the optimal cost function with a neural network within the PI algorithms. Finally, numerical examples are provided to illustrate the effectiveness of the proposed algorithm and the theoretical results.
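To make the evaluate/improve loop that the abstract builds on concrete, the following is a minimal sketch of generic policy iteration on a small, made-up discounted MDP. The transition matrices, rewards, and discount factor are arbitrary placeholders; this is not the authors' continuous-time, neural-network-based algorithm, only the underlying PI scheme.

```python
import numpy as np

# Hypothetical MDP: P[a, s, s'] is the transition matrix under action a,
# R[s, a] the immediate reward, gamma the discount factor.
n_states, n_actions, gamma = 4, 2, 0.9
rng = np.random.default_rng(0)
P = rng.dirichlet(np.ones(n_states), size=(n_actions, n_states))  # shape (a, s, s')
R = rng.standard_normal((n_states, n_actions))

policy = np.zeros(n_states, dtype=int)          # start from an arbitrary policy
for _ in range(100):
    # Policy evaluation: solve (I - gamma * P_pi) V = R_pi exactly.
    P_pi = P[policy, np.arange(n_states)]       # row s is P[policy[s], s, :]
    R_pi = R[np.arange(n_states), policy]
    V = np.linalg.solve(np.eye(n_states) - gamma * P_pi, R_pi)

    # Policy improvement: act greedily with respect to the one-step Bellman backup.
    Q = R + gamma * np.einsum('ast,t->sa', P, V)
    new_policy = Q.argmax(axis=1)
    if np.array_equal(new_policy, policy):
        break                                   # greedy policy is stable, hence optimal
    policy = new_policy

print("greedy policy:", policy, "values:", np.round(V, 3))
```

The same two-step structure (evaluate the current policy, then improve it via the Bellman operator) is what the online PI algorithms in the paper carry over to the continuous-time, uncertain setting, with the exact linear solve replaced by neural-network approximation of the cost function.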


Author(s):  
Sudeep Kundu ◽  
Karl Kunisch

Abstract: Policy iteration is a widely used technique for solving the Hamilton–Jacobi–Bellman (HJB) equation, which arises in nonlinear optimal feedback control theory. Its convergence analysis has attracted much attention in the unconstrained case. Here we analyze the case with control constraints, for the HJB equations that arise in both the deterministic and the stochastic control settings. The linear equations in each iteration step are solved by an implicit upwind scheme. Numerical examples are presented for the HJB equation with control constraints, and comparisons are made with the unconstrained cases.
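As a rough illustration of the procedure the abstract describes, the following is a minimal sketch of policy iteration with an upwind discretization and a box control constraint for a toy one-dimensional deterministic HJB equation. The dynamics, running cost, discount rate, grid, and boundary treatment are all illustrative assumptions, not the scheme used in the paper.

```python
import numpy as np

# Toy problem (assumed for illustration): dynamics x' = a, cost l(x, a) = x^2 + a^2,
# discount rho, constraint |a| <= a_max.  The HJB equation is
#   rho V(x) = min_{|a| <= a_max} [ l(x, a) + a V'(x) ].
rho, a_max, h = 0.5, 0.5, 0.02
x = np.arange(-1.0, 1.0 + h, h)            # spatial grid on [-1, 1]
N = len(x)
controls = np.linspace(-a_max, a_max, 41)  # discretized admissible control set

def upwind_derivative(V, i, drift):
    """One-sided (upwind) difference of V at index i, chosen by the drift sign.
    Falls back to the inward difference at the boundary (crude treatment)."""
    if drift >= 0:
        return (V[i + 1] - V[i]) / h if i + 1 < N else (V[i] - V[i - 1]) / h
    return (V[i] - V[i - 1]) / h if i - 1 >= 0 else (V[i + 1] - V[i]) / h

def evaluate_policy(policy):
    """Policy evaluation: solve the linear upwind system
       rho V_i = l(x_i, a_i) + a_i * (upwind difference of V at i)."""
    A = np.zeros((N, N))
    b = np.zeros(N)
    for i, a in enumerate(policy):
        A[i, i] = rho
        b[i] = x[i] ** 2 + a ** 2
        if a >= 0 and i + 1 < N:        # forward difference
            A[i, i] += a / h
            A[i, i + 1] -= a / h
        elif a < 0 and i - 1 >= 0:      # backward difference
            A[i, i] -= a / h
            A[i, i - 1] += a / h
        # at an outflow boundary the transport term is simply dropped
    return np.linalg.solve(A, b)

def improve_policy(V):
    """Policy improvement: minimize the Hamiltonian over the constrained control set."""
    new_policy = np.empty(N)
    for i in range(N):
        hams = [x[i] ** 2 + a ** 2 + a * upwind_derivative(V, i, a) for a in controls]
        new_policy[i] = controls[int(np.argmin(hams))]
    return new_policy

policy = np.zeros(N)                      # initial guess: zero control
for k in range(50):
    V = evaluate_policy(policy)
    new_policy = improve_policy(V)
    if np.max(np.abs(new_policy - policy)) < 1e-12:
        break
    policy = new_policy

print(f"stopped after {k + 1} iterations, V(0) ~ {V[N // 2]:.4f}")
```

Each iteration alternates a linear solve for the value of the current feedback law with a pointwise constrained minimization of the Hamiltonian, which is the structure the paper analyzes; the stochastic case adds a diffusion term to the linear equations.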

