Stochastic LQR optimal control with white and colored noise: Dynamic programming technique

2021 ◽  
Vol 20 (2) ◽  
pp. 1113-1129
Author(s):  
B. Escobedo-Trujillo ◽  
◽  
J. Garrido-Meléndez
Author(s):  
Ali Benmerzouga

A solution to the control of switching linear systems with input constraints was given in Benmerzouga (1997) for both the conventional enumeration approach and the new approach. The solution given there turned out not to be unique. The main objective of this work is to determine the optimal control sequences {Ui(k), i = 1, ..., M; k = 0, 1, ..., N-1} that transfer the system from a given initial state X0 to a specific target state XT (or as close to it as possible), using the same discrete-time solution obtained in Benmerzouga (1997) while minimizing a running cost-to-go function. Using the dynamic programming technique, the optimal solution is found for both approaches given in Benmerzouga (1997). The computational complexity of the modified algorithm is also given.
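
The backward recursion behind a dynamic programming solution of this kind can be sketched on a toy problem. The Python example below is only an illustration and is not the algorithm of Benmerzouga (1997): the scalar modes in `MODES`, the integer input set `INPUTS` (standing in for an input constraint |u| <= 1), the horizon, the target, and both cost functions are hypothetical choices made so that the recursion stays exact and small.

```python
from functools import lru_cache

# Hypothetical scalar modes (a_i, b_i): x[k+1] = a_i * x[k] + b_i * u[k]
MODES = ((1, 1), (2, 1), (1, -2))
INPUTS = (-1, 0, 1)   # assumed input constraint |u| <= 1, integer-valued
N = 4                 # assumed horizon length
X_TARGET = 5          # assumed target state


def stage_cost(x, u):
    # Assumed running cost: squared distance to target plus control effort
    return (x - X_TARGET) ** 2 + u * u


def terminal_cost(x):
    # Assumed penalty for missing the target at the final step
    return 10 * (x - X_TARGET) ** 2


@lru_cache(maxsize=None)
def cost_to_go(k, x):
    """Return (optimal cost from state x at step k, optimal (mode, u) sequence)."""
    if k == N:
        return terminal_cost(x), ()
    best = None
    for i, (a, b) in enumerate(MODES):
        for u in INPUTS:
            # Bellman recursion: stage cost plus optimal cost of the successor
            tail_cost, tail_seq = cost_to_go(k + 1, a * x + b * u)
            total = stage_cost(x, u) + tail_cost
            if best is None or total < best[0]:
                best = (total, ((i, u),) + tail_seq)
    return best


def simulate(x0, sequence):
    """Apply a (mode, u) sequence from x0; return (total cost, final state)."""
    x, total = x0, 0
    for i, u in sequence:
        a, b = MODES[i]
        total += stage_cost(x, u)
        x = a * x + b * u
    return total + terminal_cost(x), x
```

Calling `cost_to_go(0, x0)` returns the minimal cost together with one minimizing sequence of (mode, input) pairs, and `simulate` replays that sequence as a check. Memoizing on `(k, x)` means each reachable state is evaluated once per step, which is where the saving over plain enumeration of all sequences comes from in this toy setting.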


Sensors ◽  
2021 ◽  
Vol 21 (16) ◽  
pp. 5609
Author(s):  
Xiaowei Xing ◽  
Dong Eui Chang

The paper develops the adaptive dynamic programming toolbox (ADPT), a MATLAB-based software package that numerically solves optimal control problems for continuous-time control-affine systems. The ADPT produces approximate optimal feedback controls by employing the adaptive dynamic programming technique to solve the Hamilton–Jacobi–Bellman equation approximately. A novel implementation method is derived to reduce the ADPT's memory consumption throughout its execution. The ADPT supports two working modes: model-based and model-free. In the former, the ADPT computes optimal feedback controls given the system dynamics. In the latter, optimal feedback controls are generated from measurements of system trajectories, without requiring knowledge of the system model. Multiple setting options are provided so that various customized circumstances can be accommodated. Compared to other popular software toolboxes for optimal control, the ADPT features computational precision and time efficiency, as illustrated by its application to a highly non-linear satellite attitude control problem.
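
To make the model-based idea concrete, the sketch below works through the simplest special case: a scalar linear system with quadratic cost, where the Hamilton–Jacobi–Bellman equation reduces to an algebraic Riccati equation and policy iteration has a closed-form evaluation step. It is written in Python rather than MATLAB and does not use or reproduce the ADPT's interface; the function names and the parameters `a`, `b`, `q`, `r`, `k0` are assumptions introduced for illustration only.

```python
import math


def policy_iteration(a, b, q, r, k0, iterations=20):
    """Policy iteration for dx/dt = a*x + b*u with cost integral of q*x^2 + r*u^2.

    The policy is u = -k*x and the value function is V(x) = p*x^2.
    k0 must be stabilizing, i.e. a - b*k0 < 0.
    """
    if a - b * k0 >= 0:
        raise ValueError("initial gain must be stabilizing")
    k = k0
    p = None
    for _ in range(iterations):
        # Policy evaluation: solve 2*(a - b*k)*p + q + r*k^2 = 0 for p
        p = (q + r * k * k) / (2.0 * (b * k - a))
        # Policy improvement: minimize the HJB right-hand side over u
        k = b * p / r
    return p, k


def riccati_solution(a, b, q, r):
    # Positive root of 2*a*p - (b^2 / r) * p^2 + q = 0
    return r * (a + math.sqrt(a * a + b * b * q / r)) / (b * b)
```

For example, `policy_iteration(1.0, 1.0, 1.0, 1.0, k0=3.0)` can be compared against `riccati_solution(1.0, 1.0, 1.0, 1.0)`; the iterates approach the Riccati solution from above. For nonlinear control-affine systems the evaluation step has no closed form, which is why the value function has to be approximated.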

