Maximum Principle of Discrete Stochastic Control System Driven by Both Fractional Noise and White Noise

In this paper, we investigate necessary optimality conditions for discrete stochastic optimal control problems driven by both fractional noise and white noise. The admissible control region is not required to be convex. The corresponding variational inequalities are obtained by applying the classical variational method and Malliavin calculus. We also apply the resulting stochastic maximum principle to a linear-quadratic optimal control problem to illustrate the main result.

A Hida–Malliavin white noise calculus approach to optimal control

Infinite Dimensional Analysis Quantum Probability and Related Topics ◽

10.1142/s0219025718500145 ◽

2018 ◽

Vol 21 (03) ◽

pp. 1850014

Author(s):

Nacira Agram ◽

Bernt Øksendal

Keyword(s):

Optimal Control ◽

Diffusion Coefficient ◽

White Noise ◽

Backward Stochastic Differential Equation ◽

Second Order ◽

Linear Quadratic ◽

Linear Quadratic Optimal Control ◽

Alternative Approach ◽

Quadratic Optimal Control ◽

White Noise Calculus

The classical maximum principle for optimal stochastic control states that if a control [Formula: see text] is optimal, then the corresponding Hamiltonian has a maximum at [Formula: see text]. The first proofs for this result assumed that the control did not enter the diffusion coefficient. Moreover, it was assumed that there were no jumps in the system. Subsequently, it was discovered by Shige Peng (still assuming no jumps) that one could also allow the diffusion coefficient to depend on the control, provided that the corresponding adjoint backward stochastic differential equation (BSDE) for the first-order derivative was extended to include an extra BSDE for the second-order derivatives. In this paper, we present an alternative approach based on Hida–Malliavin calculus and white noise theory. This enables us to handle the general case with jumps, allowing both the diffusion coefficient and the jump coefficient to depend on the control, and we do not need the extra BSDE with second-order derivatives. The result is illustrated by an example of a constrained linear-quadratic optimal control.

The linear quadratic optimal control problem for discrete-time Markov jump linear singular systems

Automatica ◽

10.1016/j.automatica.2021.109506 ◽

2021 ◽

Vol 127 ◽

pp. 109506

Author(s):

Jorge R. Chávez-Fuentes ◽

Eduardo F. Costa ◽

Marco H. Terra ◽

Kaio D.T. Rocha

Keyword(s):

Optimal Control ◽

Optimal Control Problem ◽

Control Problem ◽

Discrete Time ◽

Singular Systems ◽

Linear Quadratic ◽

Markov Jump ◽

Linear Quadratic Optimal Control ◽

Quadratic Optimal Control

Convergence results for an averaged LQR problem with applications to reinforcement learning

Mathematics of Control Signals and Systems ◽

10.1007/s00498-021-00294-y ◽

2021 ◽

Author(s):

Andrea Pesare ◽

Michele Palladino ◽

Maurizio Falcone

Keyword(s):

Optimal Control ◽

Reinforcement Learning ◽

Optimal Control Problem ◽

Control Problem ◽

Current System ◽

Numerical Test ◽

Linear Quadratic ◽

Linear Quadratic Optimal Control ◽

Lqr Problem ◽

Quadratic Optimal Control

Abstract: In this paper, we deal with a linear quadratic optimal control problem with unknown dynamics. As a modeling assumption, we suppose that the knowledge an agent has of the current system is represented by a probability distribution $\pi$ on the space of matrices. Furthermore, we assume that this probability measure is suitably updated to take into account the increased experience that the agent obtains while exploring the environment, approximating the underlying dynamics with increasing accuracy. Under these assumptions, we show that the optimal control obtained by solving the “average” linear quadratic optimal control problem with respect to a certain $\pi$ converges to the optimal control of the linear quadratic optimal control problem governed by the actual, underlying dynamics. This approach is closely related to model-based reinforcement learning algorithms in which prior and posterior probability distributions describing the knowledge of the uncertain system are recursively updated. In the last section, we present a numerical test that confirms the theoretical results.
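The convergence claim can be illustrated with a minimal sketch. It is not the authors' setting or algorithm: it assumes a scalar discrete-time system x_{k+1} = a x_k + b u_k, an open-loop control over a finite horizon, and a distribution $\pi$ with finitely many atoms. The names `lifted_maps` and `averaged_open_loop_control`, the cost weights, and the numerical values are all hypothetical choices made for illustration.

```python
import numpy as np

def lifted_maps(a, b, x0, horizon):
    """Return (G, h) such that [x_1, ..., x_N] = G @ u + h
    for the scalar system x_{k+1} = a * x_k + b * u_k."""
    G = np.zeros((horizon, horizon))
    for k in range(1, horizon + 1):      # state index of x_k
        for j in range(k):               # control u_j influences x_k when j < k
            G[k - 1, j] = a ** (k - 1 - j) * b
    h = np.array([a ** k * x0 for k in range(1, horizon + 1)])
    return G, h

def averaged_open_loop_control(atoms, weights, x0, horizon, q=1.0, r=1.0):
    """Minimise sum_i w_i * J(u; a_i, b_i) with J = q*||x||^2 + r*||u||^2.
    The averaged cost is quadratic in u, so the minimiser solves a linear system.
    Assumes the weights sum to one."""
    H = r * np.eye(horizon)
    g = np.zeros(horizon)
    for (a, b), w in zip(atoms, weights):
        G, h = lifted_maps(a, b, x0, horizon)
        H += w * q * (G.T @ G)
        g += w * q * (G.T @ h)
    return np.linalg.solve(H, -g)

# Assumed "true" dynamics and a two-atom distribution that concentrates on them.
a_true, b_true, x0, N = 0.9, 0.5, 1.0, 5
u_true = averaged_open_loop_control([(a_true, b_true)], [1.0], x0, N)

errors = []
for n in (1, 10, 100):
    eps = 0.5 / n                        # spread of pi around the true value of a
    atoms = [(a_true - eps, b_true), (a_true + eps, b_true)]
    u_avg = averaged_open_loop_control(atoms, [0.5, 0.5], x0, N)
    errors.append(float(np.linalg.norm(u_avg - u_true)))

print(errors)
```

As the two atoms of the assumed distribution move closer to the true parameter, the distance between the averaged control and the control computed from the true dynamics shrinks, which is the qualitative behaviour the abstract describes.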

Two Inverse Problems Solution by Feedback Tracking Control

Axioms ◽

10.3390/axioms10030137 ◽

2021 ◽

Vol 10 (3) ◽

pp. 137

Author(s):

Vladimir Turetsky

Keyword(s):

Optimal Control ◽

Optimal Control Problem ◽

Control Problem ◽

Feedback Linearization ◽

Linear Quadratic ◽

Linear Quadratic Optimal Control ◽

Linear Quadratic Tracking ◽

Ill Posed ◽

Quadratic Optimal Control ◽

Quadratic Tracking

Two ill-posed inverse problems are considered. The first is the restoration of the input of a linear system. The second is the restoration of the time-dependent coefficients of a linear ordinary differential equation. Both problems are reformulated as auxiliary optimal control problems with a regularizing cost functional. For the coefficient restoration problem, two control models are proposed. In the first model, the control coefficients are approximated by the output and the estimates of its derivatives. This model yields an approximating linear-quadratic optimal control problem with a known explicit solution. The derivatives are also obtained as auxiliary linear-quadratic tracking controls. The second control model is accurate and leads to a bilinear-quadratic optimal control problem. The latter is tackled in two ways: by an iterative procedure and by feedback linearization. Simulation results show that the bilinear model provides more accurate coefficient estimates.
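The idea of recasting input restoration as a regularized tracking problem can be sketched as follows. This is a simplified illustration, not the feedback tracking method of the paper: it assumes a scalar discrete-time system x_{k+1} = a x_k + b u_k with zero initial state and noise-free output, and it solves the regularized least-squares problem in batch form. The function name `restore_input`, the regularization weight `alpha`, and all numerical values are hypothetical.

```python
import numpy as np

def restore_input(y, a, b, alpha):
    """Estimate the input u from the output y = [x_1, ..., x_N] by minimising
    ||G @ u - y||^2 + alpha * ||u||^2 (a tracking cost plus a regularizing term),
    for x_{k+1} = a * x_k + b * u_k with x_0 = 0."""
    n = len(y)
    G = np.zeros((n, n))
    for k in range(1, n + 1):
        for j in range(k):
            G[k - 1, j] = a ** (k - 1 - j) * b   # effect of u_j on x_k
    # Normal equations of the regularized quadratic cost.
    u_hat = np.linalg.solve(G.T @ G + alpha * np.eye(n), G.T @ y)
    return u_hat, G

# Assumed ground-truth input and the noise-free output it generates.
a, b, n = 0.9, 0.5, 20
u_true = np.sin(0.3 * np.arange(n))
_, G = restore_input(np.zeros(n), a, b, alpha=1.0)   # reuse the helper to build G
y = G @ u_true

u_restored, _ = restore_input(y, a, b, alpha=1e-8)
print(float(np.max(np.abs(u_restored - u_true))))
```

With noise-free data a very small `alpha` recovers the input almost exactly. With noisy measurements a larger value would trade accuracy for stability, which is the role of the regularizing term in the cost.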
