A multilayer recurrent neural network for real-time synthesis of linear-quadratic optimal control systems

Abstract

Objective: The rapid acceleration of tools for recording neuronal populations and targeted optogenetic manipulation has enabled real-time, feedback control of neuronal circuits in the brain. Continuously-graded control of measured neuronal activity poses a wide range of technical challenges, which we address through a combination of optogenetic stimulation and a state-space optimal control framework implemented in the thalamocortical circuit of the awake mouse.

Approach: Closed-loop optogenetic control of neurons was performed in real-time via stimulation of channelrhodopsin-2 expressed in the somatosensory thalamus of the head-fixed mouse. A state-space linear dynamical system model structure was used to approximate the light-to-spiking input-output relationship in both single-neuron as well as multi-neuron scenarios when recording from multielectrode arrays. These models were utilized to design state feedback controller gains by way of linear quadratic optimal control and were also used online for estimation of state feedback, where a parameter-adaptive Kalman filter provided robustness to model-mismatch.

Main results: This model-based control scheme proved effective for feedback control of single-neuron firing rate in the thalamus of awake animals. Notably, the graded optical actuation utilized here did not synchronize simultaneously recorded neurons, but heterogeneity across the neuronal population resulted in a varied response to stimulation. Simulated multi-output feedback control provided better control of a heterogeneous population and demonstrated how the approach generalizes beyond single-neuron applications.

Significance: To our knowledge, this work represents the first experimental application of state space model-based feedback control for optogenetic stimulation. In combination with linear quadratic optimal control, the approaches here should generalize to future problems involving the control of highly complex neural circuits. More generally, feedback control of neuronal circuits opens the door to adaptively interacting with the dynamics underlying sensory, motor, and cognitive signaling, enabling a deeper understanding of circuit function and ultimately the control of function in injury or disease.
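
As a rough illustration of how state feedback gains can be designed by linear quadratic optimal control from a state-space model, the sketch below iterates the discrete-time Riccati recursion to a fixed point. It is not the authors' implementation: the discrete-time formulation, the function name dlqr_gain, and the example matrices A, B, Q and R are all assumptions chosen for illustration only, and no estimator (such as the Kalman filter mentioned above) is included.

```python
import numpy as np


def dlqr_gain(A, B, Q, R, iters=10000, tol=1e-10):
    """Hypothetical helper: discrete-time LQR gain by Riccati fixed-point iteration.

    Assumed model:  x[k+1] = A x[k] + B u[k]
    Assumed cost:   sum over k of  x'Qx + u'Ru
    Returns (K, P) where the control law is u[k] = -K x[k].
    """
    A = np.array(A, dtype=float)
    B = np.array(B, dtype=float)
    Q = np.array(Q, dtype=float)
    R = np.array(R, dtype=float)

    P = Q.copy()
    for _ in range(iters):
        # Gain associated with the current cost-to-go matrix P.
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        # Riccati update: P = Q + A'PA - A'PB (R + B'PB)^{-1} B'PA
        P_next = Q + A.T @ P @ (A - B @ K)
        converged = np.max(np.abs(P_next - P)) < tol
        P = P_next
        if converged:
            break

    # Recompute the gain from the final P so that K and P are consistent.
    K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
    return K, P


# Illustrative two-state system (made-up numbers, not taken from any paper).
A = np.array([[1.0, 0.1],
              [0.0, 0.9]])
B = np.array([[0.0],
              [0.1]])
Q = np.eye(2)            # state weighting (assumed)
R = np.array([[0.1]])    # input weighting (assumed)

K, P = dlqr_gain(A, B, Q, R)

# Spectral radius of the closed-loop matrix A - BK; below 1 means stable.
closed_loop_radius = np.max(np.abs(np.linalg.eigvals(A - B @ K)))
print("gain K:", K)
print("closed-loop spectral radius:", closed_loop_radius)
```

Increasing R relative to Q penalizes input effort more heavily and generally yields a smaller gain; the weighting is a design choice rather than something fixed by the model.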

Matrix factorization and Chandrasekhar equations techniques in the design of linear quadratic optimal control systems

International Journal of Systems Science ◽

10.1080/00207728108963791 ◽

1981 ◽

Vol 12 (8) ◽

pp. 907-915 ◽

Cited By ~ 4

Author(s):

RAYMOND GOREZ

Keyword(s):

Optimal Control ◽

Control Systems ◽

Matrix Factorization ◽

Linear Quadratic ◽

Linear Quadratic Optimal Control ◽

Quadratic Optimal Control

The linear quadratic optimal control problem for discrete-time Markov jump linear singular systems

Automatica ◽

10.1016/j.automatica.2021.109506 ◽

2021 ◽

Vol 127 ◽

pp. 109506

Author(s):

Jorge R. Chávez-Fuentes ◽

Eduardo F. Costa ◽

Marco H. Terra ◽

Kaio D.T. Rocha

Keyword(s):

Optimal Control ◽

Optimal Control Problem ◽

Control Problem ◽

Discrete Time ◽

Singular Systems ◽

Linear Quadratic ◽

Markov Jump ◽

Linear Quadratic Optimal Control ◽

Quadratic Optimal Control

Convergence results for an averaged LQR problem with applications to reinforcement learning

Mathematics of Control Signals and Systems ◽

10.1007/s00498-021-00294-y ◽

2021 ◽

Author(s):

Andrea Pesare ◽

Michele Palladino ◽

Maurizio Falcone

Keyword(s):

Optimal Control ◽

Reinforcement Learning ◽

Optimal Control Problem ◽

Control Problem ◽

Current System ◽

Numerical Test ◽

Linear Quadratic ◽

Linear Quadratic Optimal Control ◽

Lqr Problem ◽

Quadratic Optimal Control

Abstract: In this paper, we will deal with a linear quadratic optimal control problem with unknown dynamics. As a modeling assumption, we will suppose that the knowledge that an agent has on the current system is represented by a probability distribution $\pi$ on the space of matrices. Furthermore, we will assume that such a probability measure is suitably updated to take into account the increased experience that the agent obtains while exploring the environment, approximating with increasing accuracy the underlying dynamics. Under these assumptions, we will show that the optimal control obtained by solving the “average” linear quadratic optimal control problem with respect to a certain $\pi$ converges to the optimal control of the linear quadratic optimal control problem governed by the actual, underlying dynamics. This approach is closely related to model-based reinforcement learning algorithms where prior and posterior probability distributions describing the knowledge on the uncertain system are recursively updated. In the last section, we will show a numerical test that confirms the theoretical results.

Two Inverse Problems Solution by Feedback Tracking Control

Axioms ◽

10.3390/axioms10030137 ◽

2021 ◽

Vol 10 (3) ◽

pp. 137

Author(s):

Vladimir Turetsky

Keyword(s):

Optimal Control ◽

Optimal Control Problem ◽

Control Problem ◽

Feedback Linearization ◽

Linear Quadratic ◽

Linear Quadratic Optimal Control ◽

Linear Quadratic Tracking ◽

Ill Posed ◽

Quadratic Optimal Control ◽

Quadratic Tracking

Two inverse ill-posed problems are considered. The first problem is an input restoration of a linear system. The second one is a restoration of time-dependent coefficients of a linear ordinary differential equation. Both problems are reformulated as auxiliary optimal control problems with a regularizing cost functional. For the coefficient restoration problem, two control models are proposed. In the first model, the control coefficients are approximated by the output and the estimates of its derivatives. This model yields an approximating linear-quadratic optimal control problem having a known explicit solution. The derivatives are also obtained as auxiliary linear-quadratic tracking controls. The second control model is accurate and leads to a bilinear-quadratic optimal control problem. The latter is tackled in two ways: by an iterative procedure and by a feedback linearization. Simulation results show that a bilinear model provides more accurate coefficient estimates.

A multilayer recurrent neural network for real-time synthesis of linear-quadratic optimal control systems

Linear quadratic optimal control problem with fuzzy variables via neural network

Singular Linear-Quadratic Optimal Control for Single-Input Linear Digital Control Systems.

Linear Quadratic Optimal Control Systems II

On Control Systems Described by a Class of Linear Differential-Algebraic Equations: State Realizations and Linear Quadratic Optimal Control

Linear Quadratic Optimal Control Systems I

State-space optimal feedback control of optogenetically driven neural activity

Matrix factorization and Chandrasekhar equations techniques in the design of linear quadratic optimal control systems

The linear quadratic optimal control problem for discrete-time Markov jump linear singular systems

Convergence results for an averaged LQR problem with applications to reinforcement learning

Two Inverse Problems Solution by Feedback Tracking Control
