scholarly journals Bridging Reinforcement Learning and Iterative Learning Control: Autonomous Reference Tracking for Unknown, Nonlinear Dynamics

Author(s):  
Michael Meindl ◽  
Dustin Lehmann ◽  
Thomas Seel

<div>This work addresses the problem of reference tracking in autonomously learning agents with unknown, nonlinear dynamics. Existing solutions require model information or extensive parameter tuning, and have rarely been validated in real-world experiments. We propose a learning control scheme that learns to approximate the unknown dynamics by a Gaussian Process (GP), which is used to optimize and apply a feedforward control input on each trial. Unlike existing approaches, the proposed method neither requires knowledge of the system states and their dynamics nor knowledge of an effective feedback control structure. All algorithm parameters are chosen automatically, i.e. the learning method works plug and play. The proposed method is validated in extensive simulations and real-world experiments. In contrast to most existing work, we study learning dynamics for more than one motion task as well as the robustness of performance across a large range of learning parameters. The method’s plug and play applicability is demonstrated by experiments with a balancing robot, in which the proposed method rapidly learns to track the desired output. Due to its model-agnostic and plug and play properties, the proposed method is expected to have high potential for application to a large class of reference tracking problems in systems with unknown, nonlinear dynamics.</div>

2021 ◽  
Author(s):  
Michael Meindl ◽  
Dustin Lehmann ◽  
Thomas Seel

<div>This work addresses the problem of reference tracking in autonomously learning agents with unknown, nonlinear dynamics. Existing solutions require model information or extensive parameter tuning, and have rarely been validated in real-world experiments. We propose a learning control scheme that learns to approximate the unknown dynamics by a Gaussian Process (GP), which is used to optimize and apply a feedforward control input on each trial. Unlike existing approaches, the proposed method neither requires knowledge of the system states and their dynamics nor knowledge of an effective feedback control structure. All algorithm parameters are chosen automatically, i.e. the learning method works plug and play. The proposed method is validated in extensive simulations and real-world experiments. In contrast to most existing work, we study learning dynamics for more than one motion task as well as the robustness of performance across a large range of learning parameters. The method’s plug and play applicability is demonstrated by experiments with a balancing robot, in which the proposed method rapidly learns to track the desired output. Due to its model-agnostic and plug and play properties, the proposed method is expected to have high potential for application to a large class of reference tracking problems in systems with unknown, nonlinear dynamics.</div>


2019 ◽  
Vol 41 (12) ◽  
pp. 3372-3384 ◽  
Author(s):  
Shaoxin Sun ◽  
Huaguang Zhang ◽  
Jian Han ◽  
Yuling Liang

In this paper we investigate the fault estimation problem against local unknown nonlinear dynamics, sensor and actuator faults for a class of Takagi–Sugeno (T-S) fuzzy systems. In addition, the exogenous disturbances and measurement noise are considered, which are presented in the operation of the systems and are various and independent of the systems. A novel double-level observer is designed to estimate the system states and faults. Compared with the current research results, the proposed observer has a wider range of application. By designing a fuzzy augmented system and a Kalman filter as the first-level observer, the estimations of system states, sensor faults and actuator faults can be obtained simultaneously. The second-level observer can estimate the unknown nonlinear dynamic function by establishing generalized fuzzy hyperbolic model. The robust stability of the estimation error systems is considered by H∞ performance. Finally, three simulation examples are provided to demonstrate the effectiveness of the proposed fault estimation method.


Robotica ◽  
1993 ◽  
Vol 11 (4) ◽  
pp. 291-298 ◽  
Author(s):  
Jong-Woon Lee ◽  
Hak-Sung Lee ◽  
Zeungnam Bien

SUMMARYThe Fourier series is employed to approximate the input/output (I/O) characteristics of a dynamic system and, based on the approximation, a new learning control algorithm is proposed in order to find iteratively the control input for tracking a desired trajectory. The use of the Fourier series approximation of I/O renders at least a couple of useful consequences: the frequency characteristics of the system can be used in the controller design and the reconstruction of the system states is not required. The convergence condition of the proposed algorithm is provided and the existence and uniqueness of the desired control input is discussed. The effectiveness of the proposed algorithm is illustrated by computer simulation for a robot trajectory tracking. It is shown that, by adding a feedback term in learning control algorithm, robustness and convergence speed can be improved.


2014 ◽  
Vol 24 (3) ◽  
pp. 299-319 ◽  
Author(s):  
Kamen Delchev ◽  
George Boiadjiev ◽  
Haruhisa Kawasaki ◽  
Tetsuya Mouri

Abstract This paper deals with the improvement of the stability of sampled-data (SD) feedback control for nonlinear multiple-input multiple-output time varying systems, such as robotic manipulators, by incorporating an off-line model based nonlinear iterative learning controller. The proposed scheme of nonlinear iterative learning control (NILC) with SD feedback is applicable to a large class of robots because the sampled-data feedback is required for model based feedback controllers, especially for robotic manipulators with complicated dynamics (6 or 7 DOF, or more), while the feedforward control from the off-line iterative learning controller should be assumed as a continuous one. The robustness and convergence of the proposed NILC law with SD feedback is proven, and the derived sufficient condition for convergence is the same as the condition for a NILC with a continuous feedback control input. With respect to the presented NILC algorithm applied to a virtual PUMA 560 robot, simulation results are presented in order to verify convergence and applicability of the proposed learning controller with SD feedback controller attached


Author(s):  
Ray Huffaker ◽  
Marco Bittelli ◽  
Rodolfo Rosa

Detecting causal interactions among climatic, environmental, and human forces in complex biophysical systems is essential for understanding how these systems function and how public policies can be devised that protect the flow of essential services to biological diversity, agriculture, and other core economic activities. Convergent Cross Mapping (CCM) detects causal networks in real-world systems diagnosed with deterministic, low-dimension, and nonlinear dynamics. If CCM detects correspondence between phase spaces reconstructed from observed time series variables, then the variables are determined to causally interact in the same dynamic system. CCM can give false positives by misconstruing synchronized variables as causally interactive. Extended (delayed) CCM screens for false positives among synchronized variables.


Complexity ◽  
2018 ◽  
Vol 2018 ◽  
pp. 1-8 ◽  
Author(s):  
Xuejing Lan ◽  
Zhenghao Wu ◽  
Wenbiao Xu ◽  
Guiyun Liu

This paper considers the region-based formation control for a swarm of robots with unknown nonlinear dynamics and disturbances. An adaptive neural network is designed to approximate the unknown nonlinear dynamics, and the desired formation shape is achieved by designing appropriate potential functions. Moreover, the collision avoidance, velocity consensus, and region tracking are all considered in the controller. The stability of the multirobot system has been demonstrated based on the Lyapunov theorem. Finally, three numerical simulations show the effectiveness of the proposed formation control scheme to deal with the narrow space, loss of robots, and formation merging problems.


Author(s):  
Ian L. Cassidy ◽  
Jeffrey T. Scruggs ◽  
Sam Behrens

This study addresses the formulation of feedback controllers for stochastically-excited vibratory energy harvesters. Maximizing power generation from stochastic disturbances can be accomplished using LQG control theory, with the transducer current treated as the control input. For the case where the power flow direction is unconstrained, an electronic drive capable of extracting as well as delivering power to the transducer is required to implement the optimal controller. It is demonstrated that for stochastic disturbances characterized by second-order, bandpass-filtered white noise, energy harvesters can be passively tuned such that optimal stationary power generation only requires half of the system states for feedback in the active circuit. However, there are many applications where the implementation of a bi-directional power electronic drive is infeasible, due to the higher parasitic losses they must sustain. If the electronics are designed to be capable of only single-directional power flow (i.e., where the electronics are incapable of power injection), then these parasitics can be reduced significantly, which makes single-directional converters more appropriate at smaller power scales. The constraint on the directionality of power flow imposes a constraint on the feedback laws that can be implemented with such converters. In this paper, we present a sub-optimal nonlinear control design technique for this class of problems, which exhibits an analytically computable upper bound on average power generation.


Sign in / Sign up

Export Citation Format

Share Document