Bridging Reinforcement Learning and Iterative Learning Control: Autonomous Reference Tracking for Unknown, Nonlinear Dynamics

10.36227/techrxiv.15022497 ◽

2021 ◽

Author(s):

Michael Meindl ◽

Dustin Lehmann ◽

Thomas Seel

Keyword(s):

Nonlinear Dynamics ◽

Real World ◽

Feedforward Control ◽

Learning Control ◽

Parameter Tuning ◽

Control Input ◽

Plug And Play ◽

Reference Tracking ◽

System States ◽

Unknown Nonlinear Dynamics

<div>This work addresses the problem of reference tracking in autonomously learning agents with unknown, nonlinear dynamics. Existing solutions require model information or extensive parameter tuning, and have rarely been validated in real-world experiments. We propose a learning control scheme that learns to approximate the unknown dynamics by a Gaussian Process (GP), which is used to optimize and apply a feedforward control input on each trial. Unlike existing approaches, the proposed method neither requires knowledge of the system states and their dynamics nor knowledge of an effective feedback control structure. All algorithm parameters are chosen automatically, i.e. the learning method works plug and play. The proposed method is validated in extensive simulations and real-world experiments. In contrast to most existing work, we study learning dynamics for more than one motion task as well as the robustness of performance across a large range of learning parameters. The method’s plug and play applicability is demonstrated by experiments with a balancing robot, in which the proposed method rapidly learns to track the desired output. Due to its model-agnostic and plug and play properties, the proposed method is expected to have high potential for application to a large class of reference tracking problems in systems with unknown, nonlinear dynamics.</div>

Download Full-text

Repetitive Learning Control for Class of Unidirectional Control Input Systems with Unknown Nonlinear Dynamics

Journal of Mechanical Engineering ◽

10.3901/jme.2017.22.109 ◽

2017 ◽

Vol 53 (22) ◽

pp. 109

Author(s):

Yougang SUN

Keyword(s):

Nonlinear Dynamics ◽

Learning Control ◽

Control Input ◽

Repetitive Learning ◽

Unknown Nonlinear Dynamics

Download Full-text

A novel double-level observer-based fault estimation for Takagi–Sugeno fuzzy systems with unknown nonlinear dynamics

Transactions of the Institute of Measurement and Control ◽

10.1177/0142331219826655 ◽

2019 ◽

Vol 41 (12) ◽

pp. 3372-3384 ◽

Cited By ~ 2

Author(s):

Shaoxin Sun ◽

Huaguang Zhang ◽

Jian Han ◽

Yuling Liang

Keyword(s):

Nonlinear Dynamics ◽

Fuzzy Systems ◽

Estimation Error ◽

Estimation Method ◽

Fault Estimation ◽

Hyperbolic Model ◽

Actuator Faults ◽

System States ◽

Takagi Sugeno ◽

Unknown Nonlinear Dynamics

In this paper we investigate the fault estimation problem against local unknown nonlinear dynamics, sensor and actuator faults for a class of Takagi–Sugeno (T-S) fuzzy systems. In addition, the exogenous disturbances and measurement noise are considered, which are presented in the operation of the systems and are various and independent of the systems. A novel double-level observer is designed to estimate the system states and faults. Compared with the current research results, the proposed observer has a wider range of application. By designing a fuzzy augmented system and a Kalman filter as the first-level observer, the estimations of system states, sensor faults and actuator faults can be obtained simultaneously. The second-level observer can estimate the unknown nonlinear dynamic function by establishing generalized fuzzy hyperbolic model. The robust stability of the estimation error systems is considered by H∞ performance. Finally, three simulation examples are provided to demonstrate the effectiveness of the proposed fault estimation method.

Download Full-text

Iterative learning control with feedback using Fourier series with application to robot trajectory tracking

Robotica ◽

10.1017/s0263574700016544 ◽

1993 ◽

Vol 11 (4) ◽

pp. 291-298 ◽

Cited By ~ 18

Author(s):

Jong-Woon Lee ◽

Hak-Sung Lee ◽

Zeungnam Bien

Keyword(s):

Fourier Series ◽

Trajectory Tracking ◽

Control Algorithm ◽

Controller Design ◽

Learning Control ◽

Convergence Condition ◽

Control Input ◽

Robot Trajectory ◽

Fourier Series Approximation ◽

System States

SUMMARYThe Fourier series is employed to approximate the input/output (I/O) characteristics of a dynamic system and, based on the approximation, a new learning control algorithm is proposed in order to find iteratively the control input for tracking a desired trajectory. The use of the Fourier series approximation of I/O renders at least a couple of useful consequences: the frequency characteristics of the system can be used in the controller design and the reconstruction of the system states is not required. The convergence condition of the proposed algorithm is provided and the existence and uniqueness of the desired control input is discussed. The effectiveness of the proposed algorithm is illustrated by computer simulation for a robot trajectory tracking. It is shown that, by adding a feedback term in learning control algorithm, robustness and convergence speed can be improved.

Download Full-text

Iterative learning control with sampled-data feedback for robot manipulators

Archives of Control Sciences ◽

10.2478/acsc-2014-0018 ◽

2014 ◽

Vol 24 (3) ◽

pp. 299-319 ◽

Cited By ~ 4

Author(s):

Kamen Delchev ◽

George Boiadjiev ◽

Haruhisa Kawasaki ◽

Tetsuya Mouri

Keyword(s):

Feedback Control ◽

Iterative Learning Control ◽

Feedforward Control ◽

Robotic Manipulators ◽

Learning Control ◽

Iterative Learning ◽

Control Input ◽

Sampled Data ◽

Model Based ◽

Data Feedback

Abstract This paper deals with the improvement of the stability of sampled-data (SD) feedback control for nonlinear multiple-input multiple-output time varying systems, such as robotic manipulators, by incorporating an off-line model based nonlinear iterative learning controller. The proposed scheme of nonlinear iterative learning control (NILC) with SD feedback is applicable to a large class of robots because the sampled-data feedback is required for model based feedback controllers, especially for robotic manipulators with complicated dynamics (6 or 7 DOF, or more), while the feedforward control from the off-line iterative learning controller should be assumed as a continuous one. The robustness and convergence of the proposed NILC law with SD feedback is proven, and the derived sufficient condition for convergence is the same as the condition for a NILC with a continuous feedback control input. With respect to the presented NILC algorithm applied to a virtual PUMA 560 robot, simulation results are presented in order to verify convergence and applicability of the proposed learning controller with SD feedback controller attached

Download Full-text

Phenomenological Modelling

10.1093/oso/9780198782933.003.0009 ◽

2018 ◽

Author(s):

Ray Huffaker ◽

Marco Bittelli ◽

Rodolfo Rosa

Keyword(s):

Nonlinear Dynamics ◽

Time Series ◽

Real World ◽

Biological Diversity ◽

False Positives ◽

World Systems ◽

Economic Activities ◽

Essential Services ◽

Low Dimension ◽

Convergent Cross Mapping

Detecting causal interactions among climatic, environmental, and human forces in complex biophysical systems is essential for understanding how these systems function and how public policies can be devised that protect the flow of essential services to biological diversity, agriculture, and other core economic activities. Convergent Cross Mapping (CCM) detects causal networks in real-world systems diagnosed with deterministic, low-dimension, and nonlinear dynamics. If CCM detects correspondence between phase spaces reconstructed from observed time series variables, then the variables are determined to causally interact in the same dynamic system. CCM can give false positives by misconstruing synchronized variables as causally interactive. Extended (delayed) CCM screens for false positives among synchronized variables.

Download Full-text

Fuzzy Adaptive Swarm Control for the High-Order Self-organized System with Unknown Nonlinear Dynamics and Unmeasured States

International Journal of Fuzzy Systems ◽

10.1007/s40815-021-01142-6 ◽

2021 ◽

Author(s):

Kang Chen ◽

Tao Xu ◽

Hao Xu ◽

Long Chen

Keyword(s):

Nonlinear Dynamics ◽

High Order ◽

Self Organized ◽

Unknown Nonlinear Dynamics ◽

Fuzzy Adaptive

Download Full-text

Feedforward control design for shaking table by Data driven control considering control input limitation

IFAC-PapersOnLine ◽

10.1016/j.ifacol.2020.12.2019 ◽

2020 ◽

Vol 53 (2) ◽

pp. 9011-9016

Author(s):

Shinji Ishihara ◽

Koichi Tahara ◽

Koji Hironaka

Keyword(s):

Feedforward Control ◽

Control Design ◽

Shaking Table ◽

Data Driven ◽

Control Input

Download Full-text

Adaptive-Neural-Network-Based Shape Control for a Swarm of Robots

Complexity ◽

10.1155/2018/8382702 ◽

2018 ◽

Vol 2018 ◽

pp. 1-8 ◽

Cited By ~ 4

Author(s):

Xuejing Lan ◽

Zhenghao Wu ◽

Wenbiao Xu ◽

Guiyun Liu

Keyword(s):

Neural Network ◽

Nonlinear Dynamics ◽

Shape Control ◽

Formation Control ◽

Adaptive Neural Network ◽

Multirobot System ◽

Control Scheme ◽

Narrow Space ◽

The Stability ◽

Unknown Nonlinear Dynamics

This paper considers the region-based formation control for a swarm of robots with unknown nonlinear dynamics and disturbances. An adaptive neural network is designed to approximate the unknown nonlinear dynamics, and the desired formation shape is achieved by designing appropriate potential functions. Moreover, the collision avoidance, velocity consensus, and region tracking are all considered in the controller. The stability of the multirobot system has been demonstrated based on the Lyapunov theorem. Finally, three numerical simulations show the effectiveness of the proposed formation control scheme to deal with the narrow space, loss of robots, and formation merging problems.

Download Full-text

Nonlinear Stochastic Feedback Controllers for Vibratory Energy Harvesters With Power Flow Constraints

ASME 2011 Conference on Smart Materials, Adaptive Structures and Intelligent Systems, Volume 2 ◽

10.1115/smasis2011-5008 ◽

2011 ◽

Author(s):

Ian L. Cassidy ◽

Jeffrey T. Scruggs ◽

Sam Behrens

Keyword(s):

Power Generation ◽

Power Flow ◽

Flow Direction ◽

Average Power ◽

Control Input ◽

Stochastic Disturbances ◽

Feedback Controllers ◽

Energy Harvesters ◽

System States ◽

Feedback Laws

This study addresses the formulation of feedback controllers for stochastically-excited vibratory energy harvesters. Maximizing power generation from stochastic disturbances can be accomplished using LQG control theory, with the transducer current treated as the control input. For the case where the power flow direction is unconstrained, an electronic drive capable of extracting as well as delivering power to the transducer is required to implement the optimal controller. It is demonstrated that for stochastic disturbances characterized by second-order, bandpass-filtered white noise, energy harvesters can be passively tuned such that optimal stationary power generation only requires half of the system states for feedback in the active circuit. However, there are many applications where the implementation of a bi-directional power electronic drive is infeasible, due to the higher parasitic losses they must sustain. If the electronics are designed to be capable of only single-directional power flow (i.e., where the electronics are incapable of power injection), then these parasitics can be reduced significantly, which makes single-directional converters more appropriate at smaller power scales. The constraint on the directionality of power flow imposes a constraint on the feedback laws that can be implemented with such converters. In this paper, we present a sub-optimal nonlinear control design technique for this class of problems, which exhibits an analytically computable upper bound on average power generation.

Download Full-text