A Reinforcement-Learning Neural Network for the Control of Nonlinear Systems

Abstract: In this paper, we propose a safe reinforcement learning approach to synthesize deep neural network (DNN) controllers for nonlinear systems subject to safety constraints. The proposed approach employs an iterative scheme in which a learner and a verifier interact to synthesize safe DNN controllers. The learner trains a DNN controller via deep reinforcement learning, and the verifier certifies the learned controller by computing a maximal safe initial region and its corresponding barrier certificate, using polynomial abstraction and the solution of bilinear matrix inequalities. Compared with existing verification-in-the-loop synthesis methods, our iterative framework synthesizes controllers and barrier certificates sequentially, so it can learn safe controllers with adaptive barrier certificates rather than user-defined ones. We implement the tool SRLBC and evaluate its performance on a set of benchmark examples. The experimental results demonstrate that our approach efficiently synthesizes safe DNN controllers, even for nonlinear systems of dimension up to 12.
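The learner/verifier interaction described in this abstract can be sketched as a simple loop. This is only an illustrative toy under stated assumptions, not the SRLBC tool: the "controller" is a single feedback gain rather than a DNN, the "learner" is a hand-written update rather than deep reinforcement learning, and the "verifier" is a contraction check standing in for polynomial abstraction and bilinear matrix inequality solving. All names (`learner_verifier_loop`, `toy_train`, `toy_verify`, `Certificate`, `DT`) are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple

DT = 0.1  # hypothetical discretization step of the toy system


@dataclass
class Certificate:
    """Placeholder for a barrier certificate: only records a certified initial radius."""
    safe_radius: float


def learner_verifier_loop(
    train: Callable[[Optional[float]], float],
    verify: Callable[[float], Tuple[Optional[Certificate], Optional[float]]],
    max_iters: int = 10,
) -> Optional[Tuple[float, Certificate, int]]:
    """Alternate training and verification until the verifier returns a certificate."""
    feedback = None
    for iteration in range(1, max_iters + 1):
        controller = train(feedback)                 # learner proposes a controller
        certificate, feedback = verify(controller)   # verifier certifies or rejects it
        if certificate is not None:
            return controller, certificate, iteration
    return None  # no certified controller within the iteration budget


def toy_train(feedback: Optional[float]) -> float:
    # Stand-in for deep RL: start at gain 0.5, then raise the rejected gain by 0.5.
    return 0.5 if feedback is None else feedback + 0.5


def toy_verify(gain: float) -> Tuple[Optional[Certificate], Optional[float]]:
    # Toy closed loop: x_next = x + DT * (x - gain * x).
    # Stand-in for the verifier: accept when the map is a strict contraction,
    # which keeps every interval |x| <= r invariant.
    factor = abs(1.0 + DT * (1.0 - gain))
    if factor < 1.0:
        return Certificate(safe_radius=1.0), None
    return None, gain  # hand the rejected gain back to the learner
```

With these toy definitions, `learner_verifier_loop(toy_train, toy_verify)` rejects the gains 0.5 and 1.0 and accepts 1.5 on the third iteration.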

Neural-network-based reinforcement learning controller for nonlinear systems with non-symmetric dead-zone inputs

2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning ◽

10.1109/adprl.2009.4927535 ◽

2009 ◽

Cited By ~ 10

Author(s):

Xin Zhang ◽

Huaguang Zhang ◽

Derong Liu ◽

Yongsu Kim

Keyword(s):

Neural Network ◽

Reinforcement Learning ◽

Nonlinear Systems ◽

Dead Zone

Reinforcement learning neural network used in control of nonlinear systems

Proceedings of IEEE International Conference on Industrial Technology 2000 (IEEE Cat. No.00TH8482) ◽

10.1109/icit.2000.854247 ◽

2000 ◽

Cited By ~ 2

Author(s):

Ovidiu Grigore ◽

Octavian Grigore

Keyword(s):

Neural Network ◽

Reinforcement Learning ◽

Nonlinear Systems

Adaptive neural network optimized control using reinforcement learning of critic-actor architecture for a class of non-affine nonlinear systems

IEEE Access ◽

10.1109/access.2021.3120835 ◽

2021 ◽

pp. 1-1

Author(s):

Xue Yang ◽

Bin Li ◽

Guoxing Wen

Keyword(s):

Neural Network ◽

Reinforcement Learning ◽

Nonlinear Systems ◽

Adaptive Neural Network ◽

Affine Nonlinear Systems

Neural Network Based Adaptive Control of Uncertain and Unknown Nonlinear Systems

10.21236/ada389515 ◽

2001 ◽

Author(s):

Anthony J. Calise

Keyword(s):

Neural Network ◽

Adaptive Control ◽

Nonlinear Systems

Building HVAC Scheduling Using Reinforcement Learning via Neural Network Based Model Approximation

Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation ◽

10.1145/3360322.3360861 ◽

2019 ◽

Cited By ~ 8

Author(s):

Chi Zhang ◽

Sanmukh R. Kuppannagari ◽

Rajgopal Kannan ◽

Viktor K. Prasanna

Keyword(s):

Neural Network ◽

Reinforcement Learning ◽

Model Approximation

Location- and Person-Independent Activity Recognition with WiFi, Deep Neural Networks, and Reinforcement Learning

ACM Transactions on Internet of Things ◽

10.1145/3424739 ◽

2021 ◽

Vol 2 (1) ◽

pp. 1-25

Author(s):

Yongsen Ma ◽

Sheheryar Arshad ◽

Swetha Muniraju ◽

Eric Torkildson ◽

Enrico Rantala ◽

...

Keyword(s):

Neural Network ◽

Neural Networks ◽

Reinforcement Learning ◽

Activity Recognition ◽

Deep Neural Networks ◽

State Machine ◽

Recognition Algorithm ◽

The State ◽

Neural Architecture ◽

Learning Agent

In recent years, Channel State Information (CSI) measured by WiFi has been widely used for human activity recognition. In this article, we propose a deep learning design for location- and person-independent activity recognition with WiFi. The proposed design consists of three Deep Neural Networks (DNNs): a 2D Convolutional Neural Network (CNN) as the recognition algorithm, a 1D CNN as the state machine, and a reinforcement learning agent for neural architecture search. The recognition algorithm learns location- and person-independent features from different perspectives of CSI data. The state machine learns temporal dependency information from the history of classification results. The reinforcement learning agent optimizes the neural architecture of the recognition algorithm using a Recurrent Neural Network (RNN) with Long Short-Term Memory (LSTM). The proposed design is evaluated in a lab environment with different WiFi device locations, antenna orientations, sitting/standing/walking locations/orientations, and multiple persons. The proposed design achieves 97% average accuracy when the testing devices and persons are not seen during training. The proposed design is also evaluated on two public datasets, with accuracies of 80% and 83%. The proposed design needs very little human effort for ground truth labeling, feature engineering, signal processing, and tuning of learning parameters and hyperparameters.

Robust Quadrotor Control through Reinforcement Learning with Disturbance Compensation

Applied Sciences ◽

10.3390/app11073257 ◽

2021 ◽

Vol 11 (7) ◽

pp. 3257

Author(s):

Chen-Huan Pi ◽

Wei-Yuan Ye ◽

Stone Cheng

Keyword(s):

Neural Network ◽

Reinforcement Learning ◽

Control Strategy ◽

External Disturbance ◽

Control Agent ◽

Network Control ◽

Outdoor Environment ◽

Disturbance Compensation ◽

Tracking Accuracy ◽

Control Scheme

In this paper, a novel control strategy is presented for reinforcement learning with disturbance compensation to solve the problem of quadrotor positioning under external disturbance. The proposed control scheme applies a trained neural-network-based reinforcement learning agent to control the quadrotor, and its output is directly mapped to four actuators in an end-to-end manner. The proposed control scheme constructs a disturbance observer to estimate the external forces exerted on the three axes of the quadrotor, such as wind gusts in an outdoor environment. By introducing an interference compensator into the neural network control agent, the tracking accuracy and robustness were significantly increased in indoor and outdoor experiments. The experimental results indicate that the proposed control strategy is highly robust to external disturbances. In the experiments, compensation improved control accuracy and reduced positioning error by 75%. To the best of our knowledge, this study is the first to achieve quadrotor positioning control through low-level reinforcement learning by using a global positioning system in an outdoor environment.
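The idea of subtracting an observer's disturbance estimate from the agent's output can be illustrated with a one-dimensional toy. This is a rough sketch under strong assumptions and not the authors' implementation: a point mass replaces the quadrotor, a PD law (`nominal_policy`) stands in for the trained neural-network agent, acceleration is assumed to be measured exactly, and the observer is a simple first-order filter. All names, gains, and values are hypothetical.

```python
def nominal_policy(pos: float, vel: float, kp: float = 4.0, kd: float = 4.0) -> float:
    """PD law used here as a stand-in for the trained RL agent (assumption)."""
    return -kp * pos - kd * vel


def simulate(
    compensate: bool,
    disturbance: float = 2.0,  # constant external force, e.g. a steady wind
    mass: float = 1.0,
    dt: float = 0.01,
    steps: int = 2000,
    alpha: float = 0.1,        # observer filter gain
) -> float:
    """Drive a 1-D point mass toward pos = 0 and return the final position."""
    pos, vel, d_hat = 1.0, 0.0, 0.0
    for _ in range(steps):
        u = nominal_policy(pos, vel)
        if compensate:
            u -= d_hat  # cancel the estimated external force
        acc = (u + disturbance) / mass  # true dynamics: m * a = u + d
        # Observer: the force not explained by the command is attributed to
        # the disturbance, then low-pass filtered into the estimate.
        d_hat += alpha * ((mass * acc - u) - d_hat)
        pos += vel * dt
        vel += acc * dt
    return pos
```

In this toy, `simulate(False)` settles near the offset `disturbance / kp = 0.5`, while `simulate(True)` settles near zero, because the estimate converges to the constant disturbance and is cancelled by the command.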

Two-Level Lattice Neural Network Architectures for Control of Nonlinear Systems

2020 59th IEEE Conference on Decision and Control (CDC) ◽

10.1109/cdc42340.2020.9304079 ◽

2020 ◽

Author(s):

James Ferlez ◽

Xiaowu Sun ◽

Yasser Shoukry

Keyword(s):

Neural Network ◽

Nonlinear Systems ◽

Network Architectures ◽

Neural Network Architectures
