Acceleration-based Quadrotor Guidance Under Time Delays Using Deep Reinforcement Learning

AIAA Scitech 2021 Forum ◽

10.2514/6.2021-1751 ◽

2021 ◽

Author(s):

Kirk Hovell ◽

Steve Ulrich ◽

Murat Bronz

Keyword(s):

Reinforcement Learning ◽


A Reinforcement Learning Based Model-Free Wide-Area Damping Control under Random PMU Time Delays

10.1109/isie45552.2021.9576319 ◽

2021 ◽

Author(s):

Qingyang Li ◽

Shichao Liu ◽

Hicham Chaoui

Keyword(s):

Reinforcement Learning ◽

Time Delays ◽

Wide Area ◽

Damping Control


Model Mediated Teleoperation with a Hand-Arm Exoskeleton in Long Time Delays Using Reinforcement Learning

2020 29th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN) ◽

10.1109/ro-man47096.2020.9223477 ◽

2020 ◽

Author(s):

Hadi Beik-Mohammadi ◽

Matthias Kerzel ◽

Benedikt Pleintinger ◽

Thomas Hulin ◽

Philipp Reisich ◽

...

Keyword(s):

Reinforcement Learning ◽

Time Delays ◽


Multiple Model Reinforcement Learning for Environments with Poissonian Time Delays

10.22215/etd/2014-10293 ◽

2014 ◽

Author(s):

Jeff Campbell

Keyword(s):

Reinforcement Learning ◽

Time Delays ◽


Unmanned Aerial Vehicle Pitch Control under Delay Using Deep Reinforcement Learning with Continuous Action in Wind Tunnel Test

Aerospace ◽

10.3390/aerospace8090258 ◽

2021 ◽

Vol 8 (9) ◽

pp. 258

Author(s):

Daichi Wada ◽

Sergio A. Araujo-Estrada ◽

Shane Windsor

Keyword(s):

Neural Networks ◽

Reinforcement Learning ◽

Wind Tunnel ◽

Time Delays ◽

Wind Tunnel Test ◽

The Real ◽

Pitch Control ◽

Controller Performance ◽

The Neural Networks

Nonlinear flight controllers for fixed-wing unmanned aerial vehicles (UAVs) can potentially be developed using deep reinforcement learning. However, there is often a reality gap between the simulation models used to train these controllers and the real world. This study experimentally investigated the application of deep reinforcement learning to the pitch control of a UAV in wind tunnel tests, with a particular focus on the effect of time delays on flight controller performance. Multiple neural networks were trained in simulation with different assumed time delays and then tested in the wind tunnel. The neural networks trained with shorter delays tended to be susceptible to delay in the real tests and produced fluctuating behaviour. The neural networks trained with longer delays behaved more conservatively and did not produce oscillations, but suffered steady-state errors under some conditions due to unmodeled frictional effects. These results highlight the importance of performing physical experiments to validate controller performance, and show that the training approach used with reinforcement learning needs to be robust to reality gaps between simulation and the real world.
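The abstract does not describe how the assumed delays were implemented. As a rough illustration only, the Python sketch below shows one common way to inject a fixed actuation delay into a simulation loop: a FIFO buffer that hands the plant the command issued a few steps earlier. Everything here is an assumption for illustration: the names `DelayedActionBuffer` and `simulate_pitch` are hypothetical, the plant is a toy integrator rather than the paper's UAV model, and a proportional controller stands in for the trained neural network.

```python
from collections import deque


class DelayedActionBuffer:
    """Fixed-length FIFO that returns the command issued `delay_steps` ago.

    Hypothetical helper for illustration; not taken from the paper.
    """

    def __init__(self, delay_steps, initial_action=0.0):
        if delay_steps < 0:
            raise ValueError("delay_steps must be non-negative")
        self.delay_steps = delay_steps
        # Pre-fill so the first `delay_steps` outputs are the initial action.
        self._queue = deque([initial_action] * delay_steps)

    def push(self, action):
        """Store the new command and return the one that takes effect now."""
        self._queue.append(action)
        return self._queue.popleft()


def simulate_pitch(gain, delay_steps, steps=200, dt=0.02, target=1.0):
    """Proportional control of a toy integrator plant with delayed actuation.

    Assumed toy model: pitch rate equals the applied command.
    Returns the pitch value after every step.
    """
    buffer = DelayedActionBuffer(delay_steps)
    pitch = 0.0
    history = []
    for _ in range(steps):
        command = gain * (target - pitch)  # controller sees the current state
        applied = buffer.push(command)     # plant receives an older command
        pitch += dt * applied              # integrate the toy plant one step
        history.append(pitch)
    return history


if __name__ == "__main__":
    no_delay = simulate_pitch(gain=20.0, delay_steps=0)
    delayed = simulate_pitch(gain=20.0, delay_steps=10)
    print("peak pitch without delay:", max(no_delay))
    print("peak pitch with 10-step delay:", max(delayed))
```

In this toy setting, a gain that settles smoothly without delay overshoots badly once the same commands arrive ten steps late, which is loosely analogous to the fluctuating behaviour the abstract reports for networks trained with delays shorter than the real one.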


Reinforcement learning-based online adaptive controller design for a class of unknown nonlinear discrete-time systems with time delays

Neural Computing and Applications ◽

10.1007/s00521-018-3537-7 ◽

2018 ◽

Vol 30 (6) ◽

pp. 1733-1745 ◽

Author(s):

Yuling Liang ◽

Huaguang Zhang ◽

Geyang Xiao ◽

He Jiang

Keyword(s):

Reinforcement Learning ◽

Discrete Time ◽

Time Delays ◽

Controller Design ◽

Adaptive Controller ◽

Discrete Time Systems ◽


Supplemental Material for Reconciling Reinforcement Learning Models With Behavioral Extinction and Renewal: Implications for Addiction, Relapse, and Problem Gambling

Psychological Review ◽

10.1037/0033-295x.114.3.784.supp ◽

2007 ◽

Keyword(s):

Reinforcement Learning ◽

Problem Gambling ◽

Learning Models ◽

Behavioral Extinction ◽

Reinforcement Learning Models


Bayes factors for reinforcement-learning models of the Iowa gambling task.

Decision ◽

10.1037/dec0000040 ◽

2016 ◽

Vol 3 (2) ◽

pp. 115-131 ◽

Author(s):

Helen Steingroever ◽

Ruud Wetzels ◽

Eric-Jan Wagenmakers

Keyword(s):

Reinforcement Learning ◽

Iowa Gambling Task ◽

Bayes Factors ◽

Gambling Task ◽

Learning Models ◽

Reinforcement Learning Models


Analogical Reinforcement Learning With Two-Stage Memory Retrieval

PsycEXTRA Dataset ◽

10.1037/e528942014-705 ◽

2014 ◽

Author(s):

James Foster ◽

Matt Jones

Keyword(s):

Reinforcement Learning ◽

Memory Retrieval ◽


Effects of Working Memory Capacity on the Speed and Accuracy of Learning in Reinforcement Learning Models

PsycEXTRA Dataset ◽

10.1037/e528942014-552 ◽

2014 ◽

Author(s):

Adnane Ez-Zizi ◽

Simon Farrell ◽

David Leslie

Keyword(s):

Working Memory ◽

Reinforcement Learning ◽

Working Memory Capacity ◽

Memory Capacity ◽

Learning Models ◽

Reinforcement Learning Models ◽

Speed And Accuracy


Supplemental Material for Reinforcement Learning Models of Risky Choice and the Promotion of Risk-Taking by Losses Disguised as Wins in Rats

Journal of Experimental Psychology Animal Learning and Cognition ◽

10.1037/xan0000141.supp ◽

2017 ◽

Keyword(s):

Reinforcement Learning ◽

Risk Taking ◽

Risky Choice ◽

Learning Models ◽

Losses Disguised As Wins ◽

Reinforcement Learning Models
