Path Tracking Control of Non-holonomic Car-Like Robot with Reinforcement Learning

The path tracking control system is a crucial component of autonomous vehicles. Accurate tracking is hard to achieve across a wide range of uncertain situations and dynamic environments, particularly when the controller must perform as well as, or better than, human drivers. While many methods provide state-of-the-art tracking performance, they tend to rely on constant PID control parameters, calibrated by human experience, to improve tracking accuracy. A detailed analysis shows that such PID controllers reduce the lateral error inefficiently under varying conditions, such as complex trajectories and variable speed. In addition, intelligent driving vehicles are highly non-linear systems, and high-fidelity models are unavailable in most autonomous systems. For model-based controllers (MPC or LQR), the complex modeling process may increase the computational burden. With that in mind, a self-optimizing path tracking controller structure based on reinforcement learning is proposed. For lateral control of the vehicle, a steering method that fuses reinforcement learning with a traditional PID controller is designed to adapt to various tracking scenarios. Using the pre-defined path geometry and the real-time status of the vehicle, an interactive learning mechanism based on an RL framework (actor–critic, a symmetric network structure) optimizes the PID control parameters online, in order to better handle the tracking error under complex trajectories and dynamic changes in vehicle model parameters. Adaptation to velocity changes was also considered in the tracking process. The proposed approach was tested in different path tracking scenarios; both driving simulator platforms and on-site vehicle experiments verified the effect of the self-optimizing controller.
The results show that the approach can adaptively change the PID weights to keep the tracking error bounded (simulation: within ±0.071 m; real vehicle: within ±0.272 m) and to limit the standard deviation of steering wheel vibration (simulation: within ±0.04°; real vehicle: within ±80.69°). It can also adapt to high-speed simulation scenarios (maximum speed above 100 km/h; average speed through curves 63–76 km/h).
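The idea of a PID steering controller whose gains are rewritten online can be illustrated with a minimal sketch. This is not the authors' implementation: the class `AdaptivePID`, the function `placeholder_policy`, and all numeric values are hypothetical, and the placeholder simply stands in for the trained actor network that the abstract describes.

```python
from dataclasses import dataclass


@dataclass
class AdaptivePID:
    """PID controller whose gains may be overwritten before every step."""
    kp: float
    ki: float
    kd: float
    integral: float = 0.0
    prev_error: float = 0.0

    def set_gains(self, kp: float, ki: float, kd: float) -> None:
        # In an RL-tuned setup, an actor network would supply these values.
        self.kp, self.ki, self.kd = kp, ki, kd

    def step(self, error: float, dt: float) -> float:
        # Standard discrete PID law applied to the lateral tracking error.
        self.integral += error * dt
        derivative = (error - self.prev_error) / dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


def placeholder_policy(lateral_error: float, speed: float) -> tuple[float, float, float]:
    """Hypothetical stand-in for a learned actor: maps state to PID gains.

    Assumption for illustration only: the proportional gain shrinks as
    speed grows; the integral and derivative gains stay fixed.
    """
    kp = 0.8 / (1.0 + 0.05 * speed)
    return kp, 0.01, 0.1


# One control step: query the policy, update the gains, compute steering.
pid = AdaptivePID(kp=0.5, ki=0.0, kd=0.0)
pid.set_gains(*placeholder_policy(lateral_error=0.2, speed=10.0))
steer = pid.step(error=0.2, dt=0.1)
```

In a full actor–critic loop, the critic would score each gain choice against the resulting tracking error, and the actor would be updated accordingly; that training loop is omitted here.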

Multi-Kernel Online Reinforcement Learning for Path Tracking Control of Intelligent Vehicles

IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2020, pp. 1-14

DOI: 10.1109/tsmc.2020.2966631

Cited By ~ 3

Author(s): Jiahang Liu, Zhenhua Huang, Xin Xu, Xinglong Zhang, Shiliang Sun, ...

Keyword(s): Reinforcement Learning, Tracking Control, Path Tracking, Intelligent Vehicles

Path Tracking Control of Hybrid-driven Robotic Fish Based on Deep Reinforcement Learning

2020 IEEE International Conference on Mechatronics and Automation (ICMA), 2020

DOI: 10.1109/icma49215.2020.9233667

Author(s): Liangping Ma, Zhenjia Yue, Runfeng Zhang

Keyword(s): Reinforcement Learning, Tracking Control, Path Tracking, Robotic Fish

Three-Dimensional Path Tracking Control of Autonomous Underwater Vehicle Based on Deep Reinforcement Learning

Journal of Marine Science and Engineering, 2019, Vol 7 (12), pp. 443

DOI: 10.3390/jmse7120443

Cited By ~ 1

Author(s): Yushan Sun, Chenming Zhang, Guocheng Zhang, Hao Xu, Xiangrui Ran

Keyword(s): Reinforcement Learning, Tracking Control, Disturbance Observer, Autonomous Underwater Vehicle, Sliding Mode, Three Dimensional, Underwater Vehicle, Path Tracking, Learning Ability, Reward Function

In this paper, the three-dimensional (3D) path tracking control of an autonomous underwater vehicle (AUV) under the action of sea currents was studied. A novel reward function was proposed to improve learning ability, and a disturbance observer was developed to estimate the disturbance caused by currents. The dynamic and kinematic models of the AUV were established on the basis of existing models. Deep Deterministic Policy Gradient, a deep reinforcement learning algorithm, was used to design the path tracking controller. Compared with the backstepping sliding mode controller, the proposed controller showed excellent performance, at least in the particular study developed in this article. The improved reward function and the disturbance observer were also found to improve path tracking performance.
