Design of attitude and path tracking controllers for quad-rotor robots using reinforcement learning

Author(s):  
Sergio Ronaldo Barros dos Santos ◽  
Cairo Lucio Nascimento ◽  
Sidney Nascimento Givigi
Symmetry ◽  
2021 ◽  
Vol 14 (1) ◽  
pp. 31
Author(s):  
Jichang Ma ◽  
Hui Xie ◽  
Kang Song ◽  
Hao Liu

The path tracking control system is a crucial component for autonomous vehicles; it is challenging to realize accurate tracking control when approaching a wide range of uncertain situations and dynamic environments, particularly when such control must perform as well as, or better than, human drivers. While many methods provide state-of-the-art tracking performance, they tend to emphasize constant PID control parameters, calibrated by human experience, to improve tracking accuracy. A detailed analysis shows that PID controllers inefficiently reduce the lateral error under various conditions, such as complex trajectories and variable speed. In addition, intelligent driving vehicles are highly non-linear objects, and high-fidelity models are unavailable in most autonomous systems. As for the model-based controller (MPC or LQR), the complex modeling process may increase the computational burden. With that in mind, a self-optimizing, path tracking controller structure, based on reinforcement learning, is proposed. For the lateral control of the vehicle, a steering method based on the fusion of the reinforcement learning and traditional PID controllers is designed to adapt to various tracking scenarios. According to the pre-defined path geometry and the real-time status of the vehicle, the interactive learning mechanism, based on an RL framework (actor–critic—a symmetric network structure), can realize the online optimization of PID control parameters in order to better deal with the tracking error under complex trajectories and dynamic changes of vehicle model parameters. The adaptive performance of velocity changes was also considered in the tracking process. The proposed controlling approach was tested in different path tracking scenarios, both the driving simulator platforms and on-site vehicle experiments have verified the effects of our proposed self-optimizing controller. The results show that the approach can adaptively change the weights of PID to maintain a tracking error (simulation: within ±0.071 m; realistic vehicle: within ±0.272 m) and steering wheel vibration standard deviations (simulation: within ±0.04°; realistic vehicle: within ±80.69°); additionally, it can adapt to high-speed simulation scenarios (the maximum speed is above 100 km/h and the average speed through curves is 63–76 km/h).


Author(s):  
V. Ram Mohan Parimi ◽  
Devendra P. Garg

This paper deals with the design and optimization of a Fuzzy Logic Controller that is used in the obstacle avoidance and path tracking problems of mobile robot navigation. The Fuzzy Logic controller is tuned using reinforcement learning controlled Genetic Algorithm. The operator probabilities of the Genetic Algorithm are adapted using reinforcement learning technique. The reinforcement learning algorithm used in this paper is Q-learning, a recently developed reinforcement learning algorithm. The performance of the Fuzzy-Logic Controller tuned with reinforcement controlled Genetic Algorithm is then compared with the one tuned with uncontrolled Genetic Algorithm. The theory is applied to a two-wheeled mobile robot’s path tracking problem. It is shown that the performance of the Fuzzy-Logic controller tuned by Genetic Algorithm controlled via reinforcement learning is better than the performance of the Fuzzy-Logic controller tuned via uncontrolled Genetic Algorithm.


2019 ◽  
Vol 9 (13) ◽  
pp. 2649 ◽  
Author(s):  
Guoxing Bai ◽  
Yu Meng ◽  
Li Liu ◽  
Weidong Luo ◽  
Qing Gu ◽  
...  

At present, many path tracking controllers are unable to actively adjust the longitudinal velocity according to path information, such as the radius of the curve, to further improve tracking accuracy. For this problem, we propose a new path tracking framework based on model predictive control (MPC). This is a multilayer control system that includes three path tracking controllers with fixed velocities and a velocity decision controller. This new control method is named multilayer MPC. This new control method is compared to other control methods through simulation. In this paper, the maximum values of the displacement error and the heading error of multilayer MPC are 92.92% and 77.02%, respectively, smaller than those of nonlinear MPC. The real-time performance of multilayer MPC is very good, and parallel computation can further improve the real-time performance of this control method. In simulation results, the calculation time of multilayer MPC in each control period does not exceed 0.0130 s, which is much smaller than the control period. In addition, when the error of positioning systems is at the centimeter level, the performance of multilayer MPC is still good.


2021 ◽  
Vol 01 (01) ◽  
pp. 2150003
Author(s):  
Peng Wang ◽  
Di An ◽  
Ning Chen ◽  
Yang Quan Chen

Unlike time-based path tracking controllers, the [Formula: see text]-controller is a spatial path tracking controller. It is a purely geometric path tracking controller and essentially a P-controller to maintain the reasonable spatial distance, [Formula: see text], from the vehicle to the desired path. In this paper, we present some enhancement schemes using the non-conventional PI control laws via optimization. We propose to use a nonlinear term [Formula: see text] for the proportional controller. A fractional-order integral used to achieve a PI[Formula: see text] control. Among the schemes, an optimization search procedure applied to find optimal controller gains by meshing the regions around the values from approximate linear designs. The performance index for parametric optimization is the integration of the absolute purely spatial deviation from the desired path. Three different types of road shape were chosen and the Gazebo-ROS simulation results were presented to show the effectiveness of the proposed enhancement schemes. The results show that in some cases a smaller [Formula: see text] and [Formula: see text] can be achieved by using [Formula: see text] controller, but its disadvantage is there may be some oscillation. For PI[Formula: see text] controller, there is an additional adjustable parameter [Formula: see text], better performance can be achieved without significant disadvantages which is worth in-depth research.


Sign in / Sign up

Export Citation Format

Share Document