Particle Swarm Optimization for Model Predictive Control in Reinforcement Learning Environments

Author(s):  
Daniel Hein ◽  
Alexander Hentschel ◽  
Thomas A. Runkler ◽  
Steffen Udluft

This chapter introduces a model-based reinforcement learning (RL) approach for continuous state and action spaces. While most RL methods try to find closed-form policies, the approach taken here employs numerical online optimization of control action sequences, following the strategy of nonlinear model predictive control. First, a general method for reformulating RL problems as optimization tasks is provided. Subsequently, particle swarm optimization (PSO) is applied to search for optimal solutions. This PSO policy (PSO-P) is effective for high-dimensional state spaces and does not require a priori assumptions about adequate policy representations. Furthermore, by translating RL problems into optimization tasks, the rich collection of real-world-inspired RL benchmarks is made available for benchmarking numerical optimization techniques. The effectiveness of PSO-P is demonstrated on two standard benchmarks, mountain car and cart-pole swing-up, and on a new industry-inspired benchmark, the so-called industrial benchmark.
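To make the reformulation concrete, the following minimal Python sketch expresses an RL problem as an optimization task: the decision variable is a sequence of actions, and the objective is the return predicted by a system model. The dynamics, goal threshold, and reward below are a simplified mountain-car-style illustration, not the chapter's exact benchmark setup.

```python
import numpy as np

# Sketch of the RL-to-optimization reformulation. Assumptions: simplified
# mountain-car-like dynamics, goal at pos >= 0.45, reward -1 per step until
# the goal is reached -- illustrative only, not the chapter's exact setup.

def toy_model(state, action):
    """One step of a simplified mountain-car-like system model."""
    pos, vel = state
    vel = vel + 0.001 * action - 0.0025 * np.cos(3.0 * pos)
    pos = pos + vel
    reward = 0.0 if pos >= 0.45 else -1.0
    return (pos, vel), reward

def sequence_fitness(state, actions):
    """Fitness of an action sequence: accumulated model-predicted reward."""
    total, s = 0.0, state
    for a in actions:
        s, r = toy_model(s, a)
        total += r
    return total

# Once stated this way, any numerical optimizer (PSO included) can be
# benchmarked on the task; as a trivial baseline, random search over
# 50-step action sequences:
rng = np.random.default_rng(0)
best_seq = max((rng.uniform(-1.0, 1.0, size=50) for _ in range(200)),
               key=lambda seq: sequence_fitness((-0.5, 0.0), seq))
```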

2016 ◽  
Vol 7 (3) ◽  
pp. 23-42 ◽  
Author(s):  
Daniel Hein ◽  
Alexander Hentschel ◽  
Thomas A. Runkler ◽  
Steffen Udluft

This article introduces a model-based reinforcement learning (RL) approach for continuous state and action spaces. While most RL methods try to find closed-form policies, the approach taken here employs numerical online optimization of control action sequences. First, a general method for reformulating RL problems as optimization tasks is provided. Subsequently, particle swarm optimization (PSO) is applied to search for optimal solutions. This particle swarm optimization policy (PSO-P) is effective for high-dimensional state spaces and does not require a priori assumptions about adequate policy representations. Furthermore, by translating RL problems into optimization tasks, the rich collection of real-world-inspired RL benchmarks is made available for benchmarking numerical optimization techniques. The effectiveness of PSO-P is demonstrated on two standard benchmarks: mountain car and cart pole.
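The PSO solver at the core of PSO-P can be summarized in a few lines. The sketch below is a generic global-best PSO with commonly used default constants (inertia weight w ≈ 0.72, acceleration coefficients c1 = c2 ≈ 1.49); it illustrates the general algorithm, not the article's exact configuration. To maximize a return such as the PSO-P fitness, negate the objective.

```python
import numpy as np

def pso_minimize(f, dim, n_particles=30, iters=200,
                 w=0.72, c1=1.49, c2=1.49, bounds=(-5.0, 5.0), seed=0):
    """Generic global-best PSO with commonly used default constants.
    To maximize a return (as in PSO-P), pass f = lambda x: -fitness(x)."""
    rng = np.random.default_rng(seed)
    lo, hi = bounds
    x = rng.uniform(lo, hi, (n_particles, dim))   # particle positions
    v = np.zeros_like(x)                          # particle velocities
    pbest, pbest_val = x.copy(), np.apply_along_axis(f, 1, x)
    gbest = pbest[pbest_val.argmin()].copy()      # global best position
    for _ in range(iters):
        r1, r2 = rng.random((2, n_particles, dim))
        v = w * v + c1 * r1 * (pbest - x) + c2 * r2 * (gbest - x)
        x = np.clip(x + v, lo, hi)
        vals = np.apply_along_axis(f, 1, x)
        better = vals < pbest_val                 # update personal bests
        pbest[better], pbest_val[better] = x[better], vals[better]
        gbest = pbest[pbest_val.argmin()].copy()  # update global best
    return gbest, pbest_val.min()

# Usage: minimize the sphere function in 5 dimensions.
g, val = pso_minimize(lambda p: float(np.sum(p * p)), dim=5)
```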


2019 ◽  
Vol 2019 ◽  
pp. 1-10 ◽  
Author(s):  
Leihua Feng ◽  
Feng Yang ◽  
Wei Zhang ◽  
Hong Tian

The direct-fired system with a duplex inlet-and-outlet ball mill exhibits strong hysteresis and nonlinearity, and the original control system struggles to meet the control requirements. Model predictive control (MPC) is well suited to such delay problems, but particle swarm optimization (PSO), the most commonly used rolling-optimization method, tends to become trapped in local minima and offers no parameter adaptation. First, a least-squares support vector machine (LS-SVM) model of the mill output is established and verified by simulation. Then a particle similarity function is introduced, and on its basis a parameter-adaptive particle swarm optimization algorithm (HPAPSO) is proposed, in which the inertia weight and acceleration coefficients of PSO are adjusted dynamically. Experiments on two common test functions in Matlab show that HPAPSO converges faster and more accurately than standard PSO. Finally, the new algorithm is combined with MPC to solve the control problem of the mill system. In simulation experiments, MPC based on HPAPSO (HPAPSO-MPC) is compared with MPC based on PAPSO (PAPSO-MPC) and with PID control; the results show that HPAPSO-MPC is more accurate and achieves better regulation performance than both.
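The abstract does not reproduce the similarity function itself, so the sketch below is an assumption-laden illustration of the adaptation idea: measure how tightly the swarm has clustered around the global best, and map that similarity to the inertia weight and acceleration coefficients, favoring exploration when the swarm has collapsed and exploitation when it is dispersed. These functions would replace the fixed w, c1, c2 in a standard PSO loop such as the one sketched above.

```python
import numpy as np

def swarm_similarity(x, gbest, span):
    """Illustrative similarity measure (an assumption, not the paper's
    exact function): ~1.0 when all particles sit on the global best,
    ~0.0 when the swarm is spread across the whole search space."""
    return 1.0 - np.mean(np.linalg.norm(x - gbest, axis=1)) / span

def adapt_parameters(sim, w_range=(0.4, 0.9), c_range=(0.5, 2.5)):
    """Map similarity to PSO parameters: a collapsed swarm (high sim)
    gets a larger inertia weight and cognitive coefficient to re-explore;
    a dispersed swarm gets a larger social coefficient to exploit."""
    sim = float(np.clip(sim, 0.0, 1.0))
    w = w_range[0] + sim * (w_range[1] - w_range[0])
    c1 = c_range[0] + sim * (c_range[1] - c_range[0])
    c2 = c_range[1] - sim * (c_range[1] - c_range[0])
    return w, c1, c2

# Usage inside one PSO iteration (hypothetical swarm state):
x = np.random.default_rng(1).uniform(-1.0, 1.0, (30, 5))
gbest = x[0]
sim = swarm_similarity(x, gbest, span=2.0 * np.sqrt(5))  # diagonal of [-1, 1]^5
w, c1, c2 = adapt_parameters(sim)
```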


Author(s):  
Ryohei Suzuki ◽  
Fukiko Kawai ◽  
Hideyuki Ito ◽  
Chikashi Nakazawa ◽  
Yoshikazu Fukuyama ◽  
...  
