Optimization algorithm using multi-agents and reinforcement learning

Author(s):  
Y. Kobayashi ◽  
E. Aiyoshi
2012 ◽  
Vol 121 (5) ◽  
Author(s):  
A. A. Bielskis ◽  
E. Guseinoviene ◽  
D. Dzemydiene ◽  
D. Drungilas ◽  
G. Gricius

2020 ◽  
Vol 2020 ◽  
pp. 1-14
Author(s):  
Lvjiang Yin ◽  
Meier Zhuang ◽  
Jing Jia ◽  
Huan Wang

Currently, energy saving is increasingly important. During the production procedure, energy saving can be achieved if the operational method and machine infrastructure are improved, but it also increases the complexity of flow-shop scheduling. Actually, as one of the data mining technologies, Grey Wolf Optimization Algorithm is widely applied to various mathematical problems in engineering. However, due to the immaturity of this algorithm, it still has some defects. Therefore, we propose an improved multiobjective model based on Grey Wolf Optimization Algorithm related to Kalman filter and reinforcement learning operator, where Kalman filter is introduced to make the solution set closer to the Pareto optimal front end. By means of reinforcement learning operator, the convergence speed and solving ability of the algorithm can be improved. After testing six benchmark functions, the results show that it is better than that of the original algorithm and other comparison algorithms in terms of search accuracy and solution set diversity. The improved multiobjective model based on Grey Wolf Optimization Algorithm proposed in this paper is conducive to solving energy saving problems in flow-shop scheduling problem, and it is of great practical value in engineering and management.


2019 ◽  
Vol 2019 ◽  
pp. 1-8
Author(s):  
Shaojiang Liu ◽  
Kejing Hu ◽  
Weichuan Ni ◽  
Zhiming Xu ◽  
Feng Wang ◽  
...  

In cognitive relay networks, the cognitive user opportunistically accesses the authorized spectrum segment of the primary user and simultaneously serves as the data relay node of the primary user while sharing the spectrum resource of the primary user. This not only improves the utilization efficiency of the network spectrum resources but also improves the throughput of the primary users. However, if the primary user randomly selects the relay node, there is no guarantee for an optimal throughput. Moreover, the system power consumption may increase. In order to improve the throughput of cognitive relay network and optimize system utility, this paper proposes a cognitive relay network throughput optimization algorithm based on deep reinforcement learning. For the system model of cognitive relay networks, the Markov decision process is used to describe the channel transition probability of the system model in the paper. The algorithm proposes a cooperative wireless network cooperative relay strategy, analyzes the system outage probability under different transmission modes, and optimizes the system throughput by minimizing the outage probability. Then, the maximum utility optimization strategy based on deep reinforcement learning is proposed to maximize the system utility revenue by selecting the optimal behavior. The experimental results show that the proposed algorithm has a good effect in improving system throughput and optimizing system energy efficiency.


Sign in / Sign up

Export Citation Format

Share Document