Evaluation on the robustness of Genetic Network Programming with reinforcement learning

Author(s):  
Shingo Mabu ◽  
Andre Tjahjadi ◽  
Siti Sendari ◽  
Kotaro Hirasawa
2007 ◽  
Vol 15 (3) ◽  
pp. 369-398 ◽  
Author(s):  
Shingo Mabu ◽  
Kotaro Hirasawa ◽  
Jinglu Hu

This paper proposes a graph-based evolutionary algorithm called Genetic Network Programming (GNP). Our goal is to develop GNP, which can deal with dynamic environments efficiently and effectively, based on the distinguished expression ability of the graph (network) structure. The characteristics of GNP are as follows. 1) GNP programs are composed of a number of nodes which execute simple judgment/processing, and these nodes are connected by directed links to each other. 2) The graph structure enables GNP to re-use nodes, thus the structure can be very compact. 3) The node transition of GNP is executed according to its node connections without any terminal nodes, thus the past history of the node transition affects the current node to be used and this characteristic works as an implicit memory function. These structural characteristics are useful for dealing with dynamic environments. Furthermore, we propose an extended algorithm, “GNP with Reinforcement Learning (GNPRL)” which combines evolution and reinforcement learning in order to create effective graph structures and obtain better results in dynamic environments. In this paper, we applied GNP to the problem of determining agents' behavior to evaluate its effectiveness. Tileworld was used as the simulation environment. The results show some advantages for GNP over conventional methods.


Author(s):  
Hiroyuki Hatakeyama ◽  
◽  
Shingo Mabu ◽  
Kotaro Hirasawa ◽  
Jinglu Hu ◽  
...  

A new graph-based evolutionary algorithm named “Genetic Network Programming, GNP” has been already proposed. GNP represents its solutions as graph structures, which can improve the expression ability and performance. In addition, GNP with Reinforcement Learning (GNP-RL) was proposed a few years ago. Since GNP-RL can do reinforcement learning during task execution in addition to evolution after task execution, it can search for solutions efficiently. In this paper, GNP with Actor-Critic (GNP-AC) which is a new type of GNP-RL is proposed. Originally, GNP deals with discrete information, but GNP-AC aims to deal with continuous information. The proposed method is applied to the controller of the Khepera simulator and its performance is evaluated.


Author(s):  
Siti Sendari ◽  
Arif Nur Afandi ◽  
Ilham Ari Elbaith Zaeni ◽  
Yogi Dwi Mahandi ◽  
Kotaro Hirasawa ◽  
...  

Author(s):  
Jin Zhou ◽  
◽  
Lu Yu ◽  
Shingo Mabu ◽  
Kaoru Shimada ◽  
...  

In order to increase the transportation capability of elevator group systems in high-rise buildings without adding elevator installation space, double-deck elevator systems (DDES) is developed as one of the next generation elevator group control systems. Artificial intelligence (AI) technologies have been employed to find some efficient solutions in the elevator group control systems during the late 20th century. Genetic Network Programming (GNP), a new evolutionary computation method, has been employed as the elevator group control system controller in some studies of recent years. Moreover, reinforcement learning (RL) has been also found to be useful for more improvements of elevator group control performances when it is combined with GNP. In this paper, we proposed a new approach of DDES using GNP with RL, and did some experiments on a simulated elevator group control system of a typical office building to evaluate its applicability and efficiency. Simulation results show that the DDES using GNP with RL performs better than the one without RL in regular and down-peak time, while both of them outperforms a conventional approach and a heuristic approach in all three traffic patterns.


Sign in / Sign up

Export Citation Format

Share Document