scholarly journals Constructing Temporally Extended Actions through Incremental Community Detection

2018 ◽  
Vol 2018 ◽  
pp. 1-13
Author(s):  
Xiao Xu ◽  
Mei Yang ◽  
Ge Li

Hierarchical reinforcement learning works on temporally extended actions or skills to facilitate learning. How to automatically form such abstraction is challenging, and many efforts tackle this issue in the options framework. While various approaches exist to construct options from different perspectives, few of them concentrate on options’ adaptability during learning. This paper presents an algorithm to create options and enhance their quality online. Both aspects operate on detected communities of the learning environment’s state transition graph. We first construct options from initial samples as the basis of online learning. Then a rule-based community revision algorithm is proposed to update graph partitions, based on which existing options can be continuously tuned. Experimental results in two problems indicate that options from initial samples may perform poorly in more complex environments, and our presented strategy can effectively improve options and get better results compared with flat reinforcement learning.

2011 ◽  
Vol 2011 (0) ◽  
pp. _1A1-L10_1-_1A1-L10_3
Author(s):  
Kensuke HARADA ◽  
Hiromu ONDA ◽  
Natsuki YAMANOBE ◽  
Eiichi YOSHIDA ◽  
Tokuo TSUJI ◽  
...  

2011 ◽  
Vol 48-49 ◽  
pp. 71-78 ◽  
Author(s):  
Min Hu ◽  
Fang Fang Wu ◽  
Bo Zhu ◽  
Bo Lu ◽  
Jing Lei Pu

It is important and difficult to identify the Hazard before a disaster happen because disaster often happens suddenly. This paper proposes a new method – State Transition Graph, which based on visual data space reconstruction, to identify hazard. The change process of the system state movement from one state to another in a certain period is described by some state transition graphs. The system state, which is safe or hazard, could be distinguished by its state transition graphs. This paper conducted experiments on single-dimension and multi-dimension benchmark data to prove the new method is effectiveness. Especially the result of stimulation experiments, based on the Yangtze River tunnel engineering data, showed that state transition graph identifies hazard easily and has better performances than other method. The State transition graph method is worth further researching.


Sign in / Sign up

Export Citation Format

Share Document