scholarly journals Local Policy-sharing Systems for Multi-agent Reinforcement Learning-An Approach from the Learning Classifier System

Author(s):  
Hiroyasu INOUE ◽  
Katsunori SHIMOHARA ◽  
Osamu KATAI
2013 ◽  
Vol 347-350 ◽  
pp. 416-420
Author(s):  
Yong Bin Ma

This paper proposed a robot reinforcement learning method based on learning classifier system. A learning Classifier System is a rule-based machine learning system that combines reinforcement learning and genetic algorithms. The reinforcement learning component is responsible for adjusting the strength of rules in the system according to some reward obtained from the environment. The genetic algorithm acts as an innovation discovery component which is responsible for discovering new better learning rules. The advantages of this approach are its rule-based representation, which can be easily reduce learning space, online learning ability, robustness .


2002 ◽  
Vol 10 (2) ◽  
pp. 75-96 ◽  
Author(s):  
Martin V Butz ◽  
Joachim Hoffmann

The concept of anticipations controlling behavior is introduced. Background is provided about the importance of anticipations from a psychological perspective. Based on the psychological background wrapped in a framework of anticipatory behavioral control, the anticipatory learning classifier system ACS2 is explained. ACS2 learns and generalizes on-line a predictive environmental model (a model that allows the prediction of future environmental states). The model is a subjective model, that is, no global state information is available to the agent. It is shown that ACS2 can simulate anticipatory learning processes and anticipatory controlled behavior by means of the model. The simulations of various rat experiments, previously conducted by Colwill and Rescorla, show that the incorporation of anticipations is indeed crucial for simulating the behavior observed in rats. Despite the simplicity of the tasks, we show that the observed behavior reaches beyond the capabilities of model-free reinforcement learning as well as model-based reinforcement learning without on-line generalization. Possible future impacts of anticipations in adaptive learning systems are outlined.


Sign in / Sign up

Export Citation Format

Share Document