On-line EM Algorithm and Reinforcement Learning

Author(s):  
Shin Ishii ◽  
Masa-aki Sato
2001 ◽  
Vol 32 (5) ◽  
pp. 12-20 ◽  
Author(s):  
Junichiro Yoshimoto ◽  
Shin Ishii ◽  
Masa-aki Sato

Author(s):  
Atsushi Wada ◽  
Keiki Takadama
Learning Classifier Systems (LCSs) are rule-based adaptive systems that combine Reinforcement Learning (RL) with rule-discovery mechanisms for effective and practical on-line learning. With the aim of establishing a common theoretical basis between LCSs and RL algorithms so that findings can be shared across the two fields, a detailed analysis was performed to compare their learning processes. Building on our previous work deriving an equivalence between the Zeroth-level Classifier System (ZCS) and Q-learning with Function Approximation (FA), this paper extends the analysis to the effect of actually applying the conditions required for this equivalence. Comparative experiments reveal two implications: (1) ZCS's original parameter, the deduction rate, helps stabilize action selection, but (2) from the RL perspective, this process prevents accurate value estimation over the entire state-action space, which limits ZCS's performance on problems that require accurate value estimates.
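The ZCS/Q-learning correspondence can be made concrete with a minimal sketch of Q-learning under linear function approximation, in which each classifier is treated as a binary feature over states and its strength as the corresponding weight. The toy feature map and all parameter names below are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

# Sketch: Q-learning with linear function approximation where each
# "classifier" is a binary feature that matches a subset of states.
# Under the equivalence discussed above, a classifier's strength
# corresponds to a weight of this linear approximator.

n_states, n_actions, n_features = 10, 2, 6
alpha, gamma = 0.1, 0.9            # learning rate and discount factor

rng = np.random.default_rng(0)
# Fixed binary "match" matrix: feature_map[s, i] = 1 if classifier i matches state s.
feature_map = (rng.random((n_states, n_features)) < 0.4).astype(float)

# One weight vector per action; Q(s, a) = w[a] . phi(s)
w = np.zeros((n_actions, n_features))

def q(state, action):
    return w[action] @ feature_map[state]

def update(state, action, reward, next_state, done):
    """One Q-learning step: the TD error is distributed over the matching
    features, mirroring payoff distribution to the ZCS action set."""
    target = reward if done else reward + gamma * max(q(next_state, a) for a in range(n_actions))
    td_error = target - q(state, action)
    w[action] += alpha * td_error * feature_map[state]
    return td_error
```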


2004 ◽  
Vol 16 (3) ◽  
pp. 491-499 ◽  
Author(s):  
István Szita ◽  
András Lőrincz

There is growing interest in using Kalman filter models in brain modeling. The question arises whether Kalman filter models can be used on-line not only for estimation but also for control. The usual method of computing optimal control for a Kalman filter relies on off-line backward recursion, which is unsatisfactory for this purpose. Here, it is shown that a slight modification of the linear-quadratic-Gaussian (LQG) Kalman filter model overcomes this difficulty: it allows the optimal control to be estimated on-line by reinforcement learning. Moreover, the resulting learning rule for value estimation takes a Hebbian form weighted by the error of the value estimation.
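A minimal sketch may help illustrate the flavor of such a rule: for a quadratic value function V(x) = xᵀPx over an LQG-style system, an on-line temporal-difference update of P is an outer product of the state with itself (a Hebbian term) weighted by the value-estimation (TD) error. The dynamics, costs, and fixed feedback gain below are illustrative assumptions, not the paper's exact model.

```python
import numpy as np

# Sketch: on-line TD learning of a quadratic value function V(x) = x^T P x
# for a linear system with quadratic costs.  The update to P is the TD error
# times np.outer(x, x), i.e. a Hebbian-form rule weighted by the error of the
# value estimation.

rng = np.random.default_rng(0)

A = np.array([[1.0, 0.1], [0.0, 1.0]])   # state dynamics (assumed)
B = np.array([[0.0], [0.1]])             # control input matrix (assumed)
Q = np.eye(2)                            # state cost
R = np.array([[0.1]])                    # control cost
K = np.array([[0.5, 0.8]])               # fixed stabilizing feedback gain (assumed)

gamma, alpha = 0.95, 0.01
P = np.zeros((2, 2))                     # parameters of V(x) = x^T P x

x = rng.normal(size=2)
for t in range(5000):
    u = -K @ x                                                    # control action
    cost = x @ Q @ x + u @ R @ u                                  # instantaneous cost
    x_next = A @ x + (B @ u).ravel() + 0.01 * rng.normal(size=2)  # noisy dynamics
    td_error = cost + gamma * (x_next @ P @ x_next) - x @ P @ x   # value-estimation error
    P += alpha * td_error * np.outer(x, x)                        # Hebbian-form update
    x = x_next
```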

