Is Gradient Descent Update Consistent with Accuracy-Based Learning Classifier System?

Author(s): Atsushi Wada, Keiki Takadama

Learning Classifier Systems (LCSs) are rule-based adaptive systems that combine Reinforcement Learning (RL) and rule-discovery mechanisms for effective and practical online learning. The reinforcement process of XCS, one of the currently mainstream LCSs, is analyzed from the perspective of RL. Comparing XCS's update method with gradient-descent-based parameter updates in RL reveals differences in three elements: (1) the residual term, (2) the gradient term, and (3) the payoff definition. All possible combinations of the variants of each element are implemented and tested on multi-step benchmark problems. The tests reveal that only a few specific combinations work effectively with XCS's accuracy-based rule-discovery process, while the pure gradient-descent-based update shows the worst performance.
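To make the contrast concrete, the following is a minimal Python sketch of the two update styles the abstract compares: the standard XCS Widrow-Hoff prediction update and a gradient-descent-style variant in which the residual and the per-classifier step differ. The classifier fields, parameter names, and the exact weighting are illustrative assumptions rather than the paper's notation.

```python
# Hedged sketch: contrasting an XCS-style prediction update with a
# gradient-descent-style TD update. Field and parameter names
# (prediction, fitness, beta) are assumptions for illustration only.

def xcs_update(action_set, payoff, beta=0.2):
    """Widrow-Hoff update used by standard XCS: each classifier in the
    action set moves its prediction toward the payoff P."""
    for cl in action_set:
        cl["prediction"] += beta * (payoff - cl["prediction"])

def gradient_descent_update(action_set, payoff, beta=0.2):
    """Gradient-descent-style variant: the residual compares the payoff
    with the fitness-weighted system prediction, and each classifier's
    step is scaled by its normalized fitness (the gradient term)."""
    fitness_sum = sum(cl["fitness"] for cl in action_set) or 1e-9
    system_prediction = sum(cl["fitness"] * cl["prediction"]
                            for cl in action_set) / fitness_sum
    residual = payoff - system_prediction
    for cl in action_set:
        cl["prediction"] += beta * residual * (cl["fitness"] / fitness_sum)
```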

2002, Vol. 10 (2), pp. 185-205
Author(s): Larry Bull, Jacob Hurst

Learning classifier systems traditionally use genetic algorithms to facilitate rule discovery, where rule fitness is payoff based. Current research has shifted to the use of accuracy-based fitness. This paper re-examines the use of a particular payoff-based learning classifier system—ZCS. By using simple difference equation models of ZCS, we show that this system is capable of optimal performance subject to appropriate parameter settings. This is demonstrated for both single- and multistep tasks. Optimal performance of ZCS in well-known, multistep maze tasks is then presented to support the findings from the models.
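As an illustration of the modelling style mentioned above, here is a hedged sketch of a simple difference-equation model of expected classifier strengths in a single-step task under ZCS-like updates. The two-rule scenario, the tax and learning-rate values, and the update form are illustrative assumptions, not the paper's actual model.

```python
# Hedged sketch: iterating expected strengths of one correct and one
# incorrect rule that both match every state, under ZCS-like updates
# (Widrow-Hoff move toward the reward when acting, a tax when matching
# but not acting). All values are assumptions for illustration.

def zcs_single_step_model(beta=0.2, tau=0.1, reward=1000.0, steps=200):
    s_correct, s_wrong = 20.0, 20.0   # initial strengths
    for _ in range(steps):
        p_correct = s_correct / (s_correct + s_wrong)  # roulette-wheel selection
        # Correct rule: rewarded when chosen, taxed otherwise.
        s_correct += (p_correct * beta * (reward - s_correct)
                      - (1 - p_correct) * tau * s_correct)
        # Wrong rule: receives no reward when chosen, taxed otherwise.
        s_wrong += ((1 - p_correct) * beta * (0.0 - s_wrong)
                    - p_correct * tau * s_wrong)
    return s_correct, s_wrong

print(zcs_single_step_model())  # the correct rule's strength should dominate
```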


1994, Vol. 2 (1), pp. 19-36
Author(s): Robert E. Smith, H. Brown Cribbs

This paper suggests a simple analogy between learning classifier systems (LCSs) and neural networks (NNs). By clarifying the relationship between LCSs and NNs, the paper indicates how techniques from one can be utilized in the other. The paper points out that the primary distinguishing characteristic of the LCS is its use of a co-adaptive genetic algorithm (GA), where the end product of evolution is a diverse population of individuals that cooperate to perform useful computation. This stands in contrast to typical GA/NN schemes, where a population of networks is employed to evolve a single, optimized network. To fully illustrate the LCS/NN analogy used in this paper, an LCS-like NN is implemented and tested. The test is constructed to run parallel to a similar GA/NN study that did not employ a co-adaptive GA. The test illustrates the LCS/NN analogy and suggests an interesting new method for applying GAs in NNs. Final comments discuss extensions of this work and suggest how LCS and NN studies can further benefit each other.
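A small hedged sketch of the co-adaptive ingredient discussed above: fitness sharing within niches, which pushes evolution toward a diverse population of cooperating individuals rather than a single optimized one. The niche-key function and the sharing rule are illustrative assumptions, not the paper's implementation.

```python
# Hedged sketch: niche-based fitness sharing. niche_of(ind) must return a
# hashable key identifying the part of the problem an individual covers;
# both it and the division rule are assumptions for illustration.

from collections import Counter

def shared_fitness(population, raw_fitness, niche_of):
    """Divide each individual's raw fitness by the size of its niche,
    discouraging the population from collapsing onto a single solution."""
    counts = Counter(niche_of(ind) for ind in population)
    return [raw_fitness(ind) / counts[niche_of(ind)] for ind in population]
```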


Author(s): Atsushi Wada, Keiki Takadama

Learning Classifier Systems (LCSs) are rule-based adaptive systems that have both Reinforcement Learning (RL) and rule-discovery mechanisms for effective and practical on-line learning. With the aim of establishing a common theoretical basis between LCSs and RL algorithms to share each field's findings, a detailed analysis was performed to compare the learning processes of these two approaches. Based on our previous work on deriving an equivalence between the Zeroth-level Classifier System (ZCS) and Q-learning with Function Approximation (FA), this paper extends the analysis to the influence of actually applying the conditions for this equivalence. Comparative experiments have revealed interesting implications: (1) ZCS's original parameter, the deduction rate, plays a role in stabilizing the action selection, but (2) from the Reinforcement Learning perspective, such a process inhibits the ability to accurately estimate values for the entire state-action space, thus limiting the performance of ZCS in problems requiring accurate value estimation.
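The following is a minimal sketch of the kind of correspondence discussed above, treating the sum of strengths of the classifiers in the action set as a Q-value represented with linear function approximation over binary "membership" features. The scaling of the update and the parameter names are illustrative assumptions; the published equivalence is derived under more specific conditions.

```python
# Hedged sketch: one way to view a ZCS-like strength update as Q-learning
# with linear FA. Weights are classifier strengths; binary features mark
# which classifiers are in the action set. Scaling terms are assumptions.

import numpy as np

def system_prediction(strengths, active):
    """Q(s, a): sum of strengths of the classifiers in the action set."""
    return strengths[active].sum()

def td_update(strengths, active, reward, next_prediction, alpha=0.2, gamma=0.71):
    """One TD step over the binary 'membership' features.

    A plain linear-FA gradient step would add alpha * delta to every active
    weight; dividing by the action-set size mirrors payoff sharing among the
    action set's members (an assumption of this sketch)."""
    delta = reward + gamma * next_prediction - system_prediction(strengths, active)
    strengths[active] += alpha * delta / len(active)
    return strengths
```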


The growing body of evidence and the continuing spread of COVID-19 have shown that limiting the rate at which infected individuals expose uninfected individuals has become a pressing challenge, and this demands a smart method for COVID-19 contact tracing. This paper reviews and analyses the available contact tracing models, the contact tracing applications used by 36 countries, their underlying classifier systems and techniques for COVID-19 contact tracing, machine learning classifier methods, and the ways in which these classifiers are evaluated. The incremental method was adopted because it results in a step-by-step rule set that continually changes. Three categories of learning classifier systems were also studied, and the combination of smartphone Bluetooth (BLE) and a Michigan-style learning classifier system is recommended because it offers short-range communication that is available regardless of the operating system and classifies quickly on the basis of set rules.


Author(s): Maciej Troć, Olgierd Unold

Self-adaptation of parameters in a learning classifier system ensemble machine

Self-adaptation is a key feature of evolutionary algorithms (EAs). Although EAs have been used successfully to solve a wide variety of problems, the performance of this technique depends heavily on the selection of the EA parameters. Moreover, the process of setting such parameters is considered a time-consuming task. Several research works have tried to deal with this problem; however, the construction of algorithms letting the parameters adapt themselves to the problem is a critical and open problem of EAs. This work proposes a novel ensemble machine learning method that is able to learn rules, solve problems in a parallel way and adapt the parameters used by its components. A self-adaptive ensemble machine consists of simultaneously working extended classifier systems (XCSs). The proposed ensemble machine may be treated as a meta classifier system. The new self-adaptive XCS-based ensemble machine was compared with two other XCS-based ensembles on one-step binary problems (Multiplexer, One Counts, Hidden Parity, and randomly generated Boolean functions), including noisy versions. Results of the experiments have shown the ability of the model to adapt the mutation rate and the tournament size. The results are analyzed in detail.
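As an illustration of the self-adaptation idea, here is a hedged sketch in which each classifier carries its own mutation rate that is first perturbed (evolution-strategy style) and then used to mutate the classifier's condition. The log-normal perturbation, the ternary alphabet, and the bounds are illustrative assumptions, not the paper's exact mechanism.

```python
# Hedged sketch: per-classifier self-adaptive mutation. The rate itself is
# mutated before being applied; constants and alphabet are assumptions.

import math, random

def self_adaptive_mutate(condition, mu, tau=0.1, alphabet=("0", "1", "#")):
    """Perturb the classifier's own mutation rate, then apply it to its
    ternary condition string, returning the new condition and new rate."""
    mu = min(1.0, max(1e-4, mu * math.exp(tau * random.gauss(0.0, 1.0))))
    mutated = [
        random.choice([s for s in alphabet if s != symbol])
        if random.random() < mu else symbol
        for symbol in condition
    ]
    return mutated, mu
```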


2013, Vol. 21 (3), pp. 361-387
Author(s): Richard J. Preen, Larry Bull

A number of representation schemes have been presented for use within learning classifier systems, ranging from binary encodings to artificial neural networks. This paper presents results from an investigation into using a temporally dynamic symbolic representation within the XCSF learning classifier system. In particular, dynamical arithmetic networks are used to represent the traditional condition-action production system rules to solve continuous-valued reinforcement learning problems and to perform symbolic regression, finding competitive performance with traditional genetic programming on a number of composite polynomial tasks. In addition, the network outputs are repeatedly sampled at varying temporal intervals to perform multistep-ahead predictions of a financial time series.
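For context, the sketch below shows XCSF's conventional computed prediction and its normalised delta-rule weight update, which the dynamical arithmetic networks described above replace; the networks themselves are not reproduced here. Parameter names follow common XCSF descriptions and are assumptions with respect to this paper.

```python
# Hedged sketch: XCSF-style computed prediction with a normalised delta
# rule. x0 is the usual constant augmenting the input vector.

import numpy as np

def computed_prediction(weights, x, x0=1.0):
    """p(x) = w0*x0 + w1*x1 + ... over the augmented input vector."""
    return float(weights @ np.concatenate(([x0], x)))

def delta_rule_update(weights, x, target, eta=0.2, x0=1.0):
    """Move the weights toward the target payoff; the step is normalised
    by the squared norm of the augmented input."""
    x_aug = np.concatenate(([x0], x))
    error = target - float(weights @ x_aug)
    return weights + eta * error * x_aug / float(x_aug @ x_aug)
```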


2009, Vol. 17 (3), pp. 307-342
Author(s): Jaume Bacardit, Natalio Krasnogor

In this paper we empirically evaluate several local search (LS) mechanisms that heuristically edit classification rules and rule sets to improve their performance. Two kinds of operators are studied: (1) rule-wise operators, which edit individual rules, and (2) a rule set-wise operator, which takes the rules from N parents (N ≥ 2) and generates a new offspring by selecting the minimum subset of candidate rules that obtains maximum training accuracy. Moreover, various ways of integrating these operators within the evolutionary cycle of learning classifier systems are studied. The combinations of LS operators and policies are integrated into a Pittsburgh-approach framework that we call MPLCS, for Memetic Pittsburgh Learning Classifier System. MPLCS is systematically evaluated using various metrics. Several datasets were employed with the objective of identifying which combinations of operators and policies scale well, are robust to noise, generate compact solutions, and use the least amount of computational resources to solve the problems.
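A hedged sketch of one plausible form of the rule set-wise operator: greedy forward selection of a small subset of candidate rules (pooled from the parents) that maximises training accuracy. The greedy strategy and the rule and accuracy interfaces are illustrative assumptions, not MPLCS's exact algorithm.

```python
# Hedged sketch: greedy forward selection of rules. accuracy(rule_set,
# training_set) is assumed to return the fraction of correctly classified
# examples; rules are added only while they improve that fraction.

def greedy_rule_subset(candidate_rules, training_set, accuracy):
    selected, best = [], 0.0
    improved = True
    while improved:
        improved = False
        for rule in candidate_rules:
            if rule in selected:
                continue
            score = accuracy(selected + [rule], training_set)
            if score > best:
                best, best_rule, improved = score, rule, True
        if improved:
            selected.append(best_rule)
    return selected
```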


2007, Vol. 13 (1), pp. 69-86
Author(s): Matthew Studley, Larry Bull

We investigate the performance of a learning classifier system in some simple multi-objective, multi-step maze problems, using both random and biased action-selection policies for exploration. Results show that the choice of action-selection policy can significantly affect the performance of the system in such environments. Further, this effect is directly related to population size, and we relate this finding to recent theoretical studies of learning classifier systems in single-step problems.
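A minimal sketch of the two kinds of exploration policy compared above: uniform random selection over the advocated actions versus a biased policy that exploits the highest-predicting action most of the time. The prediction-array interface and the exploration probability are illustrative assumptions.

```python
# Hedged sketch: random vs biased action selection over a prediction
# array (a dict mapping each advocated action to its predicted payoff).

import random

def random_policy(prediction_array):
    """Pick uniformly among the advocated actions."""
    return random.choice(list(prediction_array))

def biased_policy(prediction_array, p_explore=0.5):
    """Explore uniformly with probability p_explore, otherwise exploit
    the action with the highest predicted payoff."""
    if random.random() < p_explore:
        return random.choice(list(prediction_array))
    return max(prediction_array, key=prediction_array.get)
```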

