Towards Self-organized Control: Using Neural Cellular Automata to Robustly Control a Cart-pole Agent

Neural cellular automata (Neural CA) are a recent framework used to model biological phenomena emerging from multicellular organisms. In these systems, artificial neural networks are used as update rules for cellular automata. Neural CA are end-to-end differentiable systems where the parameters of the neural network can be learned to achieve a particular task. In this work, we used neural CA to control a cart-pole agent. The observations of the environment are transmitted in input cells while the values of output cells are used as a readout of the system. We trained the model using deep-Q learning where the states of the output cells were used as the Q-value estimates to be optimized. We found that the computing abilities of the cellular automata were maintained over several hundreds of thousands of iterations, producing an emergent stable behavior in the environment it controls for thousands of steps. Moreover, the system demonstrated life-like phenomena such as a developmental phase, regeneration after damage, stability despite a noisy environment, and robustness to unseen disruption such as input deletion.

Download Full-text

STABILITY ANALYSIS OF CELLULAR AUTOMATA GENERATED CLUSTERS

Mathematical Modelling and Analysis ◽

10.3846/13926292.2005.9637273 ◽

2005 ◽

Vol 12 (1) ◽

pp. 83-90

Author(s):

R. Šiugždaite

Keyword(s):

Cellular Automata ◽

Power Law ◽

State Parameter ◽

Urban System ◽

Complex Behaviour ◽

Cellular Automata Model ◽

Self Organized ◽

Tai Ji ◽

Long Time ◽

Ca Model

The development of regional urban system still remains one of the main problems during the human race history. There are a lot of problems inside this system like overcrowded cities and decaying countryside. All these situations can be reproduced by modelling them using Cellular Automata (CA) [1, 2, 5]. CA models implement algorithms with simple rules and parameter controls, but the result can be a complex behaviour. A stability of naturally formed self‐organized urban system depends on its critical state parameter τ in the power law log(f(x)) = ‐τlog(x). If the system reaches self‐organized critical (SOC) state then it remains in it for a long time. The CA model URBACAM (URBAnistic Cellular Automata Model) describes the long‐lasting term behaviour and shows that the change in behaviour is sensitive to the urban parameter τ of the power law. Regionines urbanistines sistemos vystymasis išlieka viena iš opiausiu problemu žmonijos istorijoje. Keletas tokiu uždaviniu kaip miestu perpildymas, nykstančios kaimo vietoves ir t.t. gali būti nesunkiai modeliuojami naudojant lasteliu automatus (LA). LA metodas ypatingas tuo, kad realizuoja algoritma paprastu taisykliu bei parametru valdymo pagalba, tačiau rezultate galima gauti sudetinga elgsena. Natūraliai susiformavusiu urbanistiniu sistemu stabilumas priklauso nuo sistemos krizines savirangos būsenos (KSB) parametro τ. Jei sistema pasiekia KSB, tai ji ilga laika išlieka joje. LA modelis URBACAM charakterizuoja ilgalaike elgsena ir parodo, jog modelyje jos kitimus itakoja eksponentinio desnio urbanistinis parametras τ.

Download Full-text

Self-organized 3D-Printing Patterns Simulated by Cellular Automata

Recent Advances in Natural Computing - Mathematics for Industry ◽

10.1007/978-4-431-55429-5_1 ◽

2015 ◽

pp. 1-14 ◽

Cited By ~ 1

Author(s):

Yasusi Kanada

Keyword(s):

3D Printing ◽

Cellular Automata ◽

Self Organized

Download Full-text

Extended Q-Learning: Reinforcement Learning Using Self-Organized State Space

RoboCup 2000: Robot Soccer World Cup IV - Lecture Notes in Computer Science ◽

10.1007/3-540-45324-5_11 ◽

2001 ◽

pp. 129-138 ◽

Cited By ~ 2

Author(s):

Shuichi Enokida ◽

Takeshi Ohasi ◽

Takaichi Yoshida ◽

Toshiaki Ejima

Keyword(s):

Reinforcement Learning ◽

State Space ◽

Q Learning ◽

Self Organized ◽

Learning Reinforcement

Download Full-text

Towards Better Interpretability in Deep Q-Networks

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33014561 ◽

2019 ◽

Vol 33 ◽

pp. 4561-4569 ◽

Cited By ~ 2

Author(s):

Raghuram Mandyam Annasamy ◽

Katia Sycara

Keyword(s):

Neural Network ◽

Network Architecture ◽

Empirical Studies ◽

Superior Performance ◽

Training Algorithms ◽

Neural Network Architecture ◽

Q Learning ◽

The Neural Network ◽

Learning Techniques ◽

Out Of Sample

Deep reinforcement learning techniques have demonstrated superior performance in a wide variety of environments. As improvements in training algorithms continue at a brisk pace, theoretical or empirical studies on understanding what these networks seem to learn, are far behind. In this paper we propose an interpretable neural network architecture for Q-learning which provides a global explanation of the model’s behavior using key-value memories, attention and reconstructible embeddings. With a directed exploration strategy, our model can reach training rewards comparable to the state-of-the-art deep Q-learning models. However, results suggest that the features extracted by the neural network are extremely shallow and subsequent testing using out-of-sample examples shows that the agent can easily overfit to trajectories seen during training.

Download Full-text

ON MEMORY AND STRUCTURAL DYNAMISM IN EXCITABLE CELLULAR AUTOMATA WITH DEFENSIVE INHIBITION

International Journal of Bifurcation and Chaos ◽

10.1142/s0218127408020458 ◽

2008 ◽

Vol 18 (02) ◽

pp. 527-539 ◽

Cited By ~ 6

Author(s):

RAMÓN ALONSO-SANZ ◽

ANDREW ADAMATZKY

Keyword(s):

Cellular Automata ◽

Short Term Memory ◽

Temporal Dynamics ◽

Short Term ◽

Resting Cell ◽

Fixed Topology ◽

Dynamical Topology ◽

Novel Complex ◽

Spatio Temporal ◽

Update Rules

Commonly studied cellular automata are memoryless and have fixed topology of connections between cells. However by allowing updates of links and short-term memory in cells we may potentially discover novel complex regimes of spatio-temporal dynamics. Moreover, by adding memory and dynamical topology to state update rules we somehow forge elementary but nontraditional models of neurons networks (aka neuron layers in frontal parts). In the present paper, we demonstrate how this can be done on a self-inhibitory excitable cellular automata. These automata imitate a phenomenon of inhibition caused by hight-strength stimulus: a resting cell excites if there are one or two excited neighbors, the cell remains resting otherwise. We modify the automaton by allowing cells to have few-steps memories, and create links between neighboring cells removed or generated depending on the states of the cells.

Download Full-text