scholarly journals Towards Self-organized Control: Using Neural Cellular Automata to Robustly Control a Cart-pole Agent

2021 ◽  
Vol 1 ◽  
pp. 1-14
Author(s):  
Alexandre Variengien ◽  
◽  
Sidney Pontes-Filho ◽  
Tom Eivind Glover ◽  
Stefano Nichele ◽  
...  

Neural cellular automata (Neural CA) are a recent framework used to model biological phenomena emerging from multicellular organisms. In these systems, artificial neural networks are used as update rules for cellular automata. Neural CA are end-to-end differentiable systems where the parameters of the neural network can be learned to achieve a particular task. In this work, we used neural CA to control a cart-pole agent. The observations of the environment are transmitted in input cells while the values of output cells are used as a readout of the system. We trained the model using deep-Q learning where the states of the output cells were used as the Q-value estimates to be optimized. We found that the computing abilities of the cellular automata were maintained over several hundreds of thousands of iterations, producing an emergent stable behavior in the environment it controls for thousands of steps. Moreover, the system demonstrated life-like phenomena such as a developmental phase, regeneration after damage, stability despite a noisy environment, and robustness to unseen disruption such as input deletion.

2005 ◽  
Vol 12 (1) ◽  
pp. 83-90
Author(s):  
R. Šiugždaite

The development of regional urban system still remains one of the main problems during the human race history. There are a lot of problems inside this system like overcrowded cities and decaying countryside. All these situations can be reproduced by modelling them using Cellular Automata (CA) [1, 2, 5]. CA models implement algorithms with simple rules and parameter controls, but the result can be a complex behaviour. A stability of naturally formed self‐organized urban system depends on its critical state parameter τ in the power law log(f(x)) = ‐τlog(x). If the system reaches self‐organized critical (SOC) state then it remains in it for a long time. The CA model URBACAM (URBAnistic Cellular Automata Model) describes the long‐lasting term behaviour and shows that the change in behaviour is sensitive to the urban parameter τ of the power law. Regionines urbanistines sistemos vystymasis išlieka viena iš opiausiu problemu žmonijos istorijoje. Keletas tokiu uždaviniu kaip miestu perpildymas, nykstančios kaimo vietoves ir t.t. gali būti nesunkiai modeliuojami naudojant lasteliu automatus (LA). LA metodas ypatingas tuo, kad realizuoja algoritma paprastu taisykliu bei parametru valdymo pagalba, tačiau rezultate galima gauti sudetinga elgsena. Natūraliai susiformavusiu urbanistiniu sistemu stabilumas priklauso nuo sistemos krizines savirangos būsenos (KSB) parametro τ. Jei sistema pasiekia KSB, tai ji ilga laika išlieka joje. LA modelis URBACAM charakterizuoja ilgalaike elgsena ir parodo, jog modelyje jos kitimus itakoja eksponentinio desnio urbanistinis parametras τ.


Author(s):  
Raghuram Mandyam Annasamy ◽  
Katia Sycara

Deep reinforcement learning techniques have demonstrated superior performance in a wide variety of environments. As improvements in training algorithms continue at a brisk pace, theoretical or empirical studies on understanding what these networks seem to learn, are far behind. In this paper we propose an interpretable neural network architecture for Q-learning which provides a global explanation of the model’s behavior using key-value memories, attention and reconstructible embeddings. With a directed exploration strategy, our model can reach training rewards comparable to the state-of-the-art deep Q-learning models. However, results suggest that the features extracted by the neural network are extremely shallow and subsequent testing using out-of-sample examples shows that the agent can easily overfit to trajectories seen during training.


2008 ◽  
Vol 18 (02) ◽  
pp. 527-539 ◽  
Author(s):  
RAMÓN ALONSO-SANZ ◽  
ANDREW ADAMATZKY

Commonly studied cellular automata are memoryless and have fixed topology of connections between cells. However by allowing updates of links and short-term memory in cells we may potentially discover novel complex regimes of spatio-temporal dynamics. Moreover, by adding memory and dynamical topology to state update rules we somehow forge elementary but nontraditional models of neurons networks (aka neuron layers in frontal parts). In the present paper, we demonstrate how this can be done on a self-inhibitory excitable cellular automata. These automata imitate a phenomenon of inhibition caused by hight-strength stimulus: a resting cell excites if there are one or two excited neighbors, the cell remains resting otherwise. We modify the automaton by allowing cells to have few-steps memories, and create links between neighboring cells removed or generated depending on the states of the cells.


2013 ◽  
Vol 20 (3) ◽  
pp. 441-455 ◽  
Author(s):  
Ana Galindo-Serrano ◽  
Lorenza Giupponi

Sign in / Sign up

Export Citation Format

Share Document