Excitable networks for finite state computation with continuous time recurrent neural networks

We have recently shown that when initialized with “small” weights, recurrent neural networks (RNNs) with standard sigmoid-type activation functions are inherently biased toward Markov models; even prior to any training, RNN dynamics can be readily used to extract finite memory machines (Hammer & Tiňo, 2002; Tiňo, Čerňanský, & Beňušková, 2002a, 2002b). Following Christiansen and Chater (1999), we refer to this phenomenon as the architectural bias of RNNs. In this article, we extend our work on the architectural bias in RNNs by performing a rigorous fractal analysis of recurrent activation patterns. We assume the network is driven by sequences obtained by traversing an underlying finite-state transition diagram, a scenario that has been frequently considered in the past, for example, when studying RNN-based learning and implementation of regular grammars and finite-state transducers. We obtain lower and upper bounds on various types of fractal dimensions, such as box counting and Hausdorff dimensions. It turns out that not only can the recurrent activations inside RNNs with small initial weights be explored to build Markovian predictive models, but also the activations form fractal clusters, the dimension of which can be bounded by the scaled entropy of the underlying driving source. The scaling factors are fixed and are given by the RNN parameters.
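The Markovian bias described in this abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the authors' setup: a tanh activation stands in for the "sigmoid-type" function, and the network size, weight range, and helper names (`make_rnn`, `step`, `run`, `distance`) are hypothetical. With 4 hidden units and weights drawn from [-0.1, 0.1], the recurrent matrix has Frobenius norm at most 0.4, so each update is a contraction and two input histories that share their last k symbols end up at most 4 · 0.4^k apart, whatever came before.

```python
import math
import random


def make_rnn(n_hidden, n_symbols, scale, seed=0):
    """Draw recurrent (W) and input (V) weights uniformly from [-scale, scale]."""
    rng = random.Random(seed)
    W = [[rng.uniform(-scale, scale) for _ in range(n_hidden)] for _ in range(n_hidden)]
    V = [[rng.uniform(-scale, scale) for _ in range(n_symbols)] for _ in range(n_hidden)]
    return W, V


def step(W, V, state, symbol):
    """One update: s_i <- tanh(sum_j W[i][j] * s_j + V[i][symbol])."""
    return [
        math.tanh(sum(w * s for w, s in zip(W[i], state)) + V[i][symbol])
        for i in range(len(W))
    ]


def run(W, V, sequence):
    """Drive the untrained network from the zero state and return the final state."""
    state = [0.0] * len(W)
    for symbol in sequence:
        state = step(W, V, state, symbol)
    return state


def distance(a, b):
    """Euclidean distance between two state vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


# Assumed toy configuration: 4 hidden units, binary alphabet, small weights.
W, V = make_rnn(n_hidden=4, n_symbols=2, scale=0.1)

# Two histories with different beginnings but the same last 8 symbols.
suffix = [0, 1, 1, 0, 1, 0, 0, 1]
d = distance(run(W, V, [0, 1, 0, 0, 1] + suffix),
             run(W, V, [1, 1, 1, 0, 0] + suffix))

# States lie in (-1, 1)^4, so they start at most 4 apart; each shared symbol
# shrinks the gap by a factor of at most 0.4 (the Frobenius-norm bound on W).
bound = 4.0 * 0.4 ** len(suffix)
print(d <= bound)
```

Because the final state depends almost entirely on the most recent symbols, grouping states by proximity amounts to grouping histories by shared suffix, which is the sense in which an untrained small-weight network behaves like a finite memory model.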


Noisy recurrent neural networks: the continuous-time case

IEEE Transactions on Neural Networks ◽

10.1109/72.712164 ◽

1998 ◽

Vol 9 (5) ◽

pp. 913-936 ◽

Cited By ~ 17

Author(s):

S. Das ◽

O. Olurotimi

Keyword(s):

Neural Networks ◽

Recurrent Neural Networks ◽

Continuous Time


Stability Analysis of a General Class of Continuous-Time Recurrent Neural Networks

Advances in Neural Networks – ISNN 2009 - Lecture Notes in Computer Science ◽

10.1007/978-3-642-01507-6_40 ◽

2009 ◽

pp. 340-346

Author(s):

Chaojin Fu ◽

Zhongsheng Wang

Keyword(s):

Neural Networks ◽

Stability Analysis ◽

Recurrent Neural Networks ◽

Continuous Time ◽

General Class


A Continuous-time Learning Rule for Memristor–based Recurrent Neural Networks

2019 26th IEEE International Conference on Electronics, Circuits and Systems (ICECS) ◽

10.1109/icecs46596.2019.8964918 ◽

2019 ◽

Author(s):

Gianluca Zoppo ◽

Francesco Marrone ◽

Fernando Corinto

Keyword(s):

Neural Networks ◽

Recurrent Neural Networks ◽

Continuous Time ◽

Learning Rule


Stability Analysis of Evolved Continuous Time Recurrent Neural Networks that Balance a Double Inverted Pendulum on a Cart

2007 International Joint Conference on Neural Networks ◽

10.1109/ijcnn.2007.4371383 ◽

2007 ◽

Author(s):

Federico Vicentini

Keyword(s):

Neural Networks ◽

Stability Analysis ◽

Recurrent Neural Networks ◽

Continuous Time ◽

Inverted Pendulum


Biped locomotion control with evolved adaptive center-crossing continuous time recurrent neural networks

Neurocomputing ◽

10.1016/j.neucom.2012.01.009 ◽

2012 ◽

Vol 86 ◽

pp. 86-96 ◽

Cited By ~ 8

Author(s):

José Santos ◽

Ángel Campo

Keyword(s):

Neural Networks ◽

Recurrent Neural Networks ◽

Continuous Time ◽

Biped Locomotion ◽

Locomotion Control


Fluctuation-driven learning rule for continuous-time recurrent neural networks and its application to dynamical system control

Systems and Computers in Japan ◽

10.1002/1520-684x(200103)32:3<14::aid-scj2>3.0.co;2-u ◽

2001 ◽

Vol 32 (3) ◽

pp. 14-23 ◽

Cited By ~ 2

Author(s):

Kazuhisa Watanabe ◽

Takahiro Haba ◽

Noboru Kudo ◽

Takahumi Oohori

Keyword(s):

Dynamical System ◽

Neural Networks ◽

Recurrent Neural Networks ◽

Continuous Time ◽

Learning Rule ◽

System Control


Experimental Comparison of the Effect of Order in Recurrent Neural Networks

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001493000431 ◽

1993 ◽

Vol 07 (04) ◽

pp. 849-872 ◽

Cited By ~ 30

Author(s):

Clifford B. Miller ◽

C. Lee Giles

Keyword(s):

Neural Networks ◽

Recurrent Neural Networks ◽

Internal State ◽

Second Order ◽

Convergence Time ◽

Experimental Comparison ◽

Grammatical Inference ◽

Neural Net ◽

First Order ◽

Finite State

There has been much interest in increasing the computational power of neural networks, and in “designing” neural networks better suited to particular problems. Increasing the “order” of the connectivity of a neural network permits both. Though order has played a significant role in feedforward neural networks, its role in dynamically driven recurrent networks is still being understood. This work explores the effect of order in learning grammars. We present an experimental comparison of first order and second order recurrent neural networks, as applied to the task of grammatical inference. We show that for the small grammars studied these two neural net architectures have comparable learning and generalization power, and that both are reasonably capable of extracting the correct finite state automata for the language in question. However, for a larger randomly-generated ten-state grammar, second order networks significantly outperformed the first order networks, both in convergence time and generalization capability. We show that these networks learn faster the more neurons they have (our experiments used up to 10 hidden neurons), but that the solutions found by smaller networks are usually of better quality (in terms of generalization performance after training). Second order nets have the advantage that they converge more quickly to a solution and can find it more reliably than first order nets, but the second order solutions tend to be of poorer quality than the first order ones if both architectures are trained to the same error tolerance. Despite this, second order nets can more successfully extract finite state machines using heuristic clustering techniques applied to the internal state representations. We speculate that this may be due to restrictions on the ability of the first order architecture to fully make use of its internal state representation power, and that this may have implications for the performance of the two architectures when scaled up to larger problems.
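The abstract does not spell out the two update rules, so the sketch below assumes the formulation commonly used in the grammatical-inference literature: a first order network adds a weighted state term and a weighted input term, while a second order network weights products of state and input components. The function names, the logistic activation, and the sizes are hypothetical choices for illustration only.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def first_order_step(W, V, state, x):
    """s_i <- g(sum_j W[i][j] * s_j + sum_k V[i][k] * x_k); state and input add."""
    return [
        sigmoid(sum(W[i][j] * state[j] for j in range(len(state)))
                + sum(V[i][k] * x[k] for k in range(len(x))))
        for i in range(len(W))
    ]


def second_order_step(W, state, x):
    """s_i <- g(sum_j sum_k W[i][j][k] * s_j * x_k); state and input multiply."""
    return [
        sigmoid(sum(W[i][j][k] * state[j] * x[k]
                    for j in range(len(state))
                    for k in range(len(x))))
        for i in range(len(W))
    ]


# Assumed toy sizes: 3 state neurons, 2 input symbols encoded one-hot.
rng = random.Random(0)
n_state, n_input = 3, 2
W1 = [[rng.uniform(-1, 1) for _ in range(n_state)] for _ in range(n_state)]
V1 = [[rng.uniform(-1, 1) for _ in range(n_input)] for _ in range(n_state)]
W2 = [[[rng.uniform(-1, 1) for _ in range(n_input)] for _ in range(n_state)]
      for _ in range(n_state)]

state = [0.5, 0.1, 0.9]
x = [0.0, 1.0]  # one-hot encoding of symbol 1

s_first = first_order_step(W1, V1, state, x)
s_second = second_order_step(W2, state, x)
print(s_first)
print(s_second)
```

With a one-hot input, the second order update reduces to applying a separate state-to-state weight matrix for each symbol, which mirrors the transition table of a finite state automaton. In a first order network the input only shifts the pre-activation by a symbol-specific offset.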
