Catastrophic Interference
Recently Published Documents


TOTAL DOCUMENTS: 38 (last five years: 19)
H-INDEX: 7 (last five years: 1)

Author(s): Suryanarayana Maddu, Dominik Sturm, Christian L. Müller, Ivo F. Sbalzarini

We characterize and remedy a failure mode that can arise from multi-scale dynamics with scale imbalances during the training of deep neural networks, such as Physics Informed Neural Networks (PINNs). PINNs are popular machine-learning templates that allow for seamless integration of physical equation models with data. Their training amounts to solving an optimization problem over a weighted sum of data-fidelity and equation-fidelity objectives. Conflicts between objectives can arise from scale imbalances, heteroscedasticity in the data, stiffness of the physical equation, or from catastrophic interference during sequential training. We explain the resulting training pathology and propose a simple yet effective inverse Dirichlet weighting strategy to alleviate it. We compare with Sobolev training of neural networks, which provides a baseline of analytically ε-optimal training. We demonstrate the effectiveness of inverse Dirichlet weighting in several applications, including a multi-scale model of active turbulence, where we show orders-of-magnitude improvements in accuracy and convergence over conventional PINN training. For inverse modeling with sequential training, we find that inverse Dirichlet weighting protects a PINN against catastrophic forgetting.
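
The abstract does not spell out the weighting formula, but the core idea of inverse Dirichlet weighting is to rescale each training objective by the spread of its gradients so that no single objective dominates. Below is a minimal PyTorch-style sketch of such a gradient-variance-based rule; the function name and the exact normalization are illustrative assumptions, not the authors' reference implementation.

```python
import torch

def inverse_dirichlet_weights(loss_terms, params, eps=1e-8):
    # Weight each objective by the inverse of its gradient standard
    # deviation, so that all objectives contribute gradients of
    # comparable scale (assumed reading of inverse Dirichlet weighting).
    stds = []
    for loss in loss_terms:
        grads = torch.autograd.grad(loss, params, retain_graph=True,
                                    allow_unused=True)
        flat = torch.cat([g.flatten() for g in grads if g is not None])
        stds.append(flat.std().detach())
    max_std = max(stds)
    return [max_std / (s + eps) for s in stds]

# Hypothetical usage inside one PINN training step:
#   losses = [data_fidelity_loss, equation_residual_loss]
#   weights = inverse_dirichlet_weights(losses, list(model.parameters()))
#   total = sum(w * l for w, l in zip(weights, losses))
#   total.backward()
```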


2021 · Vol. 118 (39) · pp. e2017239118
Author(s): Zhen Zhang, Sandip Mondal, Subhasish Mandal, Jason M. Allred, Neda Alsadat Aghamiri, ...

Habituation and sensitization (nonassociative learning) are among the most fundamental forms of learning and memory behavior in organisms, enabling adaptation and learning in dynamic environments. Emulating, in the solid state, such features of intelligence found in nature can serve as inspiration for algorithmic simulations in artificial neural networks and for potential use in neuromorphic computing. Here, we demonstrate nonassociative learning with a prototypical Mott insulator, nickel oxide (NiO), under a variety of external stimuli at and above room temperature. As in biological species such as Aplysia, habituation and sensitization in NiO exhibit time-dependent plasticity that relies on both the strength of and the time interval between stimuli. A combination of experimental approaches and first-principles calculations reveals that this learning behavior of NiO results from dynamic modulation of its defect and electronic structure. An artificial neural network model inspired by such nonassociative learning is simulated and shows advantages on an unsupervised clustering task, both in accuracy and in reducing catastrophic interference, which could help mitigate the stability–plasticity dilemma. Mott insulators can therefore serve as building blocks to examine learning behavior noted in biology and to inspire new learning algorithms for artificial intelligence.
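
The habituation and sensitization dynamics described here (response decrement under repeated weak stimuli, enhancement after strong ones, and relaxation between stimuli) can be captured by a very small toy model. The NumPy sketch below is purely illustrative; the update rules and rate constants are assumptions, not the paper's fitted NiO model.

```python
import numpy as np

def respond(stimuli, dt=1.0, tau_rec=50.0, k_hab=0.2, k_sens=0.3, strong=0.8):
    # Toy nonassociative learning: a response gain w habituates under
    # repeated weak stimuli, is sensitized by strong ones, and relaxes
    # toward baseline between stimuli, so plasticity depends on both the
    # stimulus strength and the inter-stimulus interval.
    w, responses = 1.0, []
    for s in stimuli:
        w += (1.0 - w) * (1.0 - np.exp(-dt / tau_rec))  # recovery toward baseline
        if s >= strong:
            w += k_sens * s        # sensitization: stronger response
        elif s > 0:
            w -= k_hab * w * s     # habituation: response decrement
        responses.append(w * s)
    return responses

# Ten weak stimuli (habituation), one strong stimulus (sensitization), then weak again:
print(respond([0.3] * 10 + [0.9] + [0.3] * 5))
```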


2021
Author(s): Tiantian Zhang, Xueqian Wang, Bin Liang, Bo Yuan

The powerful learning ability of deep neural networks enables reinforcement learning (RL) agents to learn competent control policies directly from high-dimensional and continuous environments. In theory, to achieve stable performance, neural networks assume i.i.d. inputs, which unfortunately does not hold in the general RL paradigm, where the training data is temporally correlated and non-stationary. This issue may lead to the phenomenon of "catastrophic interference" (a.k.a. "catastrophic forgetting") and a collapse in performance, as later training is likely to overwrite and interfere with previously learned good policies. In this paper, we introduce the concept of "context" into single-task RL and develop a novel scheme, termed Context Division and Knowledge Distillation (CDaKD) driven RL, to divide all states experienced during training into a series of contexts. Its motivation is to mitigate the aforementioned catastrophic interference in deep RL, thereby improving the stability and plasticity of RL models. At the heart of CDaKD is a value function, parameterized by a neural network feature extractor shared across all contexts, and a set of output heads, each specializing in an individual context. In CDaKD, we exploit online clustering to achieve context division, and interference is further alleviated by a knowledge distillation regularization term on the output layers for learned contexts. In addition, to obtain an effective context division in high-dimensional state spaces (e.g., image inputs), we perform clustering in the lower-dimensional representation space of a randomly initialized convolutional encoder, which is fixed throughout training. Our results show that, with various replay memory capacities, CDaKD can consistently improve the performance of existing RL algorithms on classic OpenAI Gym tasks and the more complex high-dimensional Atari tasks, incurring only moderate computational overhead.
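
As a concrete reading of this architecture, the PyTorch sketch below implements the shared-extractor/multi-head value function and a distillation regularizer. Class and function names, network sizes, and the exact form of the distillation term are illustrative assumptions; context IDs are presumed to come from online clustering in the representation space of a frozen random encoder, as the abstract describes.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiHeadQNet(nn.Module):
    # Shared feature extractor with one output head per context.
    def __init__(self, obs_dim, n_actions, n_contexts, hidden=128):
        super().__init__()
        self.features = nn.Sequential(
            nn.Linear(obs_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU())
        self.heads = nn.ModuleList(
            [nn.Linear(hidden, n_actions) for _ in range(n_contexts)])

    def forward(self, obs, context_id):
        return self.heads[context_id](self.features(obs))

def cdakd_loss(net, frozen_net, obs, actions, td_target,
               context_id, learned_contexts, beta=1.0):
    # TD loss on the current context's head, plus a distillation term
    # that keeps previously learned contexts' heads close to a frozen
    # snapshot of the network (a sketch of the regularizer, not the
    # paper's exact formulation).
    q = net(obs, context_id).gather(1, actions.unsqueeze(1)).squeeze(1)
    loss = F.mse_loss(q, td_target)
    for c in learned_contexts:
        loss = loss + beta * F.mse_loss(net(obs, c),
                                        frozen_net(obs, c).detach())
    return loss
```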


Sensors · 2021 · Vol. 21 (8) · pp. 2678
Author(s): Sergey A. Lobov, Alexey I. Zharinov, Valeri A. Makarov, Victor B. Kazantsev

Cognitive maps and spatial memory are fundamental paradigms of brain functioning. Here, we present a spiking neural network (SNN) capable of generating an internal representation of the external environment and implementing spatial memory. The SNN initially has a non-specific architecture, which is then shaped by Hebbian-type synaptic plasticity. The network receives stimuli at specific loci, while memory retrieval operates as a functional SNN response in the form of population bursts. The SNN function is explored through its embodiment in a robot moving in an arena with safe and dangerous zones. We propose a measure of global network memory based on the synaptic vector field approach, which we use to validate the results and to compute information characteristics, including learning curves. We show that, after training, the SNN can effectively control the robot's cognitive behavior, allowing it to avoid dangerous regions in the arena. The learning is not perfect, however: the robot eventually revisits dangerous areas. Such behavior, also observed in animals, enables relearning in time-evolving environments. If a dangerous zone moves to another place, the SNN remaps the positive and negative areas, escaping the catastrophic interference phenomenon known from some AI architectures. Thus, the robot adapts to a changing world.
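
The synaptic vector field mentioned above can be sketched as follows: each neuron contributes a vector that sums the directions to its postsynaptic partners, weighted by synaptic strength, giving a global picture of where activity (and hence memory) tends to flow. This NumPy version is a plausible reading for illustration only; the paper's exact definition may differ.

```python
import numpy as np

def synaptic_vector_field(positions, W):
    # For each neuron i, sum unit vectors toward every postsynaptic
    # neuron j, weighted by synaptic strength W[i, j]; learned memory
    # traces show up as coherent regions of the resulting field.
    n = len(positions)
    field = np.zeros((n, 2))
    for i in range(n):
        for j in range(n):
            if i != j:
                d = positions[j] - positions[i]
                dist = np.linalg.norm(d)
                if dist > 0:
                    field[i] += W[i, j] * d / dist
    return field

# Random 20-neuron network on the unit square:
rng = np.random.default_rng(0)
pos = rng.random((20, 2))
W = rng.random((20, 20)) * 0.1
print(synaptic_vector_field(pos, W)[:3])
```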


2020 · Vol. 400 · pp. 73-85
Author(s): Sergey Sukhov, Mikhail Leontev, Alexander Miheev, Kirill Sviatov
