A hypothesis for basal ganglia-dependent reinforcement learning in the songbird

Neuroscience, 2011, Vol 198, pp. 152-170
Author(s): M.S. Fee, J.H. Goldberg

2021
Author(s): James McGregor, Abigail Grassler, Paul I. Jaffe, Amanda Louise Jacob, Michael Brainard, ...

Songbirds and humans share the ability to adaptively modify their vocalizations based on sensory feedback. Prior studies have focused primarily on the role that auditory feedback plays in shaping vocal output throughout life. In contrast, it is unclear whether and how non-auditory information drives vocal plasticity. Here, we first used a reinforcement learning paradigm to establish that non-auditory feedback can drive vocal learning in adult songbirds. We then assessed the role of a songbird basal ganglia-thalamocortical pathway critical to auditory vocal learning in this novel form of vocal plasticity. We found that both this circuit and its dopaminergic inputs are necessary for non-auditory vocal learning, demonstrating that this pathway is not specialized exclusively for auditory-driven vocal learning. The ability of this circuit to use both auditory and non-auditory information to guide vocal learning may reflect a general principle for the neural systems that support vocal plasticity across species.


2020, Vol 38 (1), pp. 49-64
Author(s): Hiroshi Yamakawa

Abstract
Recently, attention mechanisms have significantly boosted the performance of natural language processing using deep learning. An attention mechanism can select the information to be used, much as in a dictionary lookup; this information is then used, for example, to select the next word of an utterance in a sentence. In neuroscience, the basis of the function of sequentially selecting words is considered to be the cortico-basal ganglia-thalamocortical loop. Here, we first show that the attention mechanism used in deep learning corresponds to the mechanism by which the basal ganglia suppress thalamic relay cells in the brain. Next, we note that, in neuroscience, the output of the basal ganglia is associated with the action output of the actor in reinforcement learning. On this basis, we show that the aforementioned loop can be generalized as reinforcement learning that controls the transmission of the prediction signal so as to maximize the prediction reward. We call this attentional reinforcement learning (ARL). In ARL, the actor selects the information transmission route according to the attention, and the prediction signal changes according to the context detected by the information source of the route. Hence, ARL enables flexible action selection that depends on the situation, unlike traditional reinforcement learning, wherein the actor must directly select an action.
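The dictionary-lookup view of attention described above can be made concrete with a minimal sketch (illustrative only, not from the paper; the function name `attend` and the toy keys/values are invented): a query is scored against stored keys, the scores become a probability distribution, and that distribution gates which stored value is transmitted.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def attend(query, keys, values):
    # Dictionary lookup: score each stored key against the query, turn the
    # scores into a probability over entries, and return the corresponding
    # mixture of values -- the "selected" information.
    scores = keys @ query / np.sqrt(query.size)
    weights = softmax(scores)
    return weights @ values, weights

# Four stored entries; the query matches entry 2 exactly.
keys = np.eye(4)
values = np.array([[1.0, 0.0],
                   [0.0, 1.0],
                   [1.0, 1.0],
                   [0.5, 0.5]])
query = keys[2]

out, weights = attend(query, keys, values)
print(weights.argmax())  # entry 2 receives the largest weight
```

In the ARL reading sketched by the abstract, the `weights` vector plays the role of the basal ganglia's gating of thalamic relay: high weight corresponds to disinhibited transmission along one route.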


2005, Vol 13 (2), pp. 131-148
Author(s): Mehdi Khamassi, Loïc Lachèze, Benoît Girard, Alain Berthoz, Agnès Guillot

2017
Author(s): Rafal Bogacz

Abstract
This paper proposes how the neural circuits in vertebrates select actions on the basis of past experience and the current motivational state. According to the presented theory, the basal ganglia evaluate the utility of candidate actions by combining their positive consequences (e.g. nutrition), scaled by the motivational state (e.g. hunger), with their negative consequences (e.g. effort). The theory suggests how the basal ganglia compute utility by combining the positive and negative consequences encoded in the synaptic weights of striatal Go and No-Go neurons with the motivational state carried by neuromodulators, including dopamine. Furthermore, the theory suggests how striatal neurons learn separately about the consequences of actions, and how dopaminergic neurons themselves learn what level of activity they need to produce to optimize behaviour. The theory accounts for the effects of dopaminergic modulation on behaviour, for patterns of synaptic plasticity in the striatum, and for the responses of dopaminergic neurons in diverse situations.
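The utility computation the abstract describes can be sketched in a few lines (a toy illustration, not Bogacz's actual model; the action names, weight values, and the `utility`/`select` helpers are invented): Go weights store payoffs, No-Go weights store costs, and a dopamine-like motivation signal scales only the payoff side.

```python
# Hypothetical sketch: utility = motivation * (Go payoff) - (No-Go cost).
def utility(go, nogo, motivation):
    return motivation * go - nogo

# Invented actions: a large payoff at high effort vs. a small cheap one.
actions = {
    "forage_far":  {"go": 10.0, "nogo": 4.0},
    "forage_near": {"go": 3.0,  "nogo": 0.5},
}

def select(motivation):
    # Pick the action whose motivation-scaled utility is highest.
    return max(actions, key=lambda a: utility(actions[a]["go"],
                                              actions[a]["nogo"],
                                              motivation))

print(select(motivation=1.0))  # hungry: 10-4 = 6 beats 3-0.5 = 2.5 -> forage_far
print(select(motivation=0.2))  # sated: 2-4 = -2 loses to 0.6-0.5 = 0.1 -> forage_near
```

The point of the sketch is that the same stored weights yield different choices as the motivational signal changes, without any relearning, which is the behavioural signature the theory emphasizes.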


2006, Vol 16 (02), pp. 111-124
Author(s): D. SRIDHARAN, P. S. PRASHANTH, V. S. CHAKRAVARTHY

We present a computational model of the basal ganglia as a key player in exploratory behavior. The model describes the exploration of a virtual rat in a simulated water-pool experiment. The virtual rat is trained using a reward-based, or reinforcement, learning paradigm, which requires units with stochastic behavior to explore the system's state space. We model the Subthalamic Nucleus-Globus Pallidus externa (STN-GPe) segment of the basal ganglia as a pair of neuronal layers with oscillatory dynamics, exhibiting a variety of dynamic regimes such as chaos, traveling waves and clustering. Invoking the property of chaotic systems to explore state space, we suggest that the complex exploratory dynamics of the STN-GPe system, in conjunction with dopamine-based reward signaling from the Substantia Nigra pars compacta (SNc), provide the two key ingredients of a reinforcement learning system.
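The two ingredients named in the abstract can be caricatured in a few lines (a toy sketch under stated assumptions, not the authors' oscillator model): a chaotic logistic map stands in for STN-GPe exploratory dynamics, and a reward-prediction-error update stands in for SNc dopamine signaling, applied here to a two-armed bandit rather than a water pool.

```python
import random

def logistic(x, r=3.99):
    # Chaotic logistic map: the stand-in source of exploratory noise.
    return r * x * (1.0 - x)

values = [0.0, 0.0]        # learned action values
true_reward = [0.2, 0.8]   # arm 1 pays off more often
x = 0.31415                # chaotic state driving exploration
alpha = 0.1                # learning rate

random.seed(1)
for _ in range(2000):
    x = logistic(x)
    explore = x < 0.3      # low chaotic state occasionally forces exploration
    a = random.randrange(2) if explore else max((0, 1), key=lambda i: values[i])
    r = 1.0 if random.random() < true_reward[a] else 0.0
    values[a] += alpha * (r - values[a])   # dopamine-like prediction error

print(max((0, 1), key=lambda i: values[i]))  # greedy choice after training
```

The design choice mirrors the abstract's argument: deterministic chaos, rather than injected random noise, can supply the stochastic-looking variability that reinforcement learning needs, while the prediction-error term plays the role of the SNc reward signal.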

