An Integrative Theory of Anterior Cingulate Cortex Function: Option Selection in Hierarchical Reinforcement Learning

Despite continual debate for the past thirty years about the function of anterior cingulate cortex (ACC), its key contribution to neurocognition remains unknown. Here we review computational models that illustrate three core principles of ACC function (related to hierarchy, world models and cost), as well as four constraints on the neural implementation of these principles (related to modularity, binding, encoding and learning and regulation). These observations suggest a role for ACC in hierarchical model-based hierarchical reinforcement learning, which instantiates a mechanism for motivating the execution of high-level plans.

Download Full-text

Anterior cingulate cortex represents action-state predictions and causally mediates model-based reinforcement learning in a two-step decision task

10.1101/126292 ◽

2017 ◽

Cited By ~ 8

Author(s):

Thomas Akam ◽

Ines Rodrigues-Vaz ◽

Ivo Marcelo ◽

Xiangyu Zhang ◽

Michael Pereira ◽

...

Keyword(s):

Reinforcement Learning ◽

Anterior Cingulate Cortex ◽

State Transition ◽

Cingulate Cortex ◽

Anterior Cingulate ◽

State Transitions ◽

Decision Task ◽

Transition Structure ◽

Model Based ◽

Subsequent Choice

SummaryThe anterior cingulate cortex (ACC) is implicated in learning the value of actions, but it remains poorly understood whether and how it contributes to model-based mechanisms that use action-state predictions and afford behavioural flexibility. To isolate these mechanisms, we developed a multi-step decision task for mice in which both action-state transition probabilities and reward probabilities changed over time. Calcium imaging revealed ramps of choice-selective neuronal activity, followed by an evolving representation of the state reached and trial outcome, with different neuronal populations representing reward in different states. ACC neurons represented the current action-state transition structure, whether state transitions were expected or surprising, and the predicted state given chosen action. Optogenetic inhibition of ACC blocked the influence of action-state transitions on subsequent choice, without affecting the influence of rewards. These data support a role for ACC in model-based reinforcement learning, specifically in using action-state transitions to guide subsequent choice.HighlightsA novel two-step task disambiguates model-based and model-free RL in mice.ACC represents all trial events, reward representation is contextualised by state.ACC represents action-state transition structure, predicted states, and surprise.Inhibiting ACC impedes action-state transitions from influencing subsequent choice.

Download Full-text

The Expected Value of Control: An Integrative Theory of Anterior Cingulate Cortex Function

Neuron ◽

10.1016/j.neuron.2013.07.007 ◽

2013 ◽

Vol 79 (2) ◽

pp. 217-240 ◽

Cited By ~ 895

Author(s):

Amitai Shenhav ◽

Matthew M. Botvinick ◽

Jonathan D. Cohen

Keyword(s):

Anterior Cingulate Cortex ◽

Cingulate Cortex ◽

Expected Value ◽

Anterior Cingulate ◽

Integrative Theory

Download Full-text

Pure Correlates of Exploration and Exploitation in the Human Brain

10.1101/103135 ◽

2017 ◽

Author(s):

Tommy C. Blanchard ◽

Samuel J. Gershman

Keyword(s):

Prefrontal Cortex ◽

Reinforcement Learning ◽

Human Brain ◽

Anterior Cingulate Cortex ◽

Fundamental Problem ◽

Cingulate Cortex ◽

Ventromedial Prefrontal Cortex ◽

Anterior Cingulate ◽

Exploration And Exploitation ◽

Exploration Exploitation

AbstractBalancing exploration and exploitation is a fundamental problem in reinforcement learning. Previous neuroimaging studies of the exploration-exploitation dilemma could not completely disentangle these two processes, making it difficult to unambiguously identify their neural signatures. We overcome this problem using a task in which subjects can either observe (pure exploration) or bet (pure exploitation). Insula and dorsal anterior cingulate cortex showed significantly greater activity on observe trials compared to bet trials, suggesting that these regions play a role in driving exploration. A model-based analysis of task performance suggested that subjects chose to observe until a critical evidence threshold was reached. We observed a neural signature of this evidence accumulation process in ventromedial prefrontal cortex. These findings support theories positing an important role for anterior cingulate cortex in exploration, while also providing a new perspective on the roles of insula and ventromedial prefrontal cortex.Significance StatementSitting down at a familiar restaurant, you may choose to order an old favorite or sample a new dish. In reinforcement learning theory, this is known as the exploration-exploitation dilemma. The optimal solution is known to be intractable; therefore, humans must use heuristic strategies. Behavioral studies have revealed several candidate strategies, but identifying the neural mechanisms underlying these strategies is complicated due to the fact that exploration and exploitation are not perfectly dissociable in standard tasks. Using an “observe or bet” task, we identify for the first time pure neural correlates of exploration and exploitation in the human brain.

Download Full-text