scholarly journals Reinforcement regulates timing variability in thalamus

eLife ◽  
2020 ◽  
Vol 9 ◽  
Author(s):  
Jing Wang ◽  
Eghbal Hosseini ◽  
Nicolas Meirhaeghe ◽  
Adam Akkad ◽  
Mehrdad Jazayeri

Learning reduces variability but variability can facilitate learning. This paradoxical relationship has made it challenging to tease apart sources of variability that degrade performance from those that improve it. We tackled this question in a context-dependent timing task requiring humans and monkeys to flexibly produce different time intervals with different effectors. We identified two opposing factors contributing to timing variability: slow memory fluctuation that degrades performance and reward-dependent exploratory behavior that improves performance. Signatures of these opposing factors were evident across populations of neurons in the dorsomedial frontal cortex (DMFC), DMFC-projecting neurons in the ventrolateral thalamus, and putative target of DMFC in the caudate. However, only in the thalamus were the performance-optimizing regulation of variability aligned to the slow performance-degrading memory fluctuations. These findings reveal how variability caused by exploratory behavior might help to mitigate other undesirable sources of variability and highlight a potential role for thalamocortical projections in this process.

2019 ◽  
Author(s):  
Jing Wang ◽  
Eghbal Hosseini ◽  
Nicolas Meirhaeghe ◽  
Adam Akkad ◽  
Mehrdad Jazayeri

AbstractLearning reduces variability but variability can facilitate learning. This paradoxical relationship has made it challenging to tease apart sources of variability that degrade performance from those that improve it. We tackled this question in a context-dependent timing task requiring humans and monkeys to flexibly produce different time intervals with different effectors. Subjects’ timing variability featured two novel and context-specific sources of variability: (1) slow memory-contingent fluctuations of the mean that degraded performance, and (2) fast reinforcement-dependent regulation of variance that improved performance. Signatures of these processes were evident across populations of neurons in multiple nodes of the cortico-basal ganglia circuits. However, only in a region of the thalamus involved in flexible control of timing were the slow performance-degrading fluctuations aligned to performance-optimizing regulation of variance. These findings provide direct evidence that the nervous system makes strategic use of exploratory variance to guard against other undesirable sources of variability.


Neuroscience ◽  
2016 ◽  
Vol 336 ◽  
pp. 1-11 ◽  
Author(s):  
Philip A. Blankenship ◽  
Sarah L. Stuebing ◽  
Shawn S. Winter ◽  
Joseph L. Cheatwood ◽  
James D. Benson ◽  
...  

Science ◽  
2019 ◽  
Vol 364 (6441) ◽  
pp. eaav8911 ◽  
Author(s):  
Morteza Sarafyazd ◽  
Mehrdad Jazayeri

Humans process information hierarchically. In the presence of hierarchies, sources of failures are ambiguous. Humans resolve this ambiguity by assessing their confidence after one or more attempts. To understand the neural basis of this reasoning strategy, we recorded from dorsomedial frontal cortex (DMFC) and anterior cingulate cortex (ACC) of monkeys in a task in which negative outcomes were caused either by misjudging the stimulus or by a covert switch between two stimulus-response contingency rules. We found that both areas harbored a representation of evidence supporting a rule switch. Additional perturbation experiments revealed that ACC functioned downstream of DMFC and was directly and specifically involved in inferring covert rule switches. These results‏ reveal the computational principles of hierarchical reasoning, as implemented by cortical circuits.


Sign in / Sign up

Export Citation Format

Share Document