reward prediction error
Recently Published Documents

TOTAL DOCUMENTS: 127 (five years: 38)
H-INDEX: 34 (five years: 5)

2021
Author(s): Karel Kieslich, Vincent Valton, Jonathan Paul Roiser

In order to develop effective treatments for anhedonia we need to understand its underlying neurobiological mechanisms. Anhedonia is conceptually strongly linked to reward processing, which involves a variety of cognitive and neural operations. This article reviews the evidence for impairments in experiencing hedonic response (pleasure), reward valuation, and reward learning based on outcomes (commonly conceptualised in terms of “reward prediction error”). Synthesizing behavioural and neuroimaging findings, we examine case-control studies of patients with depression and schizophrenia, including those focusing specifically on anhedonia. Overall, there is reliable evidence that depression and schizophrenia are associated with disrupted reward processing. In contrast to the historical definition of anhedonia, there is surprisingly limited evidence for impairment in the ability to experience pleasure in depression and schizophrenia. There is some evidence that learning about reward and reward prediction error signals are impaired in depression and schizophrenia, but the literature is inconsistent. The strongest evidence is for impairments in the representation of reward value and how this is used to guide action. Future studies would benefit from focusing on impairments in reward processing specifically in anhedonic samples, including transdiagnostically, and from using designs separating different components of reward processing, formulating them in computational terms, and moving beyond cross-sectional designs to provide an assessment of causality.
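The outcome-based learning the abstract refers to ("reward prediction error") can be sketched with a minimal delta-rule update, in which the value estimate moves toward each observed outcome in proportion to the error. The function name, learning rate, and reward values below are illustrative assumptions, not taken from any of the reviewed studies.

```python
# Minimal sketch of reward learning driven by a reward prediction error
# (a Rescorla-Wagner / delta-rule update). All values are illustrative.

def update_value(value, reward, alpha=0.1):
    """One learning step: move the value estimate toward the outcome."""
    prediction_error = reward - value        # the "reward prediction error"
    return value + alpha * prediction_error

value = 0.0
for _ in range(200):                         # repeated rewarded trials
    value = update_value(value, reward=1.0)

# With enough trials the value estimate converges toward the true reward,
# so the prediction error shrinks toward zero.
```

Impaired reward learning, in these terms, would show up as an abnormal learning rate or a blunted prediction-error signal rather than an inability to experience the reward itself.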


2021
Author(s): Anthony M.V. Jakob, John G. Mikhael, Allison E. Hamilos, John A. Assad, Samuel J. Gershman

The role of dopamine as a reward prediction error signal in reinforcement learning tasks has been well established over the past decades. Recent work has shown that the reward prediction error interpretation can also account for the effects of dopamine on interval timing by controlling the speed of subjective time. According to this theory, the timing of the dopamine signal relative to reward delivery dictates whether subjective time speeds up or slows down: early dopamine signals speed up subjective time and late signals slow it down. To test this bidirectional prediction, we reanalyzed measurements of dopaminergic neurons in the substantia nigra pars compacta of mice performing a self-timed movement task. Using the slope of ramping dopamine activity as a read-out of subjective time speed, we found that trial-by-trial changes in the slope could be predicted from the timing of dopamine activity on the previous trial. This result provides a key piece of evidence supporting a unified computational theory of reinforcement learning and interval timing.


2021
Vol 11 (1)
Author(s): Harry J. Stewardson, Thomas D. Sambrook

Abstract: Reinforcement learning in humans and other animals is driven by reward prediction errors: deviations between the amount of reward or punishment initially expected and that which is obtained. Temporal difference methods of reinforcement learning generate this reward prediction error at the earliest time at which a revision in reward or punishment likelihood is signalled, for example by a conditioned stimulus. Midbrain dopamine neurons, believed to compute reward prediction errors, generate this signal in response to both conditioned and unconditioned stimuli, as predicted by temporal difference learning. Electroencephalographic recordings of human participants have suggested that a component named the feedback-related negativity (FRN) is generated when this signal is carried to the cortex. If this is so, the FRN should be expected to respond equivalently to conditioned and unconditioned stimuli. However, very few studies have attempted to measure the FRN's response to unconditioned stimuli. The present study attempted to elicit the FRN in response to a primary aversive stimulus (electric shock) using a design that varied reward prediction error while holding physical intensity constant. The FRN was strongly elicited, but earlier and more transiently than typically seen, suggesting that it may incorporate processes other than the midbrain dopamine system.
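The temporal-difference property the abstract describes, where the prediction error migrates to the earliest predictive cue, can be sketched with a tabular TD(0) update over a single trial structure: a conditioned stimulus (CS) arrives unpredictably and a reward follows a fixed number of steps later. The trial length, learning rate, and the clamped zero baseline before CS onset are illustrative assumptions.

```python
# Hedged sketch of temporal-difference (TD) learning in which the
# prediction error shifts from the reward to the conditioned stimulus.
# Parameters and trial structure are illustrative, not from the study.

ALPHA = 0.2
N = 5                         # time steps from CS onset to reward
V = [0.0] * (N + 1)           # V[0] = CS state ... V[N] = terminal state

def run_trial(V):
    """One pass through the trial; returns the TD error at each step."""
    # The CS arrives unpredictably, so the pre-CS prediction is 0 and
    # the TD error at CS onset equals the learned value of the CS state.
    deltas = [V[0] - 0.0]
    for t in range(N):
        r = 1.0 if t == N - 1 else 0.0        # reward on the final step
        delta = r + V[t + 1] - V[t]           # TD prediction error (gamma = 1)
        V[t] += ALPHA * delta
        deltas.append(delta)
    return deltas

for _ in range(500):
    deltas = run_trial(V)

# After training: a large TD error at CS onset (the earliest predictor),
# and a near-zero TD error at reward delivery, which is fully predicted.
```

This is the sense in which a TD-like FRN should respond to conditioned as well as unconditioned stimuli: the same signal appears wherever reward likelihood is first revised.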


2021
Vol 67, pp. 123-130
Author(s): Talia N. Lerner, Ashley L. Holloway, Jillian L. Seiler

eLife
2020
Vol 9
Author(s): Bastien Blain, Robb B. Rutledge

Subjective well-being or happiness is often associated with wealth. Recent studies suggest that momentary happiness is associated with reward prediction error, the difference between experienced and predicted reward, a key component of adaptive behaviour. We tested subjects in a reinforcement learning task in which reward size and probability were uncorrelated, allowing us to dissociate between the contributions of reward and learning to happiness. Using computational modelling, we found convergent evidence across stable and volatile learning tasks that happiness, like behaviour, is sensitive to learning-relevant variables (i.e. probability prediction error). Unlike behaviour, happiness is not sensitive to learning-irrelevant variables (i.e. reward prediction error). Increasing volatility reduces how many past trials influence behaviour but not happiness. Finally, depressive symptoms reduce happiness more in volatile than stable environments. Our results suggest that how we learn about our world may be more important for how we feel than the rewards we actually receive.
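Models of momentary happiness in this literature typically express happiness as a baseline plus an exponentially decaying weighted sum of recent task variables, such as prediction errors. The sketch below illustrates that general form only; the weights, decay rate, and inputs are illustrative assumptions, not the model fitted in the study.

```python
# Illustrative sketch of a momentary-happiness model: baseline plus a
# geometrically decaying sum of recent prediction errors. All parameter
# values are made up for illustration.

def momentary_happiness(prediction_errors, w0=50.0, w=10.0, decay=0.6):
    """Happiness after each trial, on an arbitrary 0-100-style scale."""
    happiness = []
    for t in range(len(prediction_errors)):
        # Recent prediction errors dominate; older ones decay geometrically.
        influence = sum(decay ** (t - j) * prediction_errors[j]
                        for j in range(t + 1))
        happiness.append(w0 + w * influence)
    return happiness

h = momentary_happiness([1.0, -0.5, 0.0, 1.0])
```

Under this form, the study's contrast amounts to asking which inputs (probability prediction errors versus reward prediction errors) carry weight in the decaying sum.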


2020
Vol 11 (1)
Author(s): Alexandre Y. Dombrovski, Beatriz Luna, Michael N. Hallquist

Abstract: When making decisions, should one exploit known good options or explore potentially better alternatives? Exploration of spatially unstructured options depends on the neocortex, striatum, and amygdala. In natural environments, however, better options often cluster together, forming structured value distributions. The hippocampus binds reward information into allocentric cognitive maps to support navigation and foraging in such spaces. Here we report that human posterior hippocampus (PH) invigorates exploration while anterior hippocampus (AH) supports the transition to exploitation on a reinforcement learning task with a spatially structured reward function. These dynamics depend on differential reinforcement representations in the PH and AH. Whereas local reward prediction error signals are early and phasic in the PH tail, global value maximum signals are delayed and sustained in the AH body. AH compresses reinforcement information across episodes, updating the location and prominence of the value maximum and displaying goal cell-like ramping activity when navigating toward it.
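The explore/exploit trade-off the abstract opens with is often formalised as softmax (temperature-controlled) action selection: a high temperature spreads choice across options (exploration), while a low temperature concentrates it on the best-known option (exploitation). This is a generic illustration of the trade-off, not the model used in the study; all values are made up.

```python
# Generic sketch of the explore/exploit trade-off via softmax choice.
# High temperature -> near-uniform choice; low temperature -> near-greedy.

import math
import random

def softmax_choice(values, temperature):
    """Pick an option index; also return the choice probabilities."""
    weights = [math.exp(v / temperature) for v in values]
    total = sum(weights)
    probs = [w / total for w in weights]
    r, acc = random.random(), 0.0
    for i, p in enumerate(probs):
        acc += p
        if r < acc:
            return i, probs
    return len(values) - 1, probs

values = [0.2, 0.8, 0.5]                                     # learned values
_, explore_probs = softmax_choice(values, temperature=5.0)   # exploration
_, exploit_probs = softmax_choice(values, temperature=0.05)  # exploitation
```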

