probabilistic reinforcement
Recently Published Documents


TOTAL DOCUMENTS

31
(FIVE YEARS 7)

H-INDEX

10
(FIVE YEARS 0)

2021 ◽  
Vol 12 (1) ◽  
pp. 7
Author(s):  
Dorota Frydecka ◽  
Błażej Misiak ◽  
Patryk Piotrowski ◽  
Tomasz Bielawski ◽  
Edyta Pawlak ◽  
...  

Schizophrenia spectrum disorders (SZ) are characterized by impairments in probabilistic reinforcement learning (RL), which is associated with dopaminergic circuitry encompassing the prefrontal cortex and basal ganglia. However, there are no studies examining dopaminergic genes with respect to probabilistic RL in SZ. Thus, the aim of our study was to examine the impact of dopaminergic genes on performance assessed by the Probabilistic Selection Task (PST) in patients with SZ in comparison to healthy control (HC) subjects. In our study, we included 138 SZ patients and 188 HC participants. Genetic analysis was performed with respect to the following genetic polymorphisms: rs4680 in COMT, rs907094 in DARP-32, rs2734839, rs936461, rs1800497, and rs6277 in DRD2, rs747302 and rs1800955 in DRD4 and rs28363170 and rs2975226 in DAT1 genes. The probabilistic RL task was completed by 59 SZ patients and 95 HC subjects. SZ patients performed significantly worse in acquiring reinforcement contingencies during the task in comparison to HCs. We found no significant association between genetic polymorphisms and RL among SZ patients; however, among HC participants with respect to the DAT1 rs28363170 polymorphism, individuals with 10-allele repeat genotypes performed better in comparison to 9-allele repeat carriers. The present study indicates the relevance of the DAT1 rs28363170 polymorphism in RL in HC participants.


2021 ◽  
Author(s):  
Virginie Patt ◽  
Daniela Palombo ◽  
Michael Esterman ◽  
Mieke Verfaellie

Simple probabilistic reinforcement learning is recognized as a striatum-based learning system, but in recent years, has also been associated with hippocampal involvement. The present study examined whether such involvement may be attributed to observation-based learning processes, running in parallel to striatum-based reinforcement learning. A computational model of observation-based learning (OL), mirroring classic models of reinforcement-based learning (RL), was constructed and applied to the neuroimaging dataset of Palombo, Hayes, Reid, & Verfaellie (2019). Hippocampal contributions to value-based learning: Converging evidence from fMRI and amnesia. Cognitive, Affective & Behavioral Neuroscience, 19(3), 523–536. Results suggested that observation-based learning processes may indeed take place concomitantly to reinforcement learning and involve activation of the hippocampus and central orbitofrontal cortex (cOFC). However, rather than independent mechanisms running in parallel, the brain correlates of the OL and RL prediction errors indicated collaboration between systems, with direct implication of the hippocampus in computations of the discrepancy between the expected and actual reinforcing values of actions. These findings are consistent with previous accounts of a role for the hippocampus in encoding the strength of observed stimulus-outcome associations, with updating of such associations through striatal reinforcement-based computations. Additionally, enhanced negative prediction error signaling was found in the anterior insula with greater use of OL over RL processes. This result may suggest an additional mode of collaboration between OL and RL systems, implicating the error monitoring network.


2021 ◽  
Vol 17 (7) ◽  
pp. e1008524
Author(s):  
Liyu Xia ◽  
Sarah L. Master ◽  
Maria K. Eckstein ◽  
Beth Baribault ◽  
Ronald E. Dahl ◽  
...  

In the real world, many relationships between events are uncertain and probabilistic. Uncertainty is also likely to be a more common feature of daily experience for youth because they have less experience to draw from than adults. Some studies suggest probabilistic learning may be inefficient in youths compared to adults, while others suggest it may be more efficient in youths in mid adolescence. Here we used a probabilistic reinforcement learning task to test how youth age 8-17 (N = 187) and adults age 18-30 (N = 110) learn about stable probabilistic contingencies. Performance increased with age through early-twenties, then stabilized. Using hierarchical Bayesian methods to fit computational reinforcement learning models, we show that all participants’ performance was better explained by models in which negative outcomes had minimal to no impact on learning. The performance increase over age was driven by 1) an increase in learning rate (i.e. decrease in integration time scale); 2) a decrease in noisy/exploratory choices. In mid-adolescence age 13-15, salivary testosterone and learning rate were positively related. We discuss our findings in the context of other studies and hypotheses about adolescent brain development.


2021 ◽  
Author(s):  
Bianca Westhoff ◽  
Neeltje E. Blankenstein ◽  
Elisabeth Schreuders ◽  
Eveline A. Crone ◽  
Anna C. K. van Duijvenvoorde

AbstractLearning which of our behaviors benefit others contributes to social bonding and being liked by others. An important period for the development of (pro)social behavior is adolescence, in which peers become more salient and relationships intensify. It is, however, unknown how learning to benefit others develops across adolescence and what the underlying cognitive and neural mechanisms are. In this functional neuroimaging study, we assessed learning for self and others (i.e., prosocial learning) and the concurring neural tracking of prediction errors across adolescence (ages 9-21, N=74). Participants performed a two-choice probabilistic reinforcement learning task in which outcomes resulted in monetary consequences for themselves, an unknown other, or no one. Participants from all ages were able to learn for themselves and others, but learning for others showed a more protracted developmental trajectory. Prediction errors for self were observed in the ventral striatum and showed no age-related differences. However, prediction error coding for others was specifically observed in the ventromedial prefrontal cortex and showed age-related increases. These results reveal insights into the computational mechanisms of learning for others across adolescence, and highlight that learning for self and others show different age-related patterns.


Author(s):  
Lisa M Wooldridge ◽  
Jack Bergman ◽  
Diego A Pizzagalli ◽  
Brian D Kangas

Abstract Background Anhedonia, the loss of pleasure in previously rewarding activities, is a prominent feature of major depressive disorder and often resistant to first-line antidepressant treatment. A paucity of translatable cross-species tasks to assess subdomains of anhedonia, including reward learning, presents a major obstacle to the development of effective therapeutics. One assay of reward learning characterized by orderly behavioral and pharmacological findings in both humans and rats is the probabilistic reward task. In this computerized task, subjects make discriminations across numerous trials in which correct responses to one alternative are rewarded more often (rich) than correct responses to the other (lean). Healthy control subjects reliably develop a response bias to the rich alternative. However, participants with major depressive disorder as well as rats exposed to chronic stress typically exhibit a blunted response bias. Methods The present studies validated a touchscreen-based probabilistic reward task for the marmoset, a small nonhuman primate with considerable translational value. First, probabilistic reinforcement contingencies were parametrically examined. Next, the effects of ketamine (1.0–10.0 mg/kg), a US Food and Drug Administration-approved rapid-acting antidepressant, and phencyclidine (0.01–0.1 mg/kg), a pharmacologically similar N-methyl-D-aspartate receptor antagonist with no known antidepressant efficacy, were evaluated. Results Increases in the asymmetry of rich:lean probabilistic contingencies produced orderly increases in response bias. Consistent with their respective clinical profiles, ketamine but not phencyclidine produced dose-related increases in response bias at doses that did not reduce task discriminability. Conclusions Collectively, these findings confirm task and pharmacological sensitivity in the marmoset, which may be useful in developing medications to counter anhedonia across neuropsychiatric disorders.


2020 ◽  
Author(s):  
Liyu Xia ◽  
Sarah L Master ◽  
Maria K Eckstein ◽  
Beth Baribault ◽  
Ronald E Dahl ◽  
...  

AbstractIn the real world, many relationships between events are uncertain and probabilistic. Uncertainty is also likely to be a more common feature of daily experience for youth because they have less experience to draw from than adults. Some studies suggests probabilistic learning may be inefficient in youth compared to adults [1], while others suggest it may be more efficient in youth that are in mid adolescence [2, 3]. Here we used a probabilistic reinforcement learning task to test how youth age 8-17 (N = 187) and adults age 18-30 (N = 110) learn about stable probabilistic contingencies. Performance increased with age through early-twenties, then stabilized. Using hierarchical Bayesian methods to fit computational reinforcement learning models, we show that all participants’ performance was better explained by models in which negative outcomes had minimal to no impact on learning. The performance increase over age was driven by 1) an increase in learning rate (i.e. decrease in integration time horizon); 2) a decrease in noisy/exploratory choices. In mid-adolescence age 13-15, salivary testosterone and learning rate were positively related. We discuss our findings in the context of other studies and hypotheses about adolescent brain development.Author summaryAdolescence is a time of great uncertainty. It is also a critical time for brain development, learning, and decision making in social and educational domains. There are currently contradictory findings about learning in adolescence. We sought to better isolate how learning from stable probabilistic contingencies changes during adolescence with a task that previously showed interesting results in adolescents. We collected a relatively large sample size (297 participants) across a wide age range (8-30), to trace the adolescent developmental trajectory of learning under stable but uncertain conditions. We found that age in our sample was positively associated with higher learning rates and lower choice exploration. Within narrow age bins, we found that higher saliva testosterone levels were associated with higher learning rates in participants age 13-15 years. These findings can help us better isolate the trajectory of maturation of core learning and decision making processes during adolescence.


2020 ◽  
Vol 264 ◽  
pp. 400-406
Author(s):  
Julia O. Linke ◽  
Georgia Koppe ◽  
Vanessa Scholz ◽  
Philipp Kanske ◽  
Daniel Durstewitz ◽  
...  

2018 ◽  
Vol 127 (8) ◽  
pp. 807-817 ◽  
Author(s):  
Snežana Urošević ◽  
Tate Halverson ◽  
Eric A. Youngstrom ◽  
Monica Luciana

IEEE Access ◽  
2018 ◽  
Vol 6 ◽  
pp. 49721-49731 ◽  
Author(s):  
Yue Zhang ◽  
Bin Song ◽  
Su Gao ◽  
Xiaojiang Du ◽  
Mohsen Guizani

Sign in / Sign up

Export Citation Format

Share Document