scholarly journals Neurocomputational mechanisms underlying the subjective value of information

2021 ◽  
Vol 4 (1) ◽  
Author(s):  
Ariel X.-A. Goh ◽  
Daniel Bennett ◽  
Stefan Bode ◽  
Trevor T.-J. Chong

AbstractHumans have a striking desire to actively seek new information, even when it is devoid of any instrumental utility. However, the mechanisms that drive individuals’ subjective preference for information remain unclear. Here, we used fMRI to examine the processing of subjective information value, by having participants decide how much effort they were willing to trade-off for non-instrumental information. We showed that choices were best described by a model that accounted for: (1) the variability in individuals’ estimates of uncertainty, (2) their desire to reduce that uncertainty, and (3) their subjective preference for positively valenced information. Model-based analyses revealed the anterior cingulate as a key node that encodes the subjective value of information across multiple stages of decision-making – including when information was prospectively valued, and when the outcome was definitively delivered. These findings emphasise the multidimensionality of information value, and reveal the neurocomputational mechanisms underlying the variability in individuals’ desire to physically pursue informative outcomes.

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Matthew Jiwa ◽  
Patrick S. Cooper ◽  
Trevor T.-J. Chong ◽  
Stefan Bode

AbstractCuriosity pervades all aspects of human behaviour and decision-making. Recent research indicates that the value of information is determined by its propensity to reduce uncertainty, and the hedonic value of the outcomes it predicts. Previous findings also indicate a preference for options that are freely chosen, compared to equivalently valued alternatives that are externally assigned. Here, we asked whether the value of information also varies as a function of self- or externally-imposed choices. Participants rated their preference for information that followed either a self-chosen decision, or an externally imposed condition. Our results showed that choosing a lottery significantly increased the subjective value of information about the outcome. Computational modelling indicated that this change in information-seeking behaviour was not due to changes in the subjective probability of winning, but instead reflected an independent effect of choosing on the value of resolving uncertainty. These results demonstrate that agency over a prospect is an important source of information value.


2016 ◽  
Author(s):  
Miriam C Klein-Flügge ◽  
Steven W Kennerley ◽  
Karl Friston ◽  
Sven Bestmann

AbstractIntegrating costs and benefits is crucial for optimal decision-making. While much is known about decisions that involve outcome-related costs (e.g., delay, risk), many of our choices are attached to actions and require an evaluation of the associated motor costs. Yet how the brain incorporates motor costs into choices remains largely unclear. We used human functional magnetic resonance imaging during choices involving monetary reward and physical effort to identify brain regions that serve as a choice comparator for effort-reward trade-offs. By independently varying both options' effort and reward levels, we were able to identify the neural signature of a comparator mechanism. A network involving supplementary motor area (SMA) and the caudal portion of dorsal anterior cingulate cortex (dACC) encoded the difference in reward (positively) and effort levels (negatively) between chosen and unchosen choice options. We next modelled effort-discounted subjective values using a novel behavioural model. This revealed that the same network of regions involving dACC and SMA encoded the difference between the chosen and unchosen options' subjective values, and that activity was best described using a concave model of effort-discounting. In addition, this signal reflected how precisely value determined participants' choices. By contrast, separate signals in SMA and ventro-medial PFC (vmPFC) correlated with participants' tendency to avoid effort and seek reward, respectively. This suggests that the critical neural signature of decision-making for choices involving motor costs is found in human cingulate cortex and not vmPFC as typically reported for outcome-based choice. Furthermore, distinct frontal circuits ‘drive’ behaviour towards reward-maximization and effort-minimization.Significance StatementThe neural processes that govern the trade-off between expected benefits and motor costs remain largely unknown. This is striking because energetic requirements play an integral role in our day-to-day choices and instrumental behaviour, and a diminished willingness to exert effort is a characteristic feature of a range of neurological disorders. We use a new behavioural characterization of how humans trade-off reward-maximization with effort-minimization to examine the neural signatures that underpin such choices, using BOLD MRI neuroimaging data. We find the critical neural signature of decision-making, a signal that reflects the comparison of value between choice options, in human cingulate cortex, whereas two distinct brain circuits ‘drive’ behaviour towards reward-maximization or effort-minimization.


2018 ◽  
Vol 115 (22) ◽  
pp. E5233-E5242 ◽  
Author(s):  
Amanda R. Arulpragasam ◽  
Jessica A. Cooper ◽  
Makiah R. Nuutinen ◽  
Michael T. Treadway

We are presented with choices each day about how to invest our effort to achieve our goals. Critically, these decisions must frequently be made under conditions of incomplete information, where either the effort required or possible reward to be gained is uncertain. Such choices therefore require the development of potential value estimates to guide effortful goal-directed behavior. To date, however, the neural mechanisms for this expectation process are unknown. Here, we used computational fMRI during an effort-based decision-making task where trial-wise information about effort costs and reward magnitudes was presented separately over time, thereby allowing us to model distinct effort/reward computations as choice-relevant information unfolded. We found that ventromedial prefrontal cortex (vmPFC) encoded expected subjective value. Further, activity in dorsal anterior cingulate (dACC) and anterior insula (aI) reflected both effort discounting as well as a subjective value prediction error signal derived from trial history. While prior studies have identified these regions as being involved in effort-based decision making, these data demonstrate their specific role in the formation and maintenance of subjective value estimates as relevant information becomes available.


2020 ◽  
Author(s):  
Maya Zhe Wang ◽  
Benjamin Y. Hayden

ABSTRACTCuriosity refers to a desire for information that is not driven by immediate strategic or instrumental concerns. Latent earning refers to a form of learning that is not directly driven by standard reinforcement learning processes. We propose that curiosity serves the purpose of motivating latent learning. Thus, while latent learning is often treated as an incidental or passive process, in practice it most often reflects a strong evolved pressure to consume large amounts of information. That large volume of information in turn allows curious decision makers to generate sophisticated representations of the structure of their environment, known as cognitive maps. Cognitive maps facilitate adaptive and flexible behavior while maintaining its adaptivity and flexibility via map updates based on new information. Here we describe data supporting the idea that orbitofrontal cortex (OFC) and dorsal anterior cingulate cortex (dACC) play complementary roles in curiosity-driven learning. Specifically, we propose that (1) OFC tracks the innate value of information and incorporates new information into a detailed cognitive map; and (2) dACC tracks the environmental demands and information availability to then use the cognitive map for guiding behavior.


2019 ◽  
Author(s):  
Karima Chakroun ◽  
David Mathar ◽  
Antonius Wiehler ◽  
Florian Ganzer ◽  
Jan Peters

SummaryA central issue in reinforcement learning and decision-making is whether to exploit knowledge of reward values, or to explore novel options. Although it is widely hypothesized that dopamine neurotransmission plays a key role in regulating this balance, causal evidence for a role of dopamine in human exploration is still lacking. Here, we use a combination of computational modeling, pharmacological intervention and functional magnetic resonance imaging (fMRI) to test for a causal effect of dopamine transmission on the exploration-exploitation trade-off in humans. 31 healthy male subjects performed a restless four-armed bandit task in a within-subjects design under three drug conditions: 150mg of the dopamine precursor L-dopa, 2mg of the D2 receptor antagonist haloperidol, and placebo. In all conditions, choice behavior was best explained by an extension of an established Bayesian learning model accounting for perseveration, uncertainty-based exploration and random exploration. Uncertainty-based exploration was attenuated under L-dopa compared to placebo and haloperidol. There was no evidence for a modulation of prediction error signaling or categorical effects of exploration/exploitation under L-dopa, whereas model-based fMRI revealed that L-dopa attenuated neural representations of overall uncertainty in insula and dorsal anterior cingulate cortex. Our results highlight the computational role of these regions in exploration and suggest that dopamine modulates exploration by modulating how this circuit tracks accumulating uncertainty during decision-making.


Author(s):  
James M. McKendry ◽  
Thomas P. Enderwick ◽  
Paul C. Harrison

Following formulation of a subjective value model and laboratory validation of some of its key assumptions, the model was applied to the airborne antisubmarine warfare situation by pooling judgments of experienced personnel selected from operational airborne-antisubmarine-warfare squadrons. When scaled, these judgments permitted structuring of three sets of airborne-antisubmarine-warfare problems which varied in terms of the perceived value of information provided to crews. The dependent variable was adequacy of performance on realistic exercises in a training simulator. Personnel who made the original subjective judgments and others as well were employed. The proportion of search area remaining per unit time decreased as a linear function of perceived information value as predicted. The subjective model accounted for approximately 90% of the observed between-group variance, thereby demonstrating its efficacy in a limited real-world situation.


2019 ◽  
Author(s):  
Allison D. Shapiro ◽  
Scott T. Grafton

AbstractTwo fundamental goals of decision making are to select actions that maximize rewards while minimizing costs and to have strong confidence in the accuracy of a judgment. Neural signatures of these two forms of value: the subjective value (SV) of choice alternatives and the value of the judgment (confidence), have both been observed in ventromedial prefrontal cortex (vmPFC). However, the relationship between these dual value signals and their relative time courses are unknown. We recorded fMRI while 28 men and women performed a two-phase Ap-Av task with mixed-outcomes of monetary rewards paired with painful shock stimuli. Neural responses were measured during offer valuation (offer phase) and choice valuation (commit phase) and analyzed with respect to observed decision outcomes, model-estimated SV and confidence. During the offer phase, vmPFC tracked SV and decision outcomes, but it not confidence. During the commit phase, vmPFC tracked confidence, computed as the quadratic extension of SV, but it bore no significant relationship with the offer valuation itself, nor the decision. In fact, vmPFC responses from the commit phase were selective for confidence even for rejected offers, wherein confidence and SV were inversely related. Conversely, activation of the cognitive control network, including within lateral prefrontal cortex (lPFC) and dorsal anterior cingulate cortex (dACC) was associated with ambivalence, during both the offer and commit phases. Taken together, our results reveal complementary representations in vmPFC during value-based decision making that temporally dissociate such that offer valuation (SV) emerges before decision valuation (confidence).


2021 ◽  
Vol 8 ◽  
Author(s):  
Meng Xia ◽  
Tom Carruthers ◽  
Richard Kindong ◽  
Libin Dai ◽  
Zhe Geng ◽  
...  

Fisheries researchers have focused on the value of information (VOI) in fisheries management and trade-offs since scientists and managers realized that information from different resources has different contribution in the management process. We picked seven indicators, which are log-normal annual catch observation error (Cobs), annual catch observation bias (Cbias), log-normal annual index observation error (Iobs), maximum length observation bias (Linfbias), observed natural mortality rate bias (Mbias), observed von Bertalanffy growth parameter K bias (Kbias), and catch-at-age sample size (CAA_nsamp), and built operating models (OMs) to simulate fisheries dynamics, and then applied management strategy evaluation (MSE). Relative yield is chosen as the result to evaluate the contribution of the seven indicators. Within the parameter range, there was not much information value reflected from fisheries-dependent parameters including Cobs, Cbias, and Iobs. On the other hand, for fisheries-independent parameters such as Kbias, Mbias, and Linfbias, similar tendency of the information value was showed in the results, in which the relative yield goes down from the upper bound to the lower bound of the interval. CAA_nsamp had no impact on the yield after over 134 individuals. The VOI analysis contributes to the trade-offs in the decision-making process. Information with more value is more worthy to collect in case of waste of time and money so that we could make the best use of scientific effort. But we still need to improve the simulation process such as enhancing the diversity and predictability in an OM. More parameters are on the way to be tested in order to collect optimum information for management and decision-making.


Author(s):  
Bahador Bahrami

Evidence for and against the idea that “two heads are better than one” is abundant. This chapter considers the contextual conditions and social norms that predict madness or wisdom of crowds to identify the adaptive value of collective decision-making beyond increased accuracy. Similarity of competence among members of a collective impacts collective accuracy, but interacting individuals often seem to operate under the assumption that they are equally competent even when direct evidence suggest the opposite and dyadic performance suffers. Cross-cultural data from Iran, China, and Denmark support this assumption of similarity (i.e., equality bias) as a sensible heuristic that works most of the time and simplifies social interaction. Crowds often trade off accuracy for other collective benefits such as diffusion of responsibility and reduction of regret. Consequently, two heads are sometimes better than one, but no-one holds the collective accountable, not even for the most disastrous of outcomes.


Sign in / Sign up

Export Citation Format

Share Document