An Optimal Policy Model for Concurrent Uncertainty Estimation During Decision Making

2021 ◽  
Author(s):  
Xiaodong Li ◽  
Ruixin Su ◽  
Yilin Chen ◽  
Tianming Yang

We often postpone or even avoid making decisions when we feel uncertain. Uncertainty estimation is not an afterthought of decision making but a dynamic process that accompanies decision making in parallel and affects decision making. To study concurrent uncertainty estimation during decision making, we adapted the classic random-dots motion direction discrimination task to allow a reaction-time measure of uncertainty responses. Subjects were asked to judge whether a patch of random dots was moving left or right. In addition, they could seek assistance by choosing to look at a second stimulus that had the same direction but high coherence any time during the task. The task allows us to measure the reaction time of both the perceptual decisions and the uncertainty responses. The subjects were more likely to choose the uncertainty response when the motion coherence was low, while their reaction times of the uncertainty responses showed individual variations. To account for the subjects' behavior, we created an optimal policy decision model in which decisions are based on the value functions computed from the accumulated evidence using a drift-diffusion process. Model simulations captured key features of the subjects' choices, reaction times, and proportions of uncertainty responses. Varying model parameters explained individual variations in the subjects and the correlations between decision accuracy, proportions of uncertainty responses, and reaction times at the individual level. Our model links perceptual decisions and value-based decisions and indicates that concurrent uncertainty estimation may be based on comparisons between values of uncertainty responses and perceptual decisions, both of which may be derived from the same evidence accumulation process during decision making. It provides a theoretical framework for future investigations, including the ones that aim at the underlying neural mechanism.

2021 ◽  
Author(s):  
Teppei Matsui ◽  
Yoshiki Hattori ◽  
Kaho Tsumura ◽  
Ryuta Aoki ◽  
Masaki Takeda ◽  
...  

In real life, humans make decisions by taking into account multiple independent factors, such as delay and probability. Cognitive psychology suggests that cognitive control mechanisms play a key role when facing such complex task conditions. However, in value-based decision-making, it still remains unclear to what extent cognitive control mechanisms become essential when the task condition is complex. In this study, we investigated decision-making behaviors and underlying neural mechanisms using a multifactor gambling task where participants simultaneously considered probability and delay. Decision-making behavior in the multifactor task was modulated by both probability and delay. The behavioral effect of probability was stronger than delay, consistent with previous studies. Furthermore, in a subset of conditions that recruited fronto-parietal activations, reaction times were paradoxically elongated despite lower probabilistic uncertainty. Notably, such a reaction time elongation did not occur in control tasks involving single factors. Meta-analysis of brain activations suggested an association between the paradoxical increase of reaction time and strategy switching. Together, these results suggest a novel aspect of complex value-based decision-makings that is strongly influenced by fronto-parietal cognitive control.


2021 ◽  
Author(s):  
Adetola Adegbola ◽  
Arnold Yuan

Deterioration is a major problem facing engineering structures, systems and components (SSCs). To maintain the structural integrity and safe operation of such SSCs all through their service life, it is important to understand how degradation phenomena progress over time and space. Hence degradation modelling has been increasingly used to model existing deterioration, predict future deterioration as well as provide input for infrastructure management in terms of inspection and maintenance decision making. As deterioration is known to be random, modelling of spatial and temporal uncertainty remains a crucial challenge for infrastructure asset professionals. The main objective of the thesis is to develop sophisticated models for characterizing spatial and temporal uncertainties in deterioration modelling with a view to enhancing decision making under uncertainty. The thesis proposes a two-dimensional copula-based gamma distributed random field for the spatial uncertainties, and a copula-based multivariate gamma process model to characterize stochastic dependence of multiple degradation phenomena. Techniques for estimating the model parameters and simulating the field or process, prediction of the remaining lifetime distribution as well as condition-based maintenance optimization are also presented. To study the extreme value distribution of the random field, the thesis also presents a numerical method based on the Karhunen-Loève expansion for evaluating extrema of both one- and two-dimensional homogeneous random fields. The simulation results are benchmarked against existing analytical models for special cases. In addition, the study also investigates the effect of parameter (epistemic) uncertainty on the extreme value distribution of the field. Finally, the thesis presents a practical application of the proposed copula-based gamma field by treating the wall profile of a feeder pipe as one- and twodimensional gamma fields. The thesis demonstrates a practical application of the multivariate gamma process model to rutting, cracking, and surface roughness of highway pavements. In summary, the proposed models have advanced the knowledge and techniques of stochastic deterioration modelling in the engineering field.


2018 ◽  
Vol 72 (6) ◽  
pp. 1379-1386
Author(s):  
Arnaud Destrebecqz ◽  
Michaël Vande Velde ◽  
Estibaliz San Anton ◽  
Axel Cleeremans ◽  
Julie Bertels

In a partial reinforcement schedule where a cue repeatedly predicts the occurrence of a target in consecutive trials, reaction times to the target tend to decrease in a monotonic fashion, while participants’ expectancies for the target decrease at the same time. This dissociation between reaction times and expectancies—the so-called Perruchet effect—challenges the propositional view of learning, which posits that human conditioned responses result from conscious inferences about the relationships between events. However, whether the reaction time pattern reflects the strength of a putative cue-target link, or only non-associative processes, such as motor priming, remains unclear. To address this issue, we implemented the Perruchet procedure in a two-choice reaction time task and compared reaction time patterns in an Experimental condition, in which a tone systematically preceded a visual target, and in a Control condition, in which the onset of the two stimuli were uncoupled. Participants’ expectancies regarding the target were recorded separately in an initial block. Reaction times decreased with the succession of identical trials in both conditions, reflecting the impact of motor priming. Importantly, reaction time slopes were steeper in the Experimental than in the Control condition, indicating an additional influence of the associative strength between the two stimuli. Interestingly, slopes were less steep for participants who showed the gambler’s fallacy in the initial block. In sum, our results suggest the mutual influences of motor priming, associative strength, and expectancies on performance. They are in line with a dual-process model of learning involving both a propositional reasoning process and an automatic link-formation mechanism.


1998 ◽  
Vol 86 (2) ◽  
pp. 403-410 ◽  
Author(s):  
Nobuyuki Inui ◽  
Kan-Ichiro Suzuki

This study examined effects of practice on timing of serial reactions by 7 adolescents diagnosed with autism by using a task requiring they track a series of timed lights. The adolescents showed significantly slower and more variable mean simple reaction time than 10 normal control subjects of the same age. On a task of tracking a serial light stimulation for 4 days, on the other hand, significant effects of practice on timing of serial reactions were observed for mean serial reaction times of them. In addition, from individual variations in reaction times and anticipatory reaction times, four of seven subjects with autism showed significant effects of practice. Analysis suggested that these autistic adolescents may be chunlung together the whole series of responses and are unable to coordinate the timing of individual responses with individual sumuli. Our data indicate that at least some adolescents with autism are able to form and utilise a motor program with practice.


2019 ◽  
Author(s):  
Samuel McDougle ◽  
Anne Collins

What determines the speed of our decisions? Various models of decision-making have focused on perceptual evidence, past experience, and task complexity as important factors determining the degree of deliberation needed for a decision. Here, we build on a sequential sampling decision-making framework to develop a new model that captures a range of reaction time (RT) effects by accounting for both working memory and instrumental learning processes. The model captures choices and RTs at various stages of learning, and in learning environments with varying complexity. Moreover, the model generalizes from tasks with deterministic reward contingencies to probabilistic ones. The model succeeds in part by incorporating prior uncertainty over actions when modeling RT. This straightforward process model provides a parsimonious account of decision dynamics during instrumental learning and makes unique predictions about internal representations of action values.


2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Troy C. Dildine ◽  
Elizabeth A. Necka ◽  
Lauren Y. Atlas

AbstractSelf-report is the gold standard for measuring pain. However, decisions about pain can vary substantially within and between individuals. We measured whether self-reported pain is accompanied by metacognition and variations in confidence, similar to perceptual decision-making in other modalities. Eighty healthy volunteers underwent acute thermal pain and provided pain ratings followed by confidence judgments on continuous visual analogue scales. We investigated whether eye fixations and reaction time during pain rating might serve as implicit markers of confidence. Confidence varied across trials and increased confidence was associated with faster pain rating reaction times. The association between confidence and fixations varied across individuals as a function of the reliability of individuals’ association between temperature and pain. Taken together, this work indicates that individuals can provide metacognitive judgments of pain and extends research on confidence in perceptual decision-making to pain.


2019 ◽  
Vol 22 ◽  
Author(s):  
Adele Diederich ◽  
Wenjia Joyce Zhao

Abstract Dual process theories of decision making describe choice as the result of an automatic System 1, which is quick to activate but behaves impulsively, and a deliberative System 2, which is slower to activate but makes decisions in a rational and controlled manner. However, most existent dual process theories are verbal descriptions and do not generate testable qualitative and quantitative predictions. In this paper, we describe a formalized dynamic dual process model framework of intertemporal choice that allows for precise, experimentally testable predictions regarding choice probability and response time distributions. The framework is based on two-stage stochastic process models to account for the two postulated systems and to capture the dynamics and uncertainty involved in decision making. Using quasi closed form solutions, we illustrate how different factors (timing of System 1, time constraint, and preferences in both systems), which are reflected in the model parameters, influence qualitative and quantitative model predictions. Furthermore, we show how an existing static-deterministic model on intertemporal choice can be implemented in the framework allowing for testable predictions. The proposed framework can bring novel insights into the processes underlying intertemporal choices.


2019 ◽  
Author(s):  
Carly A Shevinsky ◽  
Pamela Reinagel

AbstractA stochastic visual motion discrimination task is widely used to study rapid decision-making in humans and animals. Among trials of the same sensory difficulty within a block of fixed decision strategy, humans and monkeys are widely reported to make more errors in the individual trials with longer reaction times. This finding has posed a challenge for the drift-diffusion model of sensory decision-making, which in its basic form predicts that errors and correct responses should have the same reaction time distributions. We previously reported that rats also violate this model prediction, but in the opposite direction: for rats, motion discrimination accuracy was highest in the trials with the longest reaction times. To rule out task differences as the cause of our divergent finding in rats, the present study tested humans and rats using the same task and analyzed their data identically. We confirmed that rats’ accuracy increased with reaction time, whereas humans’ accuracy decreased with reaction time in the same task. These results were further verified using a new temporally-local analysis method, ruling out that the observed trend was an artifact of non-stationarity in the data of either species. The main effect was found whether the signal strength (motion coherence) was varied in randomly interleaved trials or held constant within a block. The magnitude of the effects increased with motion coherence. These results provide new constraints useful for refining and discriminating among the many alternative mathematical theories of decision-making.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Lluís Hernández-Navarro ◽  
Ainhoa Hermoso-Mendizabal ◽  
Daniel Duque ◽  
Jaime de la Rocha ◽  
Alexandre Hyafil

AbstractStandard models of perceptual decision-making postulate that a response is triggered in reaction to stimulus presentation when the accumulated stimulus evidence reaches a decision threshold. This framework excludes however the possibility that informed responses are generated proactively at a time independent of stimulus. Here, we find that, in a free reaction time auditory task in rats, reactive and proactive responses coexist, suggesting that choice selection and motor initiation, commonly viewed as serial processes, are decoupled in general. We capture this behavior by a novel model in which proactive and reactive responses are triggered whenever either of two competing processes, respectively Action Initiation or Evidence Accumulation, reaches a bound. In both types of response, the choice is ultimately informed by the Evidence Accumulation process. The Action Initiation process readily explains premature responses, contributes to urgency effects at long reaction times and mediates the slowing of the responses as animals get satiated and tired during sessions. Moreover, it successfully predicts reaction time distributions when the stimulus was either delayed, advanced or omitted. Overall, these results fundamentally extend standard models of evidence accumulation in decision making by showing that proactive and reactive processes compete for the generation of responses.


2021 ◽  
Author(s):  
Adetola Adegbola ◽  
Arnold Yuan

Deterioration is a major problem facing engineering structures, systems and components (SSCs). To maintain the structural integrity and safe operation of such SSCs all through their service life, it is important to understand how degradation phenomena progress over time and space. Hence degradation modelling has been increasingly used to model existing deterioration, predict future deterioration as well as provide input for infrastructure management in terms of inspection and maintenance decision making. As deterioration is known to be random, modelling of spatial and temporal uncertainty remains a crucial challenge for infrastructure asset professionals. The main objective of the thesis is to develop sophisticated models for characterizing spatial and temporal uncertainties in deterioration modelling with a view to enhancing decision making under uncertainty. The thesis proposes a two-dimensional copula-based gamma distributed random field for the spatial uncertainties, and a copula-based multivariate gamma process model to characterize stochastic dependence of multiple degradation phenomena. Techniques for estimating the model parameters and simulating the field or process, prediction of the remaining lifetime distribution as well as condition-based maintenance optimization are also presented. To study the extreme value distribution of the random field, the thesis also presents a numerical method based on the Karhunen-Loève expansion for evaluating extrema of both one- and two-dimensional homogeneous random fields. The simulation results are benchmarked against existing analytical models for special cases. In addition, the study also investigates the effect of parameter (epistemic) uncertainty on the extreme value distribution of the field. Finally, the thesis presents a practical application of the proposed copula-based gamma field by treating the wall profile of a feeder pipe as one- and twodimensional gamma fields. The thesis demonstrates a practical application of the multivariate gamma process model to rutting, cracking, and surface roughness of highway pavements. In summary, the proposed models have advanced the knowledge and techniques of stochastic deterioration modelling in the engineering field.


Sign in / Sign up

Export Citation Format

Share Document