An Exploration/Exploitation Trade-off Between Mind-Wandering and Goal-Directed Thinking

Author(s):  
Chandra S. Sripada

Agents invariably face trade-offs between exploration, which increases informational stores and potentially opens up new opportunities, and exploitation, which utilizes existing informational stores to take advantage of known opportunities. This exploration/exploitation trade-off has been extensively studied in computer science and has been productively applied to multiple cognitive domains. In this chapter, this framework is extended to the ubiquitous alternation between two modes of serial thought: mind-wandering and goal-directed thought. The exploration/exploitation framework provides a new perspective on the functionality of mind-wandering and its pattern of regular switching with goal-directed thought. It also raises new hypotheses about the regulation of mind-wandering across time and differences in the propensity to mind-wander across individuals.

2021 ◽  
Author(s):  
Ketika Garg ◽  
Christopher T. Kello ◽  
Paul E Smaldino

Search requires balancing exploring for more options and exploiting the ones previously found. Individuals foraging in a group face another trade-off: whether to engage in social learning to exploit the solutions found by others or to solitarily search for unexplored solutions. Social learning can decrease the costs of finding new resources, but excessive social learning can decrease the exploration for new solutions. We study how these two trade-offs interact to influence search efficiency in a model of collective foraging under conditions of varying resource abundance, resource density, and group size. We modeled individual search strategies as Lévy walks, where a power-law exponent (μ) controlled the trade-off between exploitative and explorative movements in individual search. We modulated the trade-off between individual search and social learning using a selectivity parameter that determined how agents responded to social cues in terms of distance and likely opportunity costs. Our results show that social learning is favored in rich and clustered environments, but also that the benefits of exploiting social information are maximized by engaging in high levels of individual exploration. We show that selective use of social information can modulate the disadvantages of excessive social learning, especially in larger groups and with limited individual exploration. Finally, we found that the optimal combination of individual exploration and social learning gave rise to trajectories with μ ≈ 2 and provide support for the general optimality such patterns in search. Our work sheds light on the interplay between individual search and social learning, and has broader implications for collective search and problem-solving.


2021 ◽  
Vol 39 (4) ◽  
pp. 1-29
Author(s):  
Shijun Li ◽  
Wenqiang Lei ◽  
Qingyun Wu ◽  
Xiangnan He ◽  
Peng Jiang ◽  
...  

Static recommendation methods like collaborative filtering suffer from the inherent limitation of performing real-time personalization for cold-start users. Online recommendation, e.g., multi-armed bandit approach, addresses this limitation by interactively exploring user preference online and pursuing the exploration-exploitation (EE) trade-off. However, existing bandit-based methods model recommendation actions homogeneously. Specifically, they only consider the items as the arms, being incapable of handling the item attributes , which naturally provide interpretable information of user’s current demands and can effectively filter out undesired items. In this work, we consider the conversational recommendation for cold-start users, where a system can both ask the attributes from and recommend items to a user interactively. This important scenario was studied in a recent work  [54]. However, it employs a hand-crafted function to decide when to ask attributes or make recommendations. Such separate modeling of attributes and items makes the effectiveness of the system highly rely on the choice of the hand-crafted function, thus introducing fragility to the system. To address this limitation, we seamlessly unify attributes and items in the same arm space and achieve their EE trade-offs automatically using the framework of Thompson Sampling. Our Conversational Thompson Sampling (ConTS) model holistically solves all questions in conversational recommendation by choosing the arm with the maximal reward to play. Extensive experiments on three benchmark datasets show that ConTS outperforms the state-of-the-art methods Conversational UCB (ConUCB) [54] and Estimation—Action—Reflection model [27] in both metrics of success rate and average number of conversation turns.


Author(s):  
Roel van Dooren ◽  
Roy de Kleijn ◽  
Bernhard Hommel ◽  
Zsuzsika Sjoerds

AbstractThe exploration-exploitation trade-off shows conceptual, functional, and neural analogies with the persistence-flexibility trade-off. We investigated whether mood, which is known to modulate the persistence-flexibility balance, would similarly affect the exploration-exploitation trade-off in a foraging task. More specifically, we tested whether interindividual differences in foraging behavior can be predicted by mood-related arousal and valence. In 119 participants, we assessed mood-related interindividual differences in exploration-exploitation using a foraging task that included minimal task constraints to reduce paradigm-induced biases of individual control tendencies. We adopted the marginal value theorem as a model-based analysis approach, which approximates optimal foraging behavior by tackling the patch-leaving problem. To assess influences of mood on foraging, participants underwent either a positive or negative mood induction. Throughout the experiment, we assessed arousal and valence levels as predictors for explorative/exploitative behavior. Our mood manipulation affected participants' arousal and valence ratings as expected. Moreover, mood-related arousal was found to predict exploration while valence predicted exploitation, which only partly matched our expectations and thereby the proposed conceptual overlap with flexibility and persistence, respectively. The current study provides a first insight into how processes related to arousal and valence differentially modulate foraging behavior. Our results imply that the relationship between exploration-exploitation and flexibility-persistence is more complicated than the semantic overlap between these terms might suggest, thereby calling for further research on the functional, neural, and neurochemical underpinnings of both trade-offs.


2021 ◽  
Author(s):  
Qiu-Shi Li ◽  
Yao-Gen Shu ◽  
Wen-Bo Fu ◽  
Zhong-Can Ou-Yang ◽  
Ming Li

DNA replication is a high-fidelity information-copying processes which is realized by DNA polymerase (DNAP). The high fidelity was explained on the basis of the well-known kinetic-proofreading mechanism (KPR), under which the so-called fidelity-speed trade-off was studied theoretically. However, numerous biochemical experiments have shown that the high fidelity of DNA replication is achieved due to the initial discrimination of polymerase domain of DNAP, as well as the proofreading of the exonuclease domain of DNAP. This exonuclease-proofreading mechanism (EPR) is totally different from KPR. So the trade-off issues are worth being re-examined under EPR. In this paper, we use the first-passage method recently proposed by us to discuss the possible trade-offs in DNA replication under EPR. We show that there could be no fidelity-speed trade-off under EPR, i.e., the fidelity and the speed can be simultaneously enhanced by EPR in a large range of kinetic parameters. This provides a new perspective to understand the experimental data of the exonuclease activity of T7 DNAP and T4 DNAP. We also show that there exists the fidelity-proofreading cost trade-off, i.e., the fidelity is enhanced at the cost of increasing the futile hydrolysis of dNTP. A possible way to avoid this trade-off is to regulate the rate of DNAP translocation: slowing down the forward translocation (in the presence of the terminal mismatch) can enhance the fidelity without changing the speed and the proofreading cost. Our theoretical analysis offers deeper insights on the kinetics-function relation of DNAP.PACS numbers: 82.39.-k, 87.15.Rn, 87.16.A-


2021 ◽  
Vol 130 (3) ◽  
pp. 339-383
Author(s):  
Sara Aronowitz

Sometimes, we face choices between actions most likely to lead to valuable outcomes, and actions which put us in a better position to learn. These choices exemplify what is called the exploration/exploitation trade-off. In computer science and psychology, this trade-off has fruitfully been applied to modulating the way agents or systems make choices over time. This article extends the trade-off to belief. We can be torn between two ways of believing, one of which is expected to be more accurate in light of current evidence, whereas the other is expected to lead to more learning opportunity and accuracy in the long run. Further, it is sometimes rationally permissible to choose the latter. The article breaks down the features of action which give rise to the trade-off, and then argues that each feature applies equally well to belief. This conclusion is an instance of a systematic, foreseeable way in which what is rational to believe now depends on what one expects to be doing in the future. That is, epistemic rationality fundamentally concerns time.


2012 ◽  
Vol 11 (3) ◽  
pp. 118-126 ◽  
Author(s):  
Olive Emil Wetter ◽  
Jürgen Wegge ◽  
Klaus Jonas ◽  
Klaus-Helmut Schmidt

In most work contexts, several performance goals coexist, and conflicts between them and trade-offs can occur. Our paper is the first to contrast a dual goal for speed and accuracy with a single goal for speed on the same task. The Sternberg paradigm (Experiment 1, n = 57) and the d2 test (Experiment 2, n = 19) were used as performance tasks. Speed measures and errors revealed in both experiments that dual as well as single goals increase performance by enhancing memory scanning. However, the single speed goal triggered a speed-accuracy trade-off, favoring speed over accuracy, whereas this was not the case with the dual goal. In difficult trials, dual goals slowed down scanning processes again so that errors could be prevented. This new finding is particularly relevant for security domains, where both aspects have to be managed simultaneously.


2019 ◽  
Author(s):  
Anna Katharina Spälti ◽  
Mark John Brandt ◽  
Marcel Zeelenberg

People often have to make trade-offs. We study three types of trade-offs: 1) "secular trade-offs" where no moral or sacred values are at stake, 2) "taboo trade-offs" where sacred values are pitted against financial gain, and 3) "tragic trade-offs" where sacred values are pitted against other sacred values. Previous research (Critcher et al., 2011; Tetlock et al., 2000) demonstrated that tragic and taboo trade-offs are not only evaluated by their outcomes, but are also evaluated based on the time it took to make the choice. We investigate two outstanding questions: 1) whether the effect of decision time differs for evaluations of decisions compared to decision makers and 2) whether moral contexts are unique in their ability to influence character evaluations through decision process information. In two experiments (total N = 1434) we find that decision time affects character evaluations, but not evaluations of the decision itself. There were no significant differences between tragic trade-offs and secular trade-offs, suggesting that the decisions structure may be more important in evaluations than moral context. Additionally, the magnitude of the effect of decision time shows us that decision time, may be of less practical use than expected. We thus urge, to take a closer examination of the processes underlying decision time and its perception.


2019 ◽  
Author(s):  
Kasper Van Mens ◽  
Joran Lokkerbol ◽  
Richard Janssen ◽  
Robert de Lange ◽  
Bea Tiemens

BACKGROUND It remains a challenge to predict which treatment will work for which patient in mental healthcare. OBJECTIVE In this study we compare machine algorithms to predict during treatment which patients will not benefit from brief mental health treatment and present trade-offs that must be considered before an algorithm can be used in clinical practice. METHODS Using an anonymized dataset containing routine outcome monitoring data from a mental healthcare organization in the Netherlands (n = 2,655), we applied three machine learning algorithms to predict treatment outcome. The algorithms were internally validated with cross-validation on a training sample (n = 1,860) and externally validated on an unseen test sample (n = 795). RESULTS The performance of the three algorithms did not significantly differ on the test set. With a default classification cut-off at 0.5 predicted probability, the extreme gradient boosting algorithm showed the highest positive predictive value (ppv) of 0.71(0.61 – 0.77) with a sensitivity of 0.35 (0.29 – 0.41) and area under the curve of 0.78. A trade-off can be made between ppv and sensitivity by choosing different cut-off probabilities. With a cut-off at 0.63, the ppv increased to 0.87 and the sensitivity dropped to 0.17. With a cut-off of at 0.38, the ppv decreased to 0.61 and the sensitivity increased to 0.57. CONCLUSIONS Machine learning can be used to predict treatment outcomes based on routine monitoring data.This allows practitioners to choose their own trade-off between being selective and more certain versus inclusive and less certain.


Author(s):  
Steven Bernstein

This commentary discusses three challenges for the promising and ambitious research agenda outlined in the volume. First, it interrogates the volume’s attempts to differentiate political communities of legitimation, which may vary widely in composition, power, and relevance across institutions and geographies, with important implications not only for who matters, but also for what gets legitimated, and with what consequences. Second, it examines avenues to overcome possible trade-offs from gains in empirical tractability achieved through the volume’s focus on actor beliefs and strategies. One such trade-off is less attention to evolving norms and cultural factors that may underpin actors’ expectations about what legitimacy requires. Third, it addresses the challenge of theory building that can link legitimacy sources, (de)legitimation practices, audiences, and consequences of legitimacy across different types of institutions.


Author(s):  
Lisa Best ◽  
Kimberley Fung-Loy ◽  
Nafiesa Ilahibaks ◽  
Sara O. I. Ramirez-Gomez ◽  
Erika N. Speelman

AbstractNowadays, tropical forest landscapes are commonly characterized by a multitude of interacting institutions and actors with competing land-use interests. In these settings, indigenous and tribal communities are often marginalized in landscape-level decision making. Inclusive landscape governance inherently integrates diverse knowledge systems, including those of indigenous and tribal communities. Increasingly, geo-information tools are recognized as appropriate tools to integrate diverse interests and legitimize the voices, values, and knowledge of indigenous and tribal communities in landscape governance. In this paper, we present the contribution of the integrated application of three participatory geo-information tools to inclusive landscape governance in the Upper Suriname River Basin in Suriname: (i) Participatory 3-Dimensional Modelling, (ii) the Trade-off! game, and (iii) participatory scenario planning. The participatory 3-dimensional modelling enabled easy participation of community members, documentation of traditional, tacit knowledge and social learning. The Trade-off! game stimulated capacity building and understanding of land-use trade-offs. The participatory scenario planning exercise helped landscape actors to reflect on their own and others’ desired futures while building consensus. Our results emphasize the importance of systematically considering tool attributes and key factors, such as facilitation, for participatory geo-information tools to be optimally used and fit with local contexts. The results also show how combining the tools helped to build momentum and led to diverse yet complementary insights, thereby demonstrating the benefits of integrating multiple tools to address inclusive landscape governance issues.


Sign in / Sign up

Export Citation Format

Share Document