An Exploration/Exploitation Trade-off Between Mind-Wandering and Goal-Directed Thinking

Mapping Intimacies ◽

10.1093/oxfordhb/9780190464745.013.28 ◽

2018 ◽

Author(s):

Chandra S. Sripada

Keyword(s):

Computer Science ◽

Mind Wandering ◽

Trade Off ◽

Cognitive Domains ◽

Trade Offs ◽

Directed Thinking ◽

New Perspective ◽

Exploration Exploitation ◽

Directed Thought

Agents invariably face trade-offs between exploration, which increases informational stores and potentially opens up new opportunities, and exploitation, which utilizes existing informational stores to take advantage of known opportunities. This exploration/exploitation trade-off has been extensively studied in computer science and has been productively applied to multiple cognitive domains. In this chapter, this framework is extended to the ubiquitous alternation between two modes of serial thought: mind-wandering and goal-directed thought. The exploration/exploitation framework provides a new perspective on the functionality of mind-wandering and its pattern of regular switching with goal-directed thought. It also raises new hypotheses about the regulation of mind-wandering across time and differences in the propensity to mind-wander across individuals.

Download Full-text

Individual exploration and selective social learning: Balancing exploration-exploitation trade-offs in collective foraging

10.1101/2021.11.10.468137 ◽

2021 ◽

Author(s):

Ketika Garg ◽

Christopher T. Kello ◽

Paul E Smaldino

Keyword(s):

Social Learning ◽

Social Information ◽

Social Cues ◽

Optimal Combination ◽

Trade Off ◽

Trade Offs ◽

Lévy Walks ◽

Power Law Exponent ◽

Collective Foraging ◽

Exploration Exploitation

Search requires balancing exploring for more options and exploiting the ones previously found. Individuals foraging in a group face another trade-off: whether to engage in social learning to exploit the solutions found by others or to solitarily search for unexplored solutions. Social learning can decrease the costs of finding new resources, but excessive social learning can decrease the exploration for new solutions. We study how these two trade-offs interact to influence search efficiency in a model of collective foraging under conditions of varying resource abundance, resource density, and group size. We modeled individual search strategies as Lévy walks, where a power-law exponent (μ) controlled the trade-off between exploitative and explorative movements in individual search. We modulated the trade-off between individual search and social learning using a selectivity parameter that determined how agents responded to social cues in terms of distance and likely opportunity costs. Our results show that social learning is favored in rich and clustered environments, but also that the benefits of exploiting social information are maximized by engaging in high levels of individual exploration. We show that selective use of social information can modulate the disadvantages of excessive social learning, especially in larger groups and with limited individual exploration. Finally, we found that the optimal combination of individual exploration and social learning gave rise to trajectories with μ ≈ 2 and provide support for the general optimality such patterns in search. Our work sheds light on the interplay between individual search and social learning, and has broader implications for collective search and problem-solving.

Download Full-text

Seamlessly Unifying Attributes and Items: Conversational Recommendation for Cold-start Users

ACM Transactions on Information Systems ◽

10.1145/3446427 ◽

2021 ◽

Vol 39 (4) ◽

pp. 1-29

Author(s):

Shijun Li ◽

Wenqiang Lei ◽

Qingyun Wu ◽

Xiangnan He ◽

Peng Jiang ◽

...

Keyword(s):

State Of The Art ◽

Cold Start ◽

User Preference ◽

Inherent Limitation ◽

Trade Off ◽

Thompson Sampling ◽

Trade Offs ◽

Benchmark Datasets ◽

Item Attributes ◽

Exploration Exploitation

Static recommendation methods like collaborative filtering suffer from the inherent limitation of performing real-time personalization for cold-start users. Online recommendation, e.g., multi-armed bandit approach, addresses this limitation by interactively exploring user preference online and pursuing the exploration-exploitation (EE) trade-off. However, existing bandit-based methods model recommendation actions homogeneously. Specifically, they only consider the items as the arms, being incapable of handling the item attributes , which naturally provide interpretable information of user’s current demands and can effectively filter out undesired items. In this work, we consider the conversational recommendation for cold-start users, where a system can both ask the attributes from and recommend items to a user interactively. This important scenario was studied in a recent work [54]. However, it employs a hand-crafted function to decide when to ask attributes or make recommendations. Such separate modeling of attributes and items makes the effectiveness of the system highly rely on the choice of the hand-crafted function, thus introducing fragility to the system. To address this limitation, we seamlessly unify attributes and items in the same arm space and achieve their EE trade-offs automatically using the framework of Thompson Sampling. Our Conversational Thompson Sampling (ConTS) model holistically solves all questions in conversational recommendation by choosing the arm with the maximal reward to play. Extensive experiments on three benchmark datasets show that ConTS outperforms the state-of-the-art methods Conversational UCB (ConUCB) [54] and Estimation—Action—Reflection model [27] in both metrics of success rate and average number of conversation turns.

Download Full-text

The exploration-exploitation trade-off in a foraging task is affected by mood-related arousal and valence

Cognitive Affective & Behavioral Neuroscience ◽

10.3758/s13415-021-00917-6 ◽

2021 ◽

Author(s):

Roel van Dooren ◽

Roy de Kleijn ◽

Bernhard Hommel ◽

Zsuzsika Sjoerds

Keyword(s):

Foraging Behavior ◽

Mood Induction ◽

Negative Mood ◽

Interindividual Differences ◽

Trade Off ◽

Marginal Value ◽

Trade Offs ◽

Foraging Task ◽

Exploration Exploitation ◽

Arousal And Valence

AbstractThe exploration-exploitation trade-off shows conceptual, functional, and neural analogies with the persistence-flexibility trade-off. We investigated whether mood, which is known to modulate the persistence-flexibility balance, would similarly affect the exploration-exploitation trade-off in a foraging task. More specifically, we tested whether interindividual differences in foraging behavior can be predicted by mood-related arousal and valence. In 119 participants, we assessed mood-related interindividual differences in exploration-exploitation using a foraging task that included minimal task constraints to reduce paradigm-induced biases of individual control tendencies. We adopted the marginal value theorem as a model-based analysis approach, which approximates optimal foraging behavior by tackling the patch-leaving problem. To assess influences of mood on foraging, participants underwent either a positive or negative mood induction. Throughout the experiment, we assessed arousal and valence levels as predictors for explorative/exploitative behavior. Our mood manipulation affected participants' arousal and valence ratings as expected. Moreover, mood-related arousal was found to predict exploration while valence predicted exploitation, which only partly matched our expectations and thereby the proposed conceptual overlap with flexibility and persistence, respectively. The current study provides a first insight into how processes related to arousal and valence differentially modulate foraging behavior. Our results imply that the relationship between exploration-exploitation and flexibility-persistence is more complicated than the semantic overlap between these terms might suggest, thereby calling for further research on the functional, neural, and neurochemical underpinnings of both trade-offs.

Download Full-text

The possible fidelity-speed-proofreading cost trade-offs in DNA replication due to the exonuclease proofreading

10.1101/2021.02.18.431768 ◽

2021 ◽

Author(s):

Qiu-Shi Li ◽

Yao-Gen Shu ◽

Wen-Bo Fu ◽

Zhong-Can Ou-Yang ◽

Ming Li

Keyword(s):

Dna Replication ◽

High Fidelity ◽

Trade Off ◽

Slowing Down ◽

Polymerase Domain ◽

Trade Offs ◽

New Perspective ◽

The Cost ◽

Kinetic Proofreading ◽

Hydrolysis Of

DNA replication is a high-fidelity information-copying processes which is realized by DNA polymerase (DNAP). The high fidelity was explained on the basis of the well-known kinetic-proofreading mechanism (KPR), under which the so-called fidelity-speed trade-off was studied theoretically. However, numerous biochemical experiments have shown that the high fidelity of DNA replication is achieved due to the initial discrimination of polymerase domain of DNAP, as well as the proofreading of the exonuclease domain of DNAP. This exonuclease-proofreading mechanism (EPR) is totally different from KPR. So the trade-off issues are worth being re-examined under EPR. In this paper, we use the first-passage method recently proposed by us to discuss the possible trade-offs in DNA replication under EPR. We show that there could be no fidelity-speed trade-off under EPR, i.e., the fidelity and the speed can be simultaneously enhanced by EPR in a large range of kinetic parameters. This provides a new perspective to understand the experimental data of the exonuclease activity of T7 DNAP and T4 DNAP. We also show that there exists the fidelity-proofreading cost trade-off, i.e., the fidelity is enhanced at the cost of increasing the futile hydrolysis of dNTP. A possible way to avoid this trade-off is to regulate the rate of DNAP translocation: slowing down the forward translocation (in the presence of the terminal mismatch) can enhance the fidelity without changing the speed and the proofreading cost. Our theoretical analysis offers deeper insights on the kinetics-function relation of DNAP.PACS numbers: 82.39.-k, 87.15.Rn, 87.16.A-

Download Full-text

Exploring by Believing

The Philosophical Review ◽

10.1215/00318108-8998825 ◽

2021 ◽

Vol 130 (3) ◽

pp. 339-383

Author(s):

Sara Aronowitz

Keyword(s):

Computer Science ◽

The Other ◽

Epistemic Rationality ◽

Current Evidence ◽

Trade Off ◽

Learning Opportunity ◽

Long Run ◽

The Future ◽

Exploration Exploitation ◽

Over Time

Sometimes, we face choices between actions most likely to lead to valuable outcomes, and actions which put us in a better position to learn. These choices exemplify what is called the exploration/exploitation trade-off. In computer science and psychology, this trade-off has fruitfully been applied to modulating the way agents or systems make choices over time. This article extends the trade-off to belief. We can be torn between two ways of believing, one of which is expected to be more accurate in light of current evidence, whereas the other is expected to lead to more learning opportunity and accuracy in the long run. Further, it is sometimes rationally permissible to choose the latter. The article breaks down the features of action which give rise to the trade-off, and then argues that each feature applies equally well to belief. This conclusion is an instance of a systematic, foreseeable way in which what is rational to believe now depends on what one expects to be doing in the future. That is, epistemic rationality fundamentally concerns time.

Download Full-text

Dual Goals for Speed and Accuracy on the Same Performance Task

Journal of Personnel Psychology ◽

10.1027/1866-5888/a000063 ◽

2012 ◽

Vol 11 (3) ◽

pp. 118-126 ◽

Cited By ~ 3

Author(s):

Olive Emil Wetter ◽

Jürgen Wegge ◽

Klaus Jonas ◽

Klaus-Helmut Schmidt

Keyword(s):

Memory Scanning ◽

Performance Task ◽

Performance Tasks ◽

Trade Off ◽

Test Experiment ◽

Trade Offs ◽

New Finding ◽

Sternberg Paradigm ◽

Speed Accuracy ◽

Speed And Accuracy

In most work contexts, several performance goals coexist, and conflicts between them and trade-offs can occur. Our paper is the first to contrast a dual goal for speed and accuracy with a single goal for speed on the same task. The Sternberg paradigm (Experiment 1, n = 57) and the d2 test (Experiment 2, n = 19) were used as performance tasks. Speed measures and errors revealed in both experiments that dual as well as single goals increase performance by enhancing memory scanning. However, the single speed goal triggered a speed-accuracy trade-off, favoring speed over accuracy, whereas this was not the case with the dual goal. In difficult trials, dual goals slowed down scanning processes again so that errors could be prevented. This new finding is particularly relevant for security domains, where both aspects have to be managed simultaneously.

Download Full-text

The Effects of Decision Time on Perceptions of Decisions and Decision Makers in (Moral) Trade-Off Scenarios

10.31234/osf.io/37t2r ◽

2019 ◽

Author(s):

Anna Katharina Spälti ◽

Mark John Brandt ◽

Marcel Zeelenberg

Keyword(s):

Decision Process ◽

Decision Time ◽

Decision Makers ◽

Process Information ◽

Total N ◽

Trade Off ◽

Financial Gain ◽

Trade Offs ◽

Sacred Values ◽

Moral Context

People often have to make trade-offs. We study three types of trade-offs: 1) "secular trade-offs" where no moral or sacred values are at stake, 2) "taboo trade-offs" where sacred values are pitted against financial gain, and 3) "tragic trade-offs" where sacred values are pitted against other sacred values. Previous research (Critcher et al., 2011; Tetlock et al., 2000) demonstrated that tragic and taboo trade-offs are not only evaluated by their outcomes, but are also evaluated based on the time it took to make the choice. We investigate two outstanding questions: 1) whether the effect of decision time differs for evaluations of decisions compared to decision makers and 2) whether moral contexts are unique in their ability to influence character evaluations through decision process information. In two experiments (total N = 1434) we find that decision time affects character evaluations, but not evaluations of the decision itself. There were no significant differences between tragic trade-offs and secular trade-offs, suggesting that the decisions structure may be more important in evaluations than moral context. Additionally, the magnitude of the effect of decision time shows us that decision time, may be of less practical use than expected. We thus urge, to take a closer examination of the processes underlying decision time and its perception.

Download Full-text

Predicting Undesired Treatment Outcome in Mental Healthcare: Machine Learning Study (Preprint)

10.2196/preprints.17235 ◽

2019 ◽

Author(s):

Kasper Van Mens ◽

Joran Lokkerbol ◽

Richard Janssen ◽

Robert de Lange ◽

Bea Tiemens

Keyword(s):

Machine Learning ◽

Treatment Outcome ◽

Mental Health Treatment ◽

Mental Healthcare ◽

Machine Learning Algorithms ◽

Gradient Boosting ◽

Trade Off ◽

Trade Offs ◽

Outcome Monitoring ◽

Extreme Gradient Boosting

BACKGROUND It remains a challenge to predict which treatment will work for which patient in mental healthcare. OBJECTIVE In this study we compare machine algorithms to predict during treatment which patients will not benefit from brief mental health treatment and present trade-offs that must be considered before an algorithm can be used in clinical practice. METHODS Using an anonymized dataset containing routine outcome monitoring data from a mental healthcare organization in the Netherlands (n = 2,655), we applied three machine learning algorithms to predict treatment outcome. The algorithms were internally validated with cross-validation on a training sample (n = 1,860) and externally validated on an unseen test sample (n = 795). RESULTS The performance of the three algorithms did not significantly differ on the test set. With a default classification cut-off at 0.5 predicted probability, the extreme gradient boosting algorithm showed the highest positive predictive value (ppv) of 0.71(0.61 – 0.77) with a sensitivity of 0.35 (0.29 – 0.41) and area under the curve of 0.78. A trade-off can be made between ppv and sensitivity by choosing different cut-off probabilities. With a cut-off at 0.63, the ppv increased to 0.87 and the sensitivity dropped to 0.17. With a cut-off of at 0.38, the ppv decreased to 0.61 and the sensitivity increased to 0.57. CONCLUSIONS Machine learning can be used to predict treatment outcomes based on routine monitoring data.This allows practitioners to choose their own trade-off between being selective and more certain versus inclusive and less certain.

Download Full-text

Challenges in the Empirical Study of Global Governance Legitimacy

10.1093/oso/9780198826873.003.0011 ◽

2018 ◽

Cited By ~ 1

Author(s):

Steven Bernstein

Keyword(s):

Empirical Study ◽

Global Governance ◽

Research Agenda ◽

Cultural Factors ◽

Theory Building ◽

Trade Off ◽

Trade Offs ◽

Different Types ◽

Political Communities

This commentary discusses three challenges for the promising and ambitious research agenda outlined in the volume. First, it interrogates the volume’s attempts to differentiate political communities of legitimation, which may vary widely in composition, power, and relevance across institutions and geographies, with important implications not only for who matters, but also for what gets legitimated, and with what consequences. Second, it examines avenues to overcome possible trade-offs from gains in empirical tractability achieved through the volume’s focus on actor beliefs and strategies. One such trade-off is less attention to evolving norms and cultural factors that may underpin actors’ expectations about what legitimacy requires. Third, it addresses the challenge of theory building that can link legitimacy sources, (de)legitimation practices, audiences, and consequences of legitimacy across different types of institutions.

Download Full-text

Toward Inclusive Landscape Governance in Contested Landscapes: Exploring the Contribution of Participatory Tools in the Upper Suriname River Basin

Environmental Management ◽

10.1007/s00267-021-01504-8 ◽

2021 ◽

Author(s):

Lisa Best ◽

Kimberley Fung-Loy ◽

Nafiesa Ilahibaks ◽

Sara O. I. Ramirez-Gomez ◽

Erika N. Speelman

Keyword(s):

Land Use ◽

River Basin ◽

Scenario Planning ◽

Community Members ◽

Trade Off ◽

3 Dimensional ◽

Landscape Governance ◽

Information Tools ◽

Tribal Communities ◽

Trade Offs

AbstractNowadays, tropical forest landscapes are commonly characterized by a multitude of interacting institutions and actors with competing land-use interests. In these settings, indigenous and tribal communities are often marginalized in landscape-level decision making. Inclusive landscape governance inherently integrates diverse knowledge systems, including those of indigenous and tribal communities. Increasingly, geo-information tools are recognized as appropriate tools to integrate diverse interests and legitimize the voices, values, and knowledge of indigenous and tribal communities in landscape governance. In this paper, we present the contribution of the integrated application of three participatory geo-information tools to inclusive landscape governance in the Upper Suriname River Basin in Suriname: (i) Participatory 3-Dimensional Modelling, (ii) the Trade-off! game, and (iii) participatory scenario planning. The participatory 3-dimensional modelling enabled easy participation of community members, documentation of traditional, tacit knowledge and social learning. The Trade-off! game stimulated capacity building and understanding of land-use trade-offs. The participatory scenario planning exercise helped landscape actors to reflect on their own and others’ desired futures while building consensus. Our results emphasize the importance of systematically considering tool attributes and key factors, such as facilitation, for participatory geo-information tools to be optimally used and fit with local contexts. The results also show how combining the tools helped to build momentum and led to diverse yet complementary insights, thereby demonstrating the benefits of integrating multiple tools to address inclusive landscape governance issues.

Download Full-text