Comparison between model-based and non-isothermal model-free computational procedures for prediction of conversion-time curves of calcium carbonate decomposition

2019 · Vol 679 · pp. 178322
Author(s): P. Budrugeac
2020 · Vol 43
Author(s): Peter Dayan

Bayesian decision theory provides a simple formal elucidation of some of the ways that representation and representational abstraction are involved with, and exploit, both prediction and its rather distant cousin, predictive coding. Both model-free and model-based methods are involved.
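For readers unfamiliar with the formalism the abstract refers to, the following is a minimal sketch of the Bayesian decision-theoretic choice rule: infer a posterior over hidden states from an observation, then pick the action with the highest posterior expected utility. The numbers and variable names are illustrative assumptions, not taken from the paper.

```python
import numpy as np

# Minimal Bayesian decision theory sketch (illustrative assumptions only):
# posterior over states via Bayes' rule, then maximize posterior expected utility.

prior = np.array([0.7, 0.3])               # P(state)
likelihood = np.array([[0.9, 0.2],          # P(observation | state); rows = observations
                       [0.1, 0.8]])
utility = np.array([[1.0, -2.0],            # U(action, state); rows = actions
                    [0.0,  0.5]])

def choose_action(observation: int) -> int:
    # Bayes' rule: posterior is proportional to likelihood * prior
    posterior = likelihood[observation] * prior
    posterior /= posterior.sum()
    # Expected utility of each action under the posterior
    expected_utility = utility @ posterior
    return int(np.argmax(expected_utility))

print(choose_action(observation=0))  # model-based choice given the evidence
```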


2019
Author(s): Leor M. Hackel, Jeffrey Jordan Berg, Björn Lindström, David Amodio

Do habits play a role in our social impressions? To investigate the contribution of habits to the formation of social attitudes, we examined the roles of model-free and model-based reinforcement learning in social interactions—computations linked in past work to habit and planning, respectively. Participants in this study learned about novel individuals in a sequential reinforcement learning paradigm, choosing financial advisors who led them to high- or low-paying stocks. Results indicated that participants relied on both model-based and model-free learning, such that each independently predicted choice during the learning task and self-reported liking in a post-task assessment. Specifically, participants liked advisors who could provide large future rewards as well as advisors who had provided them with large rewards in the past. Moreover, participants varied in their use of model-based and model-free learning strategies, and this individual difference influenced the way in which learning related to self-reported attitudes: among participants who relied more on model-free learning, model-free social learning related more to post-task attitudes. We discuss implications for attitudes, trait impressions, and social behavior, as well as the role of habits in a memory systems model of social cognition.
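As a rough illustration of how model-free and model-based contributions are typically separated in sequential choice tasks of this kind, the sketch below combines a temporal-difference (habit-like) value with a value planned through a learned transition model. This is a generic hybrid formulation; the parameter names and weighting scheme are assumptions, not the authors' exact computational model.

```python
import numpy as np

# Hedged sketch of a hybrid model-free / model-based valuation (generic formulation).

alpha, w = 0.3, 0.5                # learning rate; mixture weight on model-based values
n_advisors, n_outcomes = 2, 2

q_mf = np.zeros(n_advisors)                                        # model-free (habit-like) values
transition = np.full((n_advisors, n_outcomes), 1.0 / n_outcomes)   # learned advisor -> outcome model
outcome_value = np.zeros(n_outcomes)                               # learned value of each outcome

def update(advisor: int, outcome: int, reward: float) -> None:
    # Model-free: temporal-difference update toward the obtained reward
    q_mf[advisor] += alpha * (reward - q_mf[advisor])
    # Model-based: update the transition model and the value of the outcome reached
    transition[advisor] *= (1 - alpha)
    transition[advisor, outcome] += alpha
    outcome_value[outcome] += alpha * (reward - outcome_value[outcome])

def hybrid_values() -> np.ndarray:
    q_mb = transition @ outcome_value        # plan through the learned model
    return w * q_mb + (1 - w) * q_mf         # weighted mixture guiding choice
```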


2021 · Vol 11 (1)
Author(s): Lieneke K. Janssen, Florian P. Mahner, Florian Schlagenhauf, Lorenz Deserno, Annette Horstmann

An amendment to this paper has been published and can be accessed via a link at the top of the paper.


Author(s): Javier Loranca, Jonathan Carlos Mayo Maldonado, Gerardo Escobar, Carlos Villarreal-Hernandez, Thabiso Maupong, ...

2021 · Vol 54 (5) · pp. 19-24
Author(s): Tyler Westenbroek, Ayush Agrawal, Fernando Castañeda, S. Shankar Sastry, Koushil Sreenath

2022 · pp. 1-12
Author(s): Shuailong Li, Wei Zhang, Huiwen Zhang, Xin Zhang, Yuquan Leng

Model-free reinforcement learning methods have been applied successfully to practical decision-making problems such as Atari games. However, these methods have inherent shortcomings, such as high variance and low sample efficiency. To improve the policy performance and sample efficiency of model-free reinforcement learning, we propose proximal policy optimization with model-based methods (PPOMM), a fusion of model-based and model-free reinforcement learning. PPOMM considers not only information from past experience but also predictive information about the future state: it adds next-state information to the objective function of the proximal policy optimization (PPO) algorithm through a model-based method. The policy is optimized with two components, the PPO error and the model-based error; the latter is used to train a latent transition model that predicts the next state. When evaluated across 49 Atari games in the Arcade Learning Environment (ALE), PPOMM outperforms the state-of-the-art PPO algorithm on most games, performing better than or on par with the original algorithm in 33 of the 49 games.
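To make the structure of such an objective concrete, here is a rough sketch of a PPO clipped surrogate loss augmented with a latent transition-model prediction error, as the abstract describes. The function names, loss coefficient, and overall structure are assumptions for illustration, not the published PPOMM implementation.

```python
import torch
import torch.nn.functional as F

# Sketch of a combined model-free + model-based objective (assumed structure):
# standard PPO clipped surrogate loss plus a latent transition-model prediction loss.

def ppo_clipped_loss(log_prob_new, log_prob_old, advantage, clip_eps=0.2):
    ratio = torch.exp(log_prob_new - log_prob_old)
    unclipped = ratio * advantage
    clipped = torch.clamp(ratio, 1 - clip_eps, 1 + clip_eps) * advantage
    return -torch.min(unclipped, clipped).mean()

def transition_model_loss(predicted_next_latent, target_next_latent):
    # Error of the latent transition model: how well it predicts the next state's encoding
    return F.mse_loss(predicted_next_latent, target_next_latent.detach())

def combined_loss(log_prob_new, log_prob_old, advantage,
                  predicted_next_latent, target_next_latent, model_coef=0.5):
    # The policy is optimized with both the PPO error and the model-based prediction error
    return (ppo_clipped_loss(log_prob_new, log_prob_old, advantage)
            + model_coef * transition_model_loss(predicted_next_latent, target_next_latent))
```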

