“Liking” as an early and editable draft of long-run affective value

PLoS Biology ◽  
2022 ◽  
Vol 20 (1) ◽  
pp. e3001476
Author(s):  
Peter Dayan

Psychological and neural distinctions between the technical concepts of “liking” and “wanting” pose important problems for motivated choice for goods. Why might we “want” something that we do not “like,” or “like” something yet be unwilling to exert effort to acquire it? Here, we suggest a framework for answering these questions through the medium of reinforcement learning. We consider “liking” to provide immediate, but preliminary and ultimately cancellable, information about the true, long-run worth of a good. Such initial estimates, viewed through the lens of what is known as potential-based shaping, help solve the temporally complex learning problems faced by animals.
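The shaping idea in the abstract can be made concrete. Below is a minimal, illustrative sketch (my own construction, not code from the paper): tabular Q-learning on a five-state chain, where the potential plays the role the abstract assigns to “liking,” supplying an immediate but cancellable estimate of long-run worth. Because the potential is zero at the terminal state, the shaped problem provably has the same optimal policy as the original (Ng, Harada, and Russell, 1999).

```python
import numpy as np

# Illustrative sketch: potential-based reward shaping in tabular Q-learning.
# States 0..4; action 0 moves left, action 1 moves right; entering state 4
# ends the episode with the true reward of 1. The shaped reward is
#   r + GAMMA * phi[s2] - phi[s],
# which only redistributes reward along the way without changing the
# optimal policy.

N_STATES, GAMMA, ALPHA, EPSILON = 5, 0.9, 0.5, 0.1
phi = np.array([0.0, 0.25, 0.5, 0.75, 0.0])  # assumed "liking" estimates

def step(s, a):
    s2 = max(0, s - 1) if a == 0 else min(N_STATES - 1, s + 1)
    reached_goal = s2 == N_STATES - 1
    return s2, (1.0 if reached_goal else 0.0), reached_goal

def train(episodes=200, seed=0):
    rng = np.random.default_rng(seed)
    q = np.zeros((N_STATES, 2))
    for _ in range(episodes):
        s, done = 0, False
        while not done:
            a = int(rng.integers(2)) if rng.random() < EPSILON else int(np.argmax(q[s]))
            s2, r, done = step(s, a)
            r += GAMMA * phi[s2] - phi[s]  # the shaping term
            target = r + (0.0 if done else GAMMA * np.max(q[s2]))
            q[s, a] += ALPHA * (target - q[s, a])
            s = s2
    return q

q = train()
policy = [int(np.argmax(q[s])) for s in range(N_STATES - 1)]  # greedy actions
```

Dropping the shaping term recovers ordinary Q-learning with the same optimal policy; the potential only hands out reward earlier, which is exactly the accelerating role the abstract proposes for “liking.”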


Author(s):  
Ivan Herreros

This chapter discusses basic concepts from control theory and machine learning to facilitate a formal understanding of animal learning and motor control. It first distinguishes between feedback and feed-forward control strategies, and then introduces the classification of machine learning applications into supervised, unsupervised, and reinforcement learning problems. Next, it links these concepts with their counterparts in the psychology of animal learning, highlighting the analogies between supervised learning and classical conditioning, between reinforcement learning and operant conditioning, and between unsupervised learning and perceptual learning. Additionally, it interprets innate and acquired actions from the standpoint of feedback vs. anticipatory and adaptive control. Finally, it argues that this framework for translating knowledge between formal and biological disciplines can serve not only to structure and advance our understanding of brain function but also to enrich engineering solutions in robot learning and control with insights from biology.


Neuron ◽  
2020 ◽  
Author(s):  
Alon Boaz Baram ◽  
Timothy Howard Muller ◽  
Hamed Nili ◽  
Mona Maria Garvert ◽  
Timothy Edward John Behrens

Author(s):  
Carlos Diuk ◽  
Michael Littman

Reinforcement learning (RL) deals with the problem of an agent that must learn how to behave so as to maximize its utility through its interactions with an environment (Sutton & Barto, 1998; Kaelbling, Littman, & Moore, 1996). Reinforcement learning problems are usually formalized as Markov Decision Processes (MDPs), which consist of a finite set of states and a finite set of actions that the agent can perform. At any given point in time, the agent is in a certain state and picks an action. It then observes the new state this action leads to and receives a reward signal. The goal of the agent is to maximize its long-term reward. In this standard formalization, no particular structure or relationship between states is assumed. However, learning in environments with extremely large state spaces is infeasible without some form of generalization. Exploiting the underlying structure of a problem can enable such generalization and has long been recognized as an important aspect of representing sequential decision tasks (Boutilier et al., 1999). Hierarchical reinforcement learning is the subfield of RL that deals with the discovery and/or exploitation of this underlying structure. Two main ideas come into play in hierarchical RL. The first is to break a task into a hierarchy of smaller subtasks, each of which can be learned faster and more easily than the whole problem. Subtasks can also be performed multiple times in the course of achieving the larger task, reusing accumulated knowledge and skills. The second is to use state abstraction within subtasks: not every subtask needs to be concerned with every aspect of the state space, so some states can be abstracted away and treated as identical for the purposes of the given subtask.
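The second idea, state abstraction within subtasks, can be sketched as follows (the subtask name and state layout are my own, purely illustrative, and not from the text):

```python
from collections import defaultdict

# Illustrative sketch: the full state is (position, has_key), but a
# hypothetical "go to door" subtask ignores has_key. Experience gathered
# while holding the key therefore transfers directly to the situation
# without it, because both concrete states collapse to the same abstract
# state.

ALPHA = 0.5

def abstract_goto_door(state):
    position, has_key = state
    return position  # has_key is irrelevant for navigation

q_goto_door = defaultdict(float)  # keyed by (abstract state, action)

def update(state, action, target):
    key = (abstract_goto_door(state), action)
    q_goto_door[key] += ALPHA * (target - q_goto_door[key])

# A transition experienced while holding the key...
update((3, True), "right", target=1.0)
# ...is reflected in the value estimate without the key.
value_without_key = q_goto_door[(abstract_goto_door((3, False)), "right")]
```

Abstraction shrinks the subtask's effective state space by a factor of two here; in realistic tasks, dropping irrelevant state variables can shrink it exponentially, which is what makes subtasks faster to learn.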


Author(s):  
Yu. V. Dubenko

This paper is devoted to the problem of collective artificial intelligence that arises when intelligent agents solve problems in external environments. Such environments may be fully or partially observable, deterministic or stochastic, static or dynamic, and discrete or continuous. The paper identifies the problems of collective interaction among intelligent agents that arise in the class of tasks requiring coordinated action by a group of agents, e.g., exploring the territory of a complex infrastructure facility. It notes that reinforcement learning in multi-agent systems is underrepresented in the literature, especially in Russian-language publications. The article analyzes reinforcement learning, describes hierarchical reinforcement learning, and presents the basic methods for implementing reinforcement learning. The concept of a macro-action performed by agents organized into groups is introduced. The main problems of collective interaction among intelligent agents are identified: computing individual rewards for each agent; coordinating the agents; applying macro-actions by agents organized into groups; and exchanging the experience generated by different agents while solving a collective problem. The model of multi-agent reinforcement learning is described in detail, along with the difficulties of building this approach on existing solutions. The basic problems of multi-agent reinforcement learning are formulated in the conclusion.
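One of the listed mechanisms, the exchange of experience between agents, can be sketched as follows (a minimal construction of my own, not the article's model): each agent appends its transitions to a shared buffer, and all agents replay the pooled experience.

```python
# Illustrative sketch: two independent tabular Q-learners sharing a
# transition buffer. A rewarding transition observed by one agent is
# replayed by both, so the knowledge propagates across the group.

GAMMA, ALPHA = 0.9, 0.5
ACTIONS = (0, 1)

class Agent:
    """Tabular Q-learner that can consume transitions from any source."""
    def __init__(self):
        self.q = {}
    def value(self, s, a):
        return self.q.get((s, a), 0.0)
    def learn(self, s, a, r, s2):
        best_next = max(self.value(s2, b) for b in ACTIONS)
        td_error = r + GAMMA * best_next - self.value(s, a)
        self.q[(s, a)] = self.value(s, a) + ALPHA * td_error

shared_buffer = []
agents = [Agent(), Agent()]

# Only agent 0 actually experiences the rewarding transition...
shared_buffer.append(("door", 1, 1.0, "goal"))

# ...but every agent replays the shared buffer, so both update their values.
for agent in agents:
    for s, a, r, s2 in shared_buffer:
        agent.learn(s, a, r, s2)
```

The same buffer could hold macro-action outcomes rather than primitive transitions; the credit-assignment question of how to split a collective reward into the individual rewards logged here is exactly one of the open problems the abstract identifies.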

