Learning to Locomote: Understanding How Environment Design Matters for Deep Reinforcement Learning

Motion, Interaction and Games ◽

10.1145/3424636.3426907 ◽

2020 ◽

Author(s):

Daniele Reda ◽

Tianxin Tao ◽

Michiel van de Panne

Keyword(s):

Reinforcement Learning ◽

Environment Design

Download Full-text

Learning to Design Games: Strategic Environments in Reinforcement Learning

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/426 ◽

2018 ◽

Author(s):

Haifeng Zhang ◽

Jun Wang ◽

Zhiming Zhou ◽

Weinan Zhang ◽

Yin Wen ◽

...

Keyword(s):

Reinforcement Learning ◽

Game Design ◽

Signal Design ◽

Not Given ◽

Environment Design ◽

Space Design ◽

Strategic Environments ◽

Gradient Solution ◽

Policy Gradient ◽

Markov Decision

In typical reinforcement learning (RL), the environment is assumed given and the goal of the learning is to identify an optimal policy for the agent taking actions through its interactions with the environment. In this paper, we extend this setting by considering the environment is not given, but controllable and learnable through its interaction with the agent at the same time. This extension is motivated by environment design scenarios in the real-world, including game design, shopping space design and traffic signal design. Theoretically, we find a dual Markov decision process (MDP) w.r.t. the environment to that w.r.t. the agent, and derive a policy gradient solution to optimizing the parametrized environment. Furthermore, discontinuous environments are addressed by a proposed general generative framework. Our experiments on a Maze game design task show the effectiveness of the proposed algorithms in generating diverse and challenging Mazes against various agent settings.

Download Full-text

Supplemental Material for Reconciling Reinforcement Learning Models With Behavioral Extinction and Renewal: Implications for Addiction, Relapse, and Problem Gambling

Psychological Review ◽

10.1037/0033-295x.114.3.784.supp ◽

2007 ◽

Keyword(s):

Reinforcement Learning ◽

Problem Gambling ◽

Learning Models ◽

Behavioral Extinction ◽

Reinforcement Learning Models

Download Full-text

Bayes factors for reinforcement-learning models of the Iowa gambling task.

Decision ◽

10.1037/dec0000040 ◽

2016 ◽

Vol 3 (2) ◽

pp. 115-131 ◽

Author(s):

Helen Steingroever ◽

Ruud Wetzels ◽

Eric-Jan Wagenmakers

Keyword(s):

Reinforcement Learning ◽

Iowa Gambling Task ◽

Bayes Factors ◽

Gambling Task ◽

Learning Models ◽

Reinforcement Learning Models

Download Full-text

Analogical Reinforcement Learning With Two-Stage Memory Retrieval

PsycEXTRA Dataset ◽

10.1037/e528942014-705 ◽

2014 ◽

Author(s):

James Foster ◽

Matt Jones

Keyword(s):

Reinforcement Learning ◽

Memory Retrieval ◽

Download Full-text

Effects of Working Memory Capacity on the Speed and Accuracy of Learning in Reinforcement Learning Models

PsycEXTRA Dataset ◽

10.1037/e528942014-552 ◽

2014 ◽

Author(s):

Adnane Ez-Zizi ◽

Simon Farrell ◽

David Leslie

Keyword(s):

Working Memory ◽

Reinforcement Learning ◽

Working Memory Capacity ◽

Memory Capacity ◽

Learning Models ◽

Reinforcement Learning Models ◽

Speed And Accuracy

Download Full-text

Supplemental Material for Reinforcement Learning Models of Risky Choice and the Promotion of Risk-Taking by Losses Disguised as Wins in Rats

Journal of Experimental Psychology Animal Learning and Cognition ◽

10.1037/xan0000141.supp ◽

2017 ◽

Keyword(s):

Reinforcement Learning ◽

Risk Taking ◽

Risky Choice ◽

Learning Models ◽

Losses Disguised As Wins ◽

Reinforcement Learning Models

Download Full-text

Reinforcement learning of irrelevant stimulus-response associations modulates cognitive control.

Journal of Experimental Psychology Learning Memory and Cognition ◽

10.1037/xlm0000850 ◽

2020 ◽

Author(s):

Jinglu Chen ◽

Ling Tan ◽

Lu Liu ◽

Ling Wang

Keyword(s):

Reinforcement Learning ◽

Cognitive Control ◽

Irrelevant Stimulus ◽

Stimulus Response

Download Full-text

A Collaborative Scheduling Lane Changing Model for Intelligent Connected Vehicles Based on Deep Reinforcement Learning

10.1061/9780784483053.178 ◽

2020 ◽

Author(s):

Zheyu Cui ◽

Jianming Hu

Keyword(s):

Reinforcement Learning ◽

Connected Vehicles ◽

Lane Changing ◽

Collaborative Scheduling

Download Full-text

Multi-Agent Deep Reinforcement Learning for Decentralized Cooperative Traffic Signal Control

10.1061/9780784483053.039 ◽

2020 ◽

Author(s):

Yang Zhao ◽

Jian-Ming Hu ◽

Ming-Yang Gao ◽

Zuo Zhang

Keyword(s):

Reinforcement Learning ◽

Traffic Signal ◽

Signal Control ◽

Traffic Signal Control ◽

Download Full-text

Research on Signal Control Method of Single Intersection Based on Reinforcement Learning

10.1061/9780784483053.015 ◽

2020 ◽

Author(s):

Yilong Ren ◽

Le Zhang ◽

Han Jiang ◽

Chengsheng Liu

Keyword(s):

Reinforcement Learning ◽

Control Method ◽

Download Full-text