Improving RTS Game AI by Supervised Policy Learning, Tactical Search, and Deep Reinforcement Learning

2019 ◽  
Vol 14 (3) ◽  
pp. 8-18 ◽  
Author(s):  
Nicolas A. Barriga ◽  
Marius Stanescu ◽  
Felipe Besoain ◽  
Michael Buro


Author(s):  
Tianyu Liu ◽  
Zijie Zheng ◽  
Hongchang Li ◽  
Kaigui Bian ◽  
Lingyang Song

Game AI is of great importance, as games are simulations of reality. Recent research on game AI has shown much progress in various kinds of games, such as console games, board games, and MOBA games. However, RTS games remain a challenge because of their huge state spaces, imperfect information, sparse rewards, and variety of strategies. Moreover, typical card-based RTS games have complex card features and still lack effective solutions. We present SEAT (selection-attention), a deep model for playing card-based RTS games. The SEAT model consists of two parts, a selection part for card choice and an attention part for card usage, and it learns from scratch via deep reinforcement learning. Comprehensive experiments are performed on Clash Royale, a popular mobile card-based RTS game. Empirical results show that the SEAT agent achieves a high win rate against both rule-based and decision-tree-based agents.
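
A minimal sketch of how such a two-part selection-attention policy might be wired up (all module names, layer sizes, and feature dimensions below are illustrative assumptions, not the paper's architecture):

    import torch
    import torch.nn as nn

    CARD_FEATS = 32        # per-card feature vector size (assumed)
    STATE_FEATS = 128      # encoded game-state size (assumed)
    GRID_CELLS = 18 * 32   # candidate placement cells (assumed)

    class SEATSketch(nn.Module):
        """Two heads: selection (which card) and attention (where to use it)."""
        def __init__(self):
            super().__init__()
            self.state_enc = nn.Linear(STATE_FEATS, 64)
            self.select = nn.Linear(64 + CARD_FEATS, 1)           # scores a card
            self.attend = nn.Linear(64 + CARD_FEATS, GRID_CELLS)  # scores cells

        def forward(self, state, hand):
            # state: (B, STATE_FEATS); hand: (B, hand_size, CARD_FEATS)
            s = torch.relu(self.state_enc(state))
            s_tiled = s.unsqueeze(1).expand(-1, hand.size(1), -1)
            card_logits = self.select(torch.cat([s_tiled, hand], -1)).squeeze(-1)
            card_probs = torch.softmax(card_logits, dim=-1)        # card choice
            # Weight the hand by the selection distribution to condition placement.
            chosen = (card_probs.unsqueeze(-1) * hand).sum(dim=1)
            pos_logits = self.attend(torch.cat([s, chosen], -1))
            return card_probs, torch.softmax(pos_logits, dim=-1)   # card usage

Both heads output distributions, so a policy over the joint (card, position) action can be trained from scratch with a standard policy-gradient objective.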


2021 ◽  
pp. 503-562
Author(s):  
Adil Khan ◽  
Muhammad Naeem ◽  
Asad Masood Khattak ◽  
Muhammad Zubair Asghar ◽  
Abdul Haseeb Malik

Author(s):  
Carles Gelada ◽  
Marc G. Bellemare

In this paper we revisit the method of off-policy corrections for reinforcement learning (COP-TD) pioneered by Hallak et al. (2017). Under this method, online updates to the value function are reweighted to avoid divergence issues typical of off-policy learning. While Hallak et al.'s solution is appealing, it cannot easily be transferred to nonlinear function approximation. First, it requires a projection step onto the probability simplex; second, even though the operator describing the expected behavior of the off-policy learning algorithm is convergent, it is not known to be a contraction mapping, and hence may be unstable in practice. We address these two issues by introducing a discount factor into COP-TD. We analyze the behavior of discounted COP-TD and find it better behaved from a theoretical perspective. We also propose an alternative soft normalization penalty that can be minimized online and obviates the need for an explicit projection step. We complement our analysis with an empirical evaluation of the two techniques in an off-policy setting on the game Pong from the Atari domain, where we find discounted COP-TD to be better behaved in practice than the soft normalization penalty. Finally, we perform a more extensive evaluation of discounted COP-TD on five Atari games, where we find performance gains for our approach.
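
A minimal tabular sketch of the discounted update described above (in the paper the ratio model is a deep network trained with a semi-gradient step; the lookup table, learning rate, and variable names here are simplifying assumptions):

    import numpy as np

    def discounted_cop_td_update(c, s, a, s_next, pi, mu,
                                 gamma_hat=0.99, lr=0.1):
        """One online update of the ratio estimate c(x) ~ d_pi(x) / d_mu(x).

        pi, mu are (state, action) probability tables for the target and
        behavior policies; gamma_hat is the discount introduced to make
        the COP-TD operator better behaved.
        """
        # Discounted COP-TD target: gamma_hat * (pi/mu) * c(s) + (1 - gamma_hat)
        target = gamma_hat * (pi[s, a] / mu[s, a]) * c[s] + (1.0 - gamma_hat)
        c[s_next] += lr * (target - c[s_next])
        return c

    # Tiny usage example with 3 states and 2 actions (illustrative numbers):
    c = np.ones(3)
    pi = np.array([[0.9, 0.1], [0.5, 0.5], [0.2, 0.8]])
    mu = np.full((3, 2), 0.5)
    c = discounted_cop_td_update(c, s=0, a=0, s_next=1, pi=pi, mu=mu)

The soft normalization penalty would instead nudge the average of c toward 1 during online minimization, avoiding the explicit projection step.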


Author(s):  
Supaphon Kamon ◽  
Tung Due Nguyen ◽  
Tomohiro Harada ◽  
Ruck Thawonmas ◽  
Ikuko Nishikawa

2020 ◽  
Author(s):  
Ao Chen ◽  
Taresh Dewan ◽  
Manva Trivedi ◽  
Danning Jiang ◽  
Aloukik Aditya ◽  
...  

This paper provides a comparative analysis of the Deep Q-Network (DQN) and Double Deep Q-Network (DDQN) algorithms based on their hit rates, with DDQN proving better for the game Breakout. DQN is chosen over basic Q-learning because its neural network can learn a policy in complex environments, and DDQN is chosen because it mitigates the overestimation problem found in basic Q-learning, in which the agent chooses a non-optimal action for a state simply because it has the maximum Q-value.
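
For reference, a minimal sketch of the target computation that distinguishes the two algorithms (network classes, tensor shapes, and names are assumptions for illustration):

    import torch

    def ddqn_target(reward, next_state, done, online_net, target_net,
                    gamma=0.99):
        """Double DQN: the online net picks the action, the target net rates it.

        Plain DQN would instead take target_net(next_state).max(dim=1) here,
        which is the source of the overestimation bias mentioned above.
        """
        with torch.no_grad():
            best_action = online_net(next_state).argmax(dim=1, keepdim=True)
            next_q = target_net(next_state).gather(1, best_action).squeeze(1)
        return reward + gamma * (1.0 - done.float()) * next_q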


Author(s):  
Andrew Anderson ◽  
Jonathan Dodge ◽  
Amrita Sadarangani ◽  
Zoe Juozapaitis ◽  
Evan Newman ◽  
...  

We present a user study to investigate the impact of explanations on non-experts' understanding of reinforcement learning (RL) agents. We investigate both a common RL visualization, saliency maps (the focus of attention), and a more recent explanation type, reward-decomposition bars (predictions of future types of rewards). We designed a 124-participant, four-treatment experiment to compare participants' mental models of an RL agent in a simple Real-Time Strategy (RTS) game. Our results show that the combination of both saliency maps and reward bars was needed to achieve a statistically significant improvement in mental-model score over the control. In addition, our qualitative analysis of the data reveals a number of effects for further study.
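
For context, reward-decomposition bars rest on a simple identity: if the reward is split into types (r = sum_c r_c), each type gets its own value estimate and the ordinary Q-value is the sum of the bars. A toy sketch with made-up reward types and numbers (not the study's actual game rewards):

    # Hypothetical per-type Q estimates for one (state, action) pair.
    bars = {"damage_dealt": 1.8, "damage_taken": -0.6, "resources": 0.4}

    def total_q(decomposed):
        """Standard Q(s, a) is recovered as the sum of the per-type bars."""
        return sum(decomposed.values())

    print(total_q(bars))  # 1.6, the value shown alongside the bars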

