Study on Statistics Based Q-Learning Algorithm for Multi-agent System

Author(s): Xie Ya, Huang Zhonghua
2020, Vol 17 (2), pp. 647-664

Author(s): Yangyang Ge, Fei Zhu, Wei Huang, Peiyao Zhao, Quan Liu

Multi-agent systems have broad applications in the real world, yet their safety performance is rarely considered. Reinforcement learning is one of the most important methods for solving multi-agent problems, and progress has been made in applying multi-agent reinforcement learning to robotic systems, man-machine games, automation, and other areas. In these settings, however, an agent may fall into unsafe states in which it cannot bypass obstacles, receive information from other agents, and so on. Ensuring the safety of a multi-agent system is therefore critical in areas where an agent may enter dangerous, irreversible states and cause great damage. To address this safety problem, this paper introduces a Multi-Agent Cooperation Q-Learning Algorithm based on a Constrained Markov Game. In this method, safety constraints are added to the action set, and each agent, when interacting with the environment to search for optimal values, is restricted by the safety rules, so that the resulting optimal policy satisfies the security requirements. Since traditional multi-agent reinforcement learning algorithms are no longer suitable for the proposed model, a new method is introduced for computing the globally optimal state-action function subject to the safety constraints. Under the conditions that the state-action function and the constraint function are both differentiable, we apply the Lagrange multiplier method, after linearizing the constraint functions, to determine the optimal action in the current state. This not only improves the efficiency and accuracy of the algorithm but also guarantees that the global optimal solution is obtained. Experiments verify the effectiveness of the algorithm.
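The core idea described in the abstract — Q-learning on a constrained Markov decision problem, with a Lagrange multiplier penalising actions whose expected safety cost exceeds a budget — can be sketched in a minimal single-agent form. Everything here is an illustrative assumption, not the paper's implementation: the environment, the cost model `C`, the cost budget, and the dual-ascent step size are all hypothetical.

```python
import numpy as np

# Hypothetical toy setting: one agent, small discrete state/action spaces.
# Q(s, a) is the value table; C(s, a) is the expected safety cost of taking
# action a in state s. Safety is enforced by a Lagrange multiplier rather
# than by pruning the action set directly.
N_STATES, N_ACTIONS = 4, 3
COST_BUDGET = 0.5        # safety threshold (assumed)
ALPHA, GAMMA = 0.1, 0.9  # learning rate and discount factor
LAMBDA_LR = 0.01         # step size for the Lagrange multiplier

rng = np.random.default_rng(0)
Q = np.zeros((N_STATES, N_ACTIONS))
C = rng.uniform(0.0, 1.0, size=(N_STATES, N_ACTIONS))  # fixed toy cost model
lam = 0.0  # Lagrange multiplier for the safety constraint

def safe_greedy_action(s):
    # Maximise the Lagrangian Q(s, a) - lam * C(s, a): the multiplier
    # penalises actions whose expected safety cost is high.
    return int(np.argmax(Q[s] - lam * C[s]))

def step(s, a):
    # Hypothetical environment dynamics: random next state, reward
    # favouring low-cost actions (purely illustrative).
    s_next = int(rng.integers(N_STATES))
    r = 1.0 - C[s, a]
    return s_next, r

s = 0
for _ in range(5000):
    # Epsilon-greedy exploration over the Lagrangian-greedy policy.
    a = safe_greedy_action(s) if rng.random() > 0.1 else int(rng.integers(N_ACTIONS))
    s_next, r = step(s, a)
    # Standard Q-learning update on the reward objective.
    Q[s, a] += ALPHA * (r + GAMMA * Q[s_next].max() - Q[s, a])
    # Dual ascent on the multiplier: it grows while the incurred cost
    # exceeds the budget, and is projected back to [0, inf).
    lam = max(0.0, lam + LAMBDA_LR * (C[s, a] - COST_BUDGET))
    s = s_next
```

The multiplier update is the primal-dual pattern commonly used for constrained MDPs; the paper's multi-agent, linearized-constraint formulation would replace the toy cost model and the single Q-table with per-agent constrained value functions.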


2017, Vol 52, pp. 519-531
Author(s): Farhad Pourpanah, Choo Jun Tan, Chee Peng Lim, Junita Mohamad-Saleh

2020, Vol 8 (3), pp. 201-224
Author(s): Faqihza Mukhlish, John Page, Michael Bain

Purpose: This paper aims to propose a novel epigenetic learning (EpiLearn) algorithm designed specifically for decentralised multi-agent systems such as swarm robotics.

Design/methodology/approach: First, the paper gives an overview of swarm robotics and the challenges in designing swarm behaviour automatically, indicating the improvements required to enhance automatic swarm design. Second, the epigenetic learning (EpiLearn) algorithm for a swarm system using an epigenetic layer is formulated and discussed. The algorithm is then tested on various test functions to investigate its performance. Finally, the results are discussed along with possible future research directions.

Findings: On various test functions, the algorithm can solve non-local and many-local-minima problems. The article also shows that, by using a reward system, the algorithm can handle the deceptive problems that often occur in dynamic settings. Moreover, using rewards from the environment in the form of a methylation process on the epigenetic layer improves the performance of traditional evolutionary algorithms applied to automatic swarm design. Finally, the article shows that a regeneration process that embeds an epigenetic layer in the inheritance process outperforms a traditional crossover operator in a swarm system.

Originality/value: This paper proposes a novel method for automatic swarm design that takes into account the importance of multi-agent settings and the environmental characteristics surrounding the swarm. The epigenetic learning (EpiLearn) algorithm, with its epigenetic layer, gives the swarm the ability to perform co-evolution and co-learning.
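The mechanism the abstract describes — an epigenetic layer whose "methylation" state is adjusted by environmental reward and inherited during regeneration in place of crossover — can be sketched as a small evolutionary loop. This is a hedged illustration only: the genome encoding, the sphere objective, the methylation rate, and the regeneration operator are all assumptions, not the paper's actual design.

```python
import random

# Hypothetical sketch: each individual carries a genome (real-valued genes)
# plus an epigenetic "methylation" mask that gates gene expression.
# Rewards from the environment adjust the mask, and the mask is inherited
# alongside the genome during regeneration, replacing crossover.
GENOME_LEN, POP_SIZE, GENERATIONS = 8, 20, 50
random.seed(1)

def new_individual():
    return {"genes": [random.uniform(-1, 1) for _ in range(GENOME_LEN)],
            "mask": [1.0] * GENOME_LEN}  # 1.0 = fully expressed

def express(ind):
    # Phenotype = genes gated (scaled) by the epigenetic mask.
    return [g * m for g, m in zip(ind["genes"], ind["mask"])]

def fitness(phenotype):
    # Toy objective (assumed): negated sphere function, maximised at 0.
    return -sum(x * x for x in phenotype)

def methylate(ind, reward, rate=0.05):
    # Reward-driven mask update: poor reward increases silencing of
    # randomly chosen genes, which can help escape deceptive optima.
    for i in range(GENOME_LEN):
        if reward < 0 and random.random() < rate:
            ind["mask"][i] = max(0.0, ind["mask"][i] - 0.1)

def regenerate(parent):
    # Regeneration: the child inherits genes AND the epigenetic mask,
    # with Gaussian mutation on the genes (no crossover operator).
    return {"genes": [g + random.gauss(0, 0.1) for g in parent["genes"]],
            "mask": list(parent["mask"])}

pop = [new_individual() for _ in range(POP_SIZE)]
for _ in range(GENERATIONS):
    scored = sorted(pop, key=lambda ind: fitness(express(ind)), reverse=True)
    for ind in scored:
        methylate(ind, fitness(express(ind)))
    elite = scored[: POP_SIZE // 2]
    pop = elite + [regenerate(random.choice(elite))
                   for _ in range(POP_SIZE - len(elite))]

best = max(pop, key=lambda ind: fitness(express(ind)))
```

The design point being illustrated is that selection acts on the expressed phenotype, not the raw genome, so two individuals with identical genes can behave differently depending on their inherited methylation state.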

