Multi-agent cooperation Q-learning algorithm based on constrained Markov Game

Yangyang Ge; Fei Zhu; Wei Huang; Peiyao Zhao; Quan Liu

doi:10.2298/csis191220009g

Damaged buildings recognition of post-earthquake high-resolution remote sensing images based on feature space and decision tree optimization

Computer Science and Information Systems ◽

10.2298/csis190817004w ◽

2020 ◽

Vol 17 (2) ◽

pp. 619-646

Author(s):

Chao Wang ◽

Xing Qiu ◽

Hui Liu ◽

Dan Li ◽

Kaiguang Zhao ◽

...

Keyword(s):

Reinforcement Learning ◽

Learning Algorithm ◽

Optimal Solution ◽

Security Requirements ◽

Multi Agent System ◽

Agent System ◽

Action Function ◽

State Action ◽

Safety Constraints ◽

Multi Agent

Multi-Agent system has broad application in real world, whose security performance, however, is barely considered. Reinforcement learning is one of the most important methods to resolve Multi-Agent problems. At present, certain progress has been made in applying Multi-Agent reinforcement learning to robot system, man-machine match, and automatic, etc. However, in the above area, an agent may fall into unsafe states where the agent may find it difficult to bypass obstacles, to receive information from other agents and so on. Ensuring the safety of Multi-Agent system is of great importance in the above areas where an agent may fall into dangerous states that are irreversible, causing great damage. To solve the safety problem, in this paper we introduce a Multi-Agent Cooperation Q-Learning Algorithm based on Constrained Markov Game. In this method, safety constraints are added to the set of actions, and each agent, when interacting with the environment to search for optimal values, should be restricted by the safety rules, so as to obtain an optimal policy that satisfies the security requirements. Since traditional Multi-Agent reinforcement learning algorithm is no more suitable for the proposed model in this paper, a new solution is introduced for calculating the global optimum state-action function that satisfies the safety constraints. We take advantage of the Lagrange multiplier method to determine the optimal action that can be performed in the current state based on the premise of linearizing constraint functions, under conditions that the state-action function and the constraint function are both differentiable, which not only improves the efficiency and accuracy of the algorithm, but also guarantees to obtain the global optimal solution. The experiments verify the effectiveness of the algorithm.

Download Full-text

Q Value Reinforcement Learning Algorithm Based on Multi Agent System

Journal of Physics Conference Series ◽

10.1088/1742-6596/1069/1/012094 ◽

2018 ◽

Vol 1069 ◽

pp. 012094 ◽

Cited By ~ 1

Author(s):

Xijie Yin ◽

Dongxin Yang

Keyword(s):

Reinforcement Learning ◽

Learning Algorithm ◽

Multi Agent System ◽

Q Value ◽

Agent System ◽

Multi Agent ◽

Reinforcement Learning Algorithm

Download Full-text

Statistics Based Q-learning Algorithm for Multi-Agent System and Application in RoboCup

Journal of Software ◽

10.4304/jsw.9.3.634-640 ◽

2014 ◽

Vol 9 (3) ◽

Author(s):

Ya Xie ◽

Zhonghua Huang

Keyword(s):

Learning Algorithm ◽

Multi Agent System ◽

Agent System ◽

Q Learning ◽

Multi Agent

Download Full-text

Research on decision-making strategy of soccer robot based on multi-agent reinforcement learning

International Journal of Advanced Robotic Systems ◽

10.1177/1729881420916960 ◽

2020 ◽

Vol 17 (3) ◽

pp. 172988142091696

Author(s):

Xiaoli Liu

Keyword(s):

Decision Making ◽

Reinforcement Learning ◽

Learning Algorithm ◽

Division Of Labour ◽

Learning System ◽

Selection Strategy ◽

Multi Agent System ◽

Soccer Robot ◽

Agent System ◽

Multi Agent

This article studies a multi-agent reinforcement learning algorithm based on agent action prediction. In multi-agent system, the action of learning agent selection is inevitably affected by the action of other agents, so the reinforcement learning system needs to consider the joint state and joint action of multi-agent based on this. In addition, the application of this method in the cooperative strategy learning of soccer robot is studied, so that the multi-agent system can pass through the environment. To realize the division of labour and cooperation of multi-robots, the interactive learning is used to master the behaviour strategy. Combined with the characteristics of decision-making of soccer robot, this article analyses the role transformation and experience sharing of multi-agent reinforcement learning, and applies it to the local attack strategy of soccer robot, uses this algorithm to learn the action selection strategy of the main robot in the team, and uses Matlab platform for simulation verification. The experimental results prove the effectiveness of the research method, and the superiority of the proposed method is validated compared with some simple methods.

Download Full-text

INFLUENCE LEARNING FOR MULTI-AGENT SYSTEM BASED ON REINFORCEMENT LEARNING

International Journal of Computing ◽

10.47839/ijc.11.1.549 ◽

2014 ◽

pp. 39-44

Author(s):

Anton Kabysh ◽

Vladimir Golovko ◽

Arunas Lipnickas

Keyword(s):

Reinforcement Learning ◽

Learning Algorithm ◽

Learning Rule ◽

Learning Approach ◽

Multi Agent System ◽

Agent System ◽

Additional Information ◽

Optimal Behavior ◽

Learning Technique ◽

Multi Agent

This paper describes a multi-agent influence learning approach and reinforcement learning adaptation to it. This learning technique is used for distributed, adaptive and self-organizing control in multi-agent system. This technique is quite simple and uses agent’s influences to estimate learning error between them. The best influences are rewarded via reinforcement learning which is a well-proven learning technique. It is shown that this learning rule supports positive-reward interactions between agents and does not require any additional information than standard reinforcement learning algorithm. This technique produces optimal behavior of multi-agent system with fast convergence patterns.

Download Full-text

A novel multi-agent Q-learning algorithm in cooperative multi-agent system

Proceedings of the 3rd World Congress on Intelligent Control and Automation (Cat. No.00EX393) ◽

10.1109/wcica.2000.859964 ◽

2002 ◽

Cited By ~ 1

Author(s):

Ou Haitao ◽

Zhang Weidong ◽

Zhang Wenyuan ◽

Xu Xiaoming

Keyword(s):

Learning Algorithm ◽

Multi Agent System ◽

Agent System ◽

Q Learning ◽

Multi Agent

Download Full-text

Improving Reinforcement Learning Algorithm Using Emotions in a Multi-agent System

Intelligent Virtual Agents - Lecture Notes in Computer Science ◽

10.1007/978-3-540-39396-2_64 ◽

2003 ◽

pp. 361-362

Author(s):

Roozbeh Daneshvar ◽

Caro Lucas

Keyword(s):

Reinforcement Learning ◽

Learning Algorithm ◽

Multi Agent System ◽

Agent System ◽

Multi Agent ◽

Reinforcement Learning Algorithm

Download Full-text

Study on Statistics Based Q-Learning Algorithm for Multi-agent System

2013 Fourth International Conference on Intelligent Systems Design and Engineering Applications ◽

10.1109/isdea.2013.541 ◽

2013 ◽

Author(s):

Xie Ya ◽

Huang Zhonghua

Keyword(s):

Learning Algorithm ◽

Multi Agent System ◽

Agent System ◽

Q Learning ◽

Multi Agent

Download Full-text

Improvement on Supporting Machine Learning Algorithm for Solving Problem in Immediate Decision Making

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.566.572 ◽

2012 ◽

Vol 566 ◽

pp. 572-579

Author(s):

Abdolkarim Niazi ◽

Norizah Redzuan ◽

Raja Ishak Raja Hamzah ◽

Sara Esfandiari

Keyword(s):

Reinforcement Learning ◽

Learning Algorithm ◽

Multi Agent Systems ◽

Combined Model ◽

Q Learning ◽

Agent Systems ◽

Multi Agent ◽

Case Base ◽

Case Base Reasoning ◽

Robotic Tool

In this paper, a new algorithm based on case base reasoning and reinforcement learning (RL) is proposed to increase the convergence rate of the reinforcement learning algorithms. RL algorithms are very useful for solving wide variety decision problems when their models are not available and they must make decision correctly in every state of system, such as multi agent systems, artificial control systems, robotic, tool condition monitoring and etc. In the propose method, we investigate how making improved action selection in reinforcement learning (RL) algorithm. In the proposed method, the new combined model using case base reasoning systems and a new optimized function is proposed to select the action, which led to an increase in algorithms based on Q-learning. The algorithm mentioned was used for solving the problem of cooperative Markov’s games as one of the models of Markov based multi-agent systems. The results of experiments Indicated that the proposed algorithms perform better than the existing algorithms in terms of speed and accuracy of reaching the optimal policy.

Download Full-text

Q-Learning Based Cooperative Multi-Agent System Applied to Coordination of Overcurrent Relays

Journal of Applied Sciences ◽

10.3923/jas.2008.3924.3930 ◽

2008 ◽

Vol 8 (21) ◽

pp. 3924-3930 ◽

Cited By ~ 1

Author(s):

J. Sadeh ◽

M. Rahimiyan

Keyword(s):

Multi Agent System ◽

Agent System ◽

Q Learning ◽

Overcurrent Relays ◽

Multi Agent

Download Full-text