A multi-agent system with reinforcement learning agents for biomedical text mining

As robotic systems become more prevalent, it is highly desirable for them to be able to operate in highly dynamic environments. A common approach is to use reinforcement learning to allow an agent controlling the robot to learn and adapt its behavior based on a reward function. This paper presents a novel multi-agent system that cooperates to control a single robot battle tank in a melee battle scenario, with no prior knowledge of its opponents’ strategies. The agents learn through reinforcement learning, and are loosely coupled by their reward functions. Each agent controls a different aspect of the robot’s behavior. In addition, the problem of delayed reward is addressed through a time-averaged reward applied to several sequential actions at once. This system was evaluated in a simulated melee combat scenario and was shown to learn to improve its performance over time. This was accomplished by each agent learning to pick specific battle strategies for each different opponent it faced.

Download Full-text

Fast reinforcement learning approach to cooperative behavior acquisition in multi-agent system

IEEE/RSJ International Conference on Intelligent Robots and System ◽

10.1109/irds.2002.1041500 ◽

2003 ◽

Cited By ~ 2

Author(s):

Songhao Piao ◽

Bingrong Hong

Keyword(s):

Reinforcement Learning ◽

Cooperative Behavior ◽

Learning Approach ◽

Multi Agent System ◽

Agent System ◽

Multi Agent

Download Full-text

Application of reinforcement learning on self-tuning PID controller for soccer robot multi-agent system

2013 Joint International Conference on Rural Information & Communication Technology and Electric-Vehicle Technology (rICT & ICeV-T) ◽

10.1109/rict-icevt.2013.6741546 ◽

2013 ◽

Cited By ~ 5

Author(s):

Aulia el Hakim ◽

Hilwadi Hindersah ◽

Estiko Rijanto

Keyword(s):

Reinforcement Learning ◽

Pid Controller ◽

Multi Agent System ◽

Soccer Robot ◽

Agent System ◽

Self Tuning ◽

Multi Agent

Download Full-text

A mathematical model for learning agents on a multi-agent system

Proceedings 2003 IEEE International Symposium on Computational Intelligence in Robotics and Automation ◽

10.1109/cira.2003.1222197 ◽

2004 ◽

Cited By ~ 1

Author(s):

M. Furukawa ◽

M. Watanabe ◽

M. Kinoshita ◽

Y. Kakazu

Keyword(s):

Mathematical Model ◽

Multi Agent System ◽

Agent System ◽

Learning Agents ◽

Multi Agent

Download Full-text

Multi-agent cooperation Q-learning algorithm based on constrained Markov Game

Computer Science and Information Systems ◽

10.2298/csis191220009g ◽

2020 ◽

Vol 17 (2) ◽

pp. 647-664

Author(s):

Yangyang Ge ◽

Fei Zhu ◽

Wei Huang ◽

Peiyao Zhao ◽

Quan Liu

Keyword(s):

Reinforcement Learning ◽

Learning Algorithm ◽

Multi Agent System ◽

Agent System ◽

Action Function ◽

Q Learning ◽

State Action ◽

Markov Game ◽

Safety Constraints ◽

Multi Agent

Multi-Agent system has broad application in real world, whose security performance, however, is barely considered. Reinforcement learning is one of the most important methods to resolve Multi-Agent problems. At present, certain progress has been made in applying Multi-Agent reinforcement learning to robot system, man-machine match, and automatic, etc. However, in the above area, an agent may fall into unsafe states where the agent may find it difficult to bypass obstacles, to receive information from other agents and so on. Ensuring the safety of Multi-Agent system is of great importance in the above areas where an agent may fall into dangerous states that are irreversible, causing great damage. To solve the safety problem, in this paper we introduce a Multi-Agent Cooperation Q-Learning Algorithm based on Constrained Markov Game. In this method, safety constraints are added to the set of actions, and each agent, when interacting with the environment to search for optimal values, should be restricted by the safety rules, so as to obtain an optimal policy that satisfies the security requirements. Since traditional Multi-Agent reinforcement learning algorithm is no more suitable for the proposed model in this paper, a new solution is introduced for calculating the global optimum state-action function that satisfies the safety constraints. We take advantage of the Lagrange multiplier method to determine the optimal action that can be performed in the current state based on the premise of linearizing constraint functions, under conditions that the state-action function and the constraint function are both differentiable, which not only improves the efficiency and accuracy of the algorithm, but also guarantees to obtain the global optimal solution. The experiments verify the effectiveness of the algorithm.

Download Full-text

Resource Management in a Multi-agent System by Means of Reinforcement Learning and Supervised Rule Learning

Computational Science – ICCS 2007 - Lecture Notes in Computer Science ◽

10.1007/978-3-540-72586-2_121 ◽

2007 ◽

pp. 864-871 ◽

Cited By ~ 4

Author(s):

Bartłomiej Śnieżyński

Keyword(s):

Reinforcement Learning ◽

Resource Management ◽

Rule Learning ◽

Multi Agent System ◽

Agent System ◽

Multi Agent

Download Full-text