Multi-Agent Reinforcement Learning Based on K-Means Clustering in Multi-Robot Cooperative Systems

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.216.75 ◽

2011 ◽

Vol 216 ◽

pp. 75-80 ◽

Cited By ~ 1

Author(s):

Chang An Liu ◽

Fei Liu ◽

Chun Yang Liu ◽

Hua Wu

Keyword(s):

Reinforcement Learning ◽

State Space ◽

Experimental Results ◽

Learning Ability ◽

Learning Method ◽

Q Learning ◽

State Space Explosion ◽

Multi Agent ◽

Robot Cooperation ◽

Multi Robot

To solve the curse of dimensionality problem in multi-agent reinforcement learning, a learning method based on k-means is presented in this paper. In this method, the environmental state is represented as key state factors. The state space explosion is avoided by classifying states into different clusters using k-means. The learning rate is improved by assigning different states to existent clusters, as well as corresponding strategy. Compared to traditional Q-learning, our experimental results of the multi-robot cooperation show that our scheme improves the team learning ability efficiently. Meanwhile, the cooperation efficiency can be enhanced successfully.

Download Full-text

Energy Optimization of Solar Micro-Grid Using Multi Agent Reinforcement Learning

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.787.843 ◽

2015 ◽

Vol 787 ◽

pp. 843-847

Author(s):

Leo Raju ◽

R.S. Milton ◽

S. Sakthiyanandan

Keyword(s):

Reinforcement Learning ◽

Energy Savings ◽

Learning Method ◽

Solar Pv ◽

Q Learning ◽

Pv Systems ◽

Model Free ◽

Individual Unit ◽

Multi Agent ◽

Micro Grid

In this paper, two solar Photovoltaic (PV) systems are considered; one in the department with capacity of 100 kW and the other in the hostel with capacity of 200 kW. Each one has battery and load. The capital cost and energy savings by conventional methods are compared and it is proved that the energy dependency from grid is reduced in solar micro-grid element, operating in distributed environment. In the smart grid frame work, the grid energy consumption is further reduced by optimal scheduling of the battery, using Reinforcement Learning. Individual unit optimization is done by a model free reinforcement learning method, called Q-Learning and it is compared with distributed operations of solar micro-grid using a Multi Agent Reinforcement Learning method, called Joint Q-Learning. The energy planning is designed according to the prediction of solar PV energy production and observed load pattern of department and the hostel. A simulation model was developed using Python programming.

Download Full-text

The Knowledge Sharing Based Reinforcement Learning Algorithm for Collective Behaviors of Mobile Robots

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.588-589.1515 ◽

2012 ◽

Vol 588-589 ◽

pp. 1515-1518

Author(s):

Yong Song ◽

Bing Liu ◽

Yi Bin Li

Keyword(s):

Reinforcement Learning ◽

Mobile Robots ◽

Knowledge Sharing ◽

State Space ◽

Learning Algorithm ◽

Collective Behaviors ◽

Q Learning ◽

Exponential Increase ◽

Multi Robot ◽

Reinforcement Learning Algorithm

Reinforcement learning algorithm for multi-robot may will become very slow when the number of robots is increasing resulting in an exponential increase of state space. A sequential Q-learning base on knowledge sharing is presented. The rule repository of robots behaviors is firstly initialized in the process of reinforcement learning. Mobile robots obtain present environmental state by sensors. Then the state will be matched to determine if the relevant behavior rule has been stored in database. If the rule is present, an action will be chosen in accordance with the knowledge and the rules, and the matching weight will be refined. Otherwise the new rule will be joined in the database. The robots learn according to a given sequence and share the behavior database. We examine the algorithm by multi-robot following-surrounding behavior, and find that the improved algorithm can effectively accelerate the convergence speed.

Download Full-text

Multi-robot Cooperation Based on Continuous Reinforcement Learning with Two State Space Representations

2013 IEEE International Conference on Systems, Man, and Cybernetics ◽

10.1109/smc.2013.760 ◽

2013 ◽

Cited By ~ 1

Author(s):

Toshiyuki Yasuda ◽

Kazuhiro Ohkura ◽

Kazuaki Yamada

Keyword(s):

Reinforcement Learning ◽

State Space ◽

Continuous Reinforcement ◽

Robot Cooperation ◽

Multi Robot

Download Full-text

A Multi-agent Reinforcement Learning Method for Role Differentiation Using State Space Filters with Fluctuation Parameters

Journal of Robotics Networking and Artificial Life ◽

10.2991/jrnal.k.210521.002 ◽

2021 ◽

Author(s):

Masato Nagayoshi ◽

Simon Elderton ◽

Hisashi Tamaki

Keyword(s):

Reinforcement Learning ◽

State Space ◽

Learning Method ◽

Role Differentiation ◽

Multi Agent

Download Full-text

Improvement on Supporting Machine Learning Algorithm for Solving Problem in Immediate Decision Making

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.566.572 ◽

2012 ◽

Vol 566 ◽

pp. 572-579

Author(s):

Abdolkarim Niazi ◽

Norizah Redzuan ◽

Raja Ishak Raja Hamzah ◽

Sara Esfandiari

Keyword(s):

Reinforcement Learning ◽

Learning Algorithm ◽

Multi Agent Systems ◽

Combined Model ◽

Q Learning ◽

Agent Systems ◽

Multi Agent ◽

Case Base ◽

Case Base Reasoning ◽

Robotic Tool

In this paper, a new algorithm based on case base reasoning and reinforcement learning (RL) is proposed to increase the convergence rate of the reinforcement learning algorithms. RL algorithms are very useful for solving wide variety decision problems when their models are not available and they must make decision correctly in every state of system, such as multi agent systems, artificial control systems, robotic, tool condition monitoring and etc. In the propose method, we investigate how making improved action selection in reinforcement learning (RL) algorithm. In the proposed method, the new combined model using case base reasoning systems and a new optimized function is proposed to select the action, which led to an increase in algorithms based on Q-learning. The algorithm mentioned was used for solving the problem of cooperative Markov’s games as one of the models of Markov based multi-agent systems. The results of experiments Indicated that the proposed algorithms perform better than the existing algorithms in terms of speed and accuracy of reaching the optimal policy.

Download Full-text

A Hierarchical Reinforcement Learning Based Approach for Multi-robot Cooperation in Unknown Environments

Advances in Intelligent and Soft Computing - Proceedings of the 2011 2nd International Congress on Computer Applications and Computational Science ◽

10.1007/978-3-642-28314-7_9 ◽

2012 ◽

pp. 69-74 ◽

Cited By ~ 3

Author(s):

Yifan Cai ◽

Simon X. Yang ◽

Xin Xu ◽

Gauri S. Mittal

Keyword(s):

Reinforcement Learning ◽

Hierarchical Reinforcement Learning ◽

Unknown Environments ◽

Robot Cooperation ◽

Multi Robot

Download Full-text

A Multi-agent Reinforcement Learning Method for Swarm Robots in Space Collaborative Exploration

2020 6th International Conference on Control, Automation and Robotics (ICCAR) ◽

10.1109/iccar49639.2020.9107997 ◽

2020 ◽

Author(s):

Yixin Huang ◽

Shufan Wu ◽

Zhongcheng Mu ◽

Xiangyu Long ◽

Sunhao Chu ◽

...

Keyword(s):

Reinforcement Learning ◽

Learning Method ◽

Swarm Robots ◽

Multi Agent ◽

Collaborative Exploration

Download Full-text

Distributed multi-agent deep reinforcement learning for cooperative multi-robot pursuit

The Journal of Engineering ◽

10.1049/joe.2019.1200 ◽

2020 ◽

Author(s):

Chao Yu ◽

Yinzhao Dong ◽

Yangning Li ◽

Yatong Chen

Keyword(s):

Reinforcement Learning ◽

Multi Agent ◽

Multi Robot

Download Full-text

Extended Q-Learning: Reinforcement Learning Using Self-Organized State Space

RoboCup 2000: Robot Soccer World Cup IV - Lecture Notes in Computer Science ◽

10.1007/3-540-45324-5_11 ◽

2001 ◽

pp. 129-138 ◽

Cited By ~ 2

Author(s):

Shuichi Enokida ◽

Takeshi Ohasi ◽

Takaichi Yoshida ◽

Toshiaki Ejima

Keyword(s):

Reinforcement Learning ◽

State Space ◽

Q Learning ◽

Self Organized ◽

Learning Reinforcement

Download Full-text

Multi-Robot Q-Learning over Community Perception Network with Homogeneous Delays

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.823.321 ◽

2013 ◽

Vol 823 ◽

pp. 321-325

Author(s):

Lu Jin ◽

Yue Quan Yang ◽

Chun Bo Ni ◽

Zhi Qiang Cao ◽

Yi Fei Kong

Keyword(s):

Robot Learning ◽

Learning Method ◽

Community Perception ◽

Q Value ◽

Robot System ◽

Q Learning ◽

Information Interaction ◽

Community Information ◽

Information Sharing Mechanism ◽

Multi Robot

With the more robots, the information interaction of multi-robot system becomes more sophisticated and important in a community perception network environment. By exploiting and fusing the learning information of robots in a perception community, the community information sharing mechanism is proposed, as well as updating rules of the community Q-value table. Moreover, considering the existence of delays of learning information transmission, an improved Q-learning method based on homogeneous delays is presented to improve the robot learning efficiency over the community perception network. Finally, the test experiments demonstrate the effectiveness of the proposed scheme.

Download Full-text