Research of Q-learning Algorithm of Multi-agent in Micro-grid Control System Based on Probability

In this paper, a new algorithm based on case-based reasoning and reinforcement learning (RL) is proposed to increase the convergence rate of reinforcement learning algorithms. RL algorithms are useful for solving a wide variety of decision problems in which no model is available and a correct decision must be made in every state of the system, such as multi-agent systems, artificial control systems, robotics, and tool condition monitoring. The proposed method investigates how to improve action selection in RL algorithms: a new combined model using case-based reasoning systems and a new optimized function is used to select the action, which increases the convergence rate of algorithms based on Q-learning. The algorithm was applied to cooperative Markov games, one of the models of Markov-based multi-agent systems. The experimental results indicate that the proposed algorithms outperform existing algorithms in terms of the speed and accuracy of reaching the optimal policy.
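
For orientation, the textbook tabular Q-learning update that action-selection schemes of this kind build on can be written as follows. This is the standard form, not the modified rule of the paper; the symbols (learning rate \alpha, discount factor \gamma, reward r, next state s') are the usual conventions and are assumptions here, since the abstract does not define them.

```latex
Q(s, a) \leftarrow Q(s, a) + \alpha \left[ r + \gamma \max_{a'} Q(s', a') - Q(s, a) \right]
```

A case-based component would typically change only how the action a is chosen in state s, leaving this update unchanged.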

Energy Optimization of Solar Micro-Grid Using Multi Agent Reinforcement Learning

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.787.843 ◽

2015 ◽

Vol 787 ◽

pp. 843-847

Author(s):

Leo Raju ◽

R.S. Milton ◽

S. Sakthiyanandan

Keyword(s):

Reinforcement Learning ◽

Energy Savings ◽

Learning Method ◽

Solar Pv ◽

Q Learning ◽

Pv Systems ◽

Model Free ◽

Individual Unit ◽

Multi Agent ◽

Micro Grid

In this paper, two solar photovoltaic (PV) systems are considered: one in the department with a capacity of 100 kW and the other in the hostel with a capacity of 200 kW. Each has a battery and a load. The capital cost and energy savings of conventional methods are compared, and it is shown that dependency on grid energy is reduced in a solar micro-grid element operating in a distributed environment. In the smart grid framework, grid energy consumption is further reduced by optimal scheduling of the battery using reinforcement learning. Individual unit optimization is done by a model-free reinforcement learning method called Q-Learning, and it is compared with distributed operation of the solar micro-grid using a multi-agent reinforcement learning method called Joint Q-Learning. The energy planning is designed according to the predicted solar PV energy production and the observed load patterns of the department and the hostel. A simulation model was developed in Python.
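
The individual-unit case described above can be illustrated with a minimal tabular Q-learning sketch for scheduling a single battery. Everything concrete here is a hypothetical assumption for illustration (the `SOLAR` and `LOAD` profiles, the capacity, the three-action set, and the reward defined as negative grid energy); none of it is taken from the paper, and the multi-agent Joint Q-Learning variant is not shown.

```python
import random

# Illustrative assumptions only: none of these numbers come from the paper.
SOLAR = [0, 0, 3, 5, 3, 0]   # forecast PV production per time step (kWh)
LOAD = [2, 2, 2, 2, 4, 4]    # observed load per time step (kWh)
CAPACITY = 6                 # battery capacity (kWh)
ACTIONS = (-2, 0, 2)         # discharge 2 kWh, stay idle, charge 2 kWh


def step(t, soc, action):
    """Apply a battery action at step t; return (next_soc, reward)."""
    new_soc = min(CAPACITY, max(0, soc + action))
    delta = new_soc - soc                      # energy actually moved into the battery
    grid = max(0, LOAD[t] + delta - SOLAR[t])  # shortfall drawn from the grid
    return new_soc, -grid                      # reward is negative grid energy


def train(episodes=5000, alpha=0.1, gamma=1.0, epsilon=0.1, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration over (step, charge) states."""
    rng = random.Random(seed)
    q = {}  # maps (t, soc, action) to the estimated return
    for _ in range(episodes):
        soc = 0
        for t in range(len(LOAD)):
            if rng.random() < epsilon:
                a = rng.choice(ACTIONS)
            else:
                a = max(ACTIONS, key=lambda x: q.get((t, soc, x), 0.0))
            nxt, r = step(t, soc, a)
            future = 0.0
            if t + 1 < len(LOAD):
                future = max(q.get((t + 1, nxt, x), 0.0) for x in ACTIONS)
            old = q.get((t, soc, a), 0.0)
            q[(t, soc, a)] = old + alpha * (r + gamma * future - old)
            soc = nxt
    return q


def grid_energy(q=None):
    """Total grid energy over the horizon: greedy in q, or battery idle if q is None."""
    soc, total = 0, 0
    for t in range(len(LOAD)):
        a = 0 if q is None else max(ACTIONS, key=lambda x: q.get((t, soc, x), 0.0))
        soc, r = step(t, soc, a)
        total -= r
    return total
```

Calling `grid_energy(train())` gives the grid energy drawn under the learned greedy schedule, which can be compared with `grid_energy()`, the idle-battery baseline (9 kWh for these illustrative profiles).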

Multi-agent cooperation Q-learning algorithm based on constrained Markov Game

Computer Science and Information Systems ◽

10.2298/csis191220009g ◽

2020 ◽

Vol 17 (2) ◽

pp. 647-664

Author(s):

Yangyang Ge ◽

Fei Zhu ◽

Wei Huang ◽

Peiyao Zhao ◽

Quan Liu

Keyword(s):

Reinforcement Learning ◽

Learning Algorithm ◽

Multi Agent System ◽

Agent System ◽

Action Function ◽

Q Learning ◽

State Action ◽

Markov Game ◽

Safety Constraints ◽

Multi Agent

Multi-agent systems have broad real-world applications, yet their safety is rarely considered. Reinforcement learning is one of the most important methods for solving multi-agent problems. Progress has been made in applying multi-agent reinforcement learning to robot systems, man-machine matches, automation, and similar areas. In these areas, however, an agent may fall into unsafe states in which it has difficulty bypassing obstacles, receiving information from other agents, and so on. Ensuring the safety of a multi-agent system is therefore of great importance, because such dangerous states may be irreversible and cause great damage. To solve the safety problem, this paper introduces a Multi-Agent Cooperation Q-Learning Algorithm based on a Constrained Markov Game. In this method, safety constraints are added to the set of actions, and each agent, when interacting with the environment to search for optimal values, is restricted by the safety rules so as to obtain an optimal policy that satisfies the safety requirements. Since traditional multi-agent reinforcement learning algorithms are no longer suitable for the proposed model, a new solution is introduced for calculating the globally optimal state-action function that satisfies the safety constraints. The Lagrange multiplier method is used to determine the optimal action that can be performed in the current state, on the premise that the constraint functions are linearized and that the state-action function and the constraint function are both differentiable. This not only improves the efficiency and accuracy of the algorithm but also guarantees that the global optimal solution is obtained. Experiments verify the effectiveness of the algorithm.
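
The idea of restricting each agent's choice by safety rules can be sketched with simple action masking: the greedy action is taken only among actions a safety predicate accepts. This is a deliberately simpler technique than the Lagrange-multiplier solution the abstract describes, and the names `safe_greedy_action`, `q_values`, and `is_safe` are hypothetical.

```python
def safe_greedy_action(q_values, is_safe):
    """Return the highest-valued action among those flagged safe, or None if none is safe.

    q_values: dict mapping each action to its estimated value in the current state.
    is_safe:  predicate returning True for actions that satisfy the safety rules.
    """
    safe = [a for a in q_values if is_safe(a)]
    if not safe:
        return None  # no admissible action in this state
    return max(safe, key=lambda a: q_values[a])
```

For example, with values `{"left": 1.0, "forward": 5.0, "right": 2.0}` and a rule that forbids `"forward"`, the function returns `"right"` even though `"forward"` has the highest estimated value.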

Multiagent reinforcement learning using Non-Parametric Approximation

Respuestas ◽

10.22463/0122820x.1738 ◽

2018 ◽

Vol 23 (2) ◽

pp. 53-61

Author(s):

David Luviano Cruz ◽

Francesco José García Luna ◽

Luis Asunción Pérez Domínguez

Keyword(s):

Reinforcement Learning ◽

Hybrid Control ◽

Learning Algorithm ◽

Multi Agent Systems ◽

Generation Task ◽

Q Learning ◽

Agent Systems ◽

Multi Agent ◽

Optimal Set ◽

Parametric Approximation

This paper presents a hybrid control proposal for multi-agent systems that exploits the advantages of reinforcement learning and nonparametric functions. A modified version of the Q-learning algorithm provides training data for a kernel, and this approach yields a suboptimal set of actions to be used by the agents. The proposed algorithm is experimentally tested on a path generation task for mobile robots in an unknown environment.
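
One way to picture how samples produced by Q-learning could train a kernel is a Nadaraya-Watson estimator, which predicts a value at an unseen state as a kernel-weighted average of the sampled values. The abstract does not say which kernel or estimator is used, so the Gaussian kernel, the scalar state, and the name `kernel_estimate` are assumptions made for this sketch.

```python
import math


def kernel_estimate(samples, query, bandwidth=1.0):
    """Nadaraya-Watson estimate at `query` from (x, y) samples using a Gaussian kernel.

    Assumes at least one sample lies close enough to `query` to get a nonzero weight.
    """
    weights = [math.exp(-((x - query) ** 2) / (2 * bandwidth ** 2)) for x, _ in samples]
    total = sum(weights)
    return sum(w * y for w, (_, y) in zip(weights, samples)) / total
```

Here `samples` would hold pairs such as (state, learned value) collected during Q-learning, and a smaller `bandwidth` makes the estimate follow nearby samples more closely.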

Jamming-Resilient Wideband Cognitive Radios with Multi-Agent Reinforcement Learning

International Journal of Software Science and Computational Intelligence ◽

10.4018/ijssci.2018070101 ◽

2018 ◽

Vol 10 (3) ◽

pp. 1-23 ◽

Cited By ~ 1

Author(s):

Mohamed A. Aref ◽

Sudharman K. Jayaweera

Keyword(s):

Learning Algorithm ◽

Cognitive Radios ◽

System Model ◽

Interference Avoidance ◽

Q Learning ◽

Selection Policy ◽

Cognitive Framework ◽

Multi Agent ◽

Simulation Results ◽

The Impact

This article presents a design of a wideband autonomous cognitive radio (WACR) for anti-jamming and interference avoidance. The proposed system model allows multiple WACRs to operate simultaneously over the same spectrum range, producing a multi-agent environment. The objective of each radio is to predict and evade a dynamic jammer signal and to avoid the transmissions of other WACRs. The proposed cognitive framework consists of two operations: sensing and transmission. Each operation is supported by its own learning algorithm based on Q-learning, but both experience the same RF environment. The simulation results indicate that the proposed cognitive anti-jamming technique has low computational complexity and significantly outperforms a non-cognitive sub-band selection policy while being sufficiently robust against the impact of sensing errors.

An Entanglement-Inspired Action Selection and Knowledge Sharing Scheme for Cooperative Multi-agent Q-Learning Algorithm used in Robot Navigation

2020 10th International Conference on Computer and Knowledge Engineering (ICCKE) ◽

10.1109/iccke50421.2020.9303636 ◽

2020 ◽

Author(s):

Mohammad Hasan Karami ◽

Hossein Aghababa ◽

Amir Hosein Keyhanipour

Keyword(s):

Knowledge Sharing ◽

Learning Algorithm ◽

Robot Navigation ◽

Action Selection ◽

Q Learning ◽

Sharing Scheme ◽

Multi Agent

A Research on Regional Penetration Channel of Multi-Agent UAVs based on Improved Q-Learning Algorithm

2021 IEEE 5th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC) ◽

10.1109/iaeac50856.2021.9391089 ◽

2021 ◽

Author(s):

Fuyao Zhang ◽

Anchao Cheng ◽

Qilin Ding ◽

Yihui Zhou

Keyword(s):

Learning Algorithm ◽

Q Learning ◽

Multi Agent

Statistics Based Q-learning Algorithm for Multi-Agent System and Application in RoboCup

Journal of Software ◽

10.4304/jsw.9.3.634-640 ◽

2014 ◽

Vol 9 (3) ◽

Author(s):

Ya Xie ◽

Zhonghua Huang

Keyword(s):

Learning Algorithm ◽

Multi Agent System ◽

Agent System ◽

Q Learning ◽

Multi Agent

Q-learning algorithm based multi-agent coordinated control method for microgrids

2015 9th International Conference on Power Electronics and ECCE Asia (ICPE-ECCE Asia) ◽

10.1109/icpe.2015.7167977 ◽

2015 ◽

Cited By ~ 4

Author(s):

Yuanyuan Xi ◽

Liuchen Chang ◽

Meiqin Mao ◽

Peng Jin ◽

Nikos Hatziargyriou ◽

...

Keyword(s):

Control Method ◽

Learning Algorithm ◽

Coordinated Control ◽

Q Learning ◽

Multi Agent

Implementation of Seamless Switching of Micro-Grid Operation Mode Based on Multi-Agent System

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.448-453.2583 ◽

2013 ◽

Vol 448-453 ◽

pp. 2583-2589

Author(s):

Zhi Wen Liu ◽

Wen Bo Xia

Keyword(s):

Control System ◽

Operation Mode ◽

Control Level ◽

Switching Control ◽

Mode Switching ◽

Short Time Scale ◽

Operation Modes ◽

Multi Agent ◽

Micro Grid ◽

Grid Operation

Switching control of micro-grid operation modes belongs to the short-time-scale control level. Based on an analysis of the requirements that operation mode switching places on the control system, this paper proposes a three-tier structure for a micro-grid energy management system suited to switching control of operation modes. It also builds a micro-grid central control system based on multi-agent technology, aimed at coordinated control of operation mode switching, which effectively improves the implementation of the switching control strategy and plays an important role in achieving seamless switching of micro-grid operation modes.
