A Scalable Privacy-Preserving Multi-agent Deep Reinforcement Learning Approach for Large-Scale Peer-to-Peer Transactive Energy Trading

As more prosumers are equipped with distributed energy resources (DER), advanced energy management has become increasingly important. To this end, integrating demand-side DER into the electricity market is a trend for future smart grids. The double-side auction (DA) market is viewed as a promising peer-to-peer (P2P) energy trading mechanism that enables interactions among prosumers in a distributed manner. To maximize profit in a dynamic electricity market, prosumers act as price makers and simultaneously optimize their operations and trading strategies. However, the traditional DA market is difficult to model explicitly because of its complex clearing algorithm and the stochastic bidding behaviors of the participants. For this reason, in this paper we model this task as a multi-agent reinforcement learning (MARL) problem and propose an algorithm called DA-MADDPG, a modification of MADDPG in which each agent's critic abstracts the other agents' observations and actions through the DA market's public information. The experiments show that 1) prosumers obtain more economic benefit from P2P energy trading than from trading independently with the utility company in the conventional electricity market; and 2) DA-MADDPG performs better than the traditional Zero Intelligence (ZI) strategy and other MARL algorithms, e.g., IQL, IDDPG, IPPO and MADDPG.
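
To make the double-side auction idea concrete, here is a minimal sketch of one possible clearing rule. The abstract does not state which clearing algorithm the paper uses, so everything below is an assumption for illustration: the function name `clear_double_auction`, the `(price, quantity)` order format, and the choice to set the clearing price at the midpoint of the last matched bid and ask.

```python
from typing import List, Optional, Tuple

Order = Tuple[float, float]  # (price, quantity); hypothetical order format


def clear_double_auction(bids: List[Order],
                         asks: List[Order]) -> Tuple[Optional[float], float]:
    """Match buy bids against sell asks in price priority.

    Assumed rule (not taken from the paper): the highest bids are matched
    with the lowest asks while bid price >= ask price, and the clearing
    price is the midpoint of the last matched pair.
    Returns (clearing_price, traded_quantity); price is None if no trade.
    """
    buy = sorted(bids, key=lambda b: -b[0])   # highest bid first
    sell = sorted(asks, key=lambda a: a[0])   # lowest ask first
    i = j = 0
    traded = 0.0
    price: Optional[float] = None
    buy_left = buy[0][1] if buy else 0.0
    sell_left = sell[0][1] if sell else 0.0

    while i < len(buy) and j < len(sell) and buy[i][0] >= sell[j][0]:
        q = min(buy_left, sell_left)
        traded += q
        price = 0.5 * (buy[i][0] + sell[j][0])
        buy_left -= q
        sell_left -= q
        if buy_left <= 1e-12:                 # this bid is fully filled
            i += 1
            buy_left = buy[i][1] if i < len(buy) else 0.0
        if sell_left <= 1e-12:                # this ask is fully filled
            j += 1
            sell_left = sell[j][1] if j < len(sell) else 0.0
    return price, traded


if __name__ == "__main__":
    bids = [(0.30, 2.0), (0.20, 1.0)]
    asks = [(0.10, 1.5), (0.25, 2.0)]
    print(clear_double_auction(bids, asks))
```

Under this assumed rule, the resulting clearing price and traded volume are the kind of public market information that each agent's critic could observe in place of the other agents' private observations and actions.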

Peer-to-Peer Energy Trading and Energy Conversion in Interconnected Multi-Energy Microgrids Using Multi-Agent Deep Reinforcement Learning

IEEE Transactions on Smart Grid, DOI 10.1109/tsg.2021.3124465, 2021, pp. 1-1

Author(s): Tianyi Chen, Shengrong Bu, Xue Liu, Jikun Kang, F. Richard Yu, ...

Keyword(s): Reinforcement Learning, Energy Conversion, Peer To Peer, Energy Trading, Multi Agent

Scalable coordinated management of peer-to-peer energy trading: A multi-cluster deep reinforcement learning approach

Applied Energy, DOI 10.1016/j.apenergy.2021.116940, 2021, Vol 292, pp. 116940

Author(s): Dawei Qiu, Yujian Ye, Dimitrios Papadaskalopoulos, Goran Strbac

Keyword(s): Reinforcement Learning, Peer To Peer, Learning Approach, Energy Trading

Dynamic Dispatching for Large-Scale Heterogeneous Fleet via Multi-agent Deep Reinforcement Learning

2020 IEEE International Conference on Big Data (Big Data), DOI 10.1109/bigdata50022.2020.9378191, 2020

Author(s): Chi Zhang, Philip Odonkor, Shuai Zheng, Hamed Khorasgani, Susumu Serita, ...

Keyword(s): Reinforcement Learning, Large Scale, Heterogeneous Fleet, Multi Agent, Dynamic Dispatching

Adaptive Traffic Signal Control for large-scale scenario with Cooperative Group-based Multi-agent reinforcement learning

Transportation Research Part C Emerging Technologies, DOI 10.1016/j.trc.2021.103046, 2021, Vol 125, pp. 103046

Author(s): Tong Wang, Jiahua Cao, Azhar Hussain

Keyword(s): Reinforcement Learning, Large Scale, Traffic Signal, Signal Control, Traffic Signal Control, Cooperative Group, Adaptive Traffic Signal Control, Multi Agent

A Confrontation Decision-Making Method with Deep Reinforcement Learning and Knowledge Transfer for Multi-Agent System

Symmetry, DOI 10.3390/sym12040631, 2020, Vol 12 (4), pp. 631

Author(s): Chunyang Hu

Keyword(s): Decision Making, Reinforcement Learning, Knowledge Transfer, Large Scale, Effective Control, Small Scale, Learning Agent, Multi Agent, Transfer Method, Parameter Sharing

In this paper, deep reinforcement learning (DRL) and knowledge transfer are used to achieve effective control of the learning agent in confrontation scenarios in multi-agent systems. First, a multi-agent Deep Deterministic Policy Gradient (DDPG) algorithm with parameter sharing is proposed for multi-agent confrontation decision-making. During training, information about the other agents is fed into the critic network to improve the confrontation strategy. The parameter sharing mechanism can reduce the loss of experience storage. In the DDPG algorithm, we use four neural networks to generate the real-time action and the Q-value function, respectively, and use a momentum mechanism to optimize the training process and accelerate the convergence of the neural networks. Second, this paper introduces an auxiliary controller that uses a policy-based reinforcement learning (RL) method to assist the game agent's decision-making. In addition, an effective reward function is used to help agents balance enemy losses against their own. Furthermore, this paper uses the knowledge transfer method to extend the learning model to more complex scenes and improve the generalization of the proposed confrontation model. Two confrontation decision-making experiments are designed to verify the effectiveness of the proposed method. In a small-scale task scenario, the trained agent successfully learns to fight its competitors and achieves a good winning rate. For large-scale confrontation scenarios, the knowledge transfer method gradually improves the decision-making level of the learning agent.
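
As a rough illustration of two ideas in this abstract, the sketch below shows how a critic input might combine an agent's own data with information about other agents, and how target networks are commonly kept in step with online networks. It assumes the "four neural networks" are the standard DDPG set (actor, critic, and a target copy of each), which the abstract does not spell out. The names `critic_input` and `soft_update` and the value of `tau` are hypothetical; the paper's actual update rule and momentum mechanism are not described here.

```python
from typing import List, Sequence


def critic_input(own_obs: Sequence[float],
                 own_action: Sequence[float],
                 others_info: Sequence[Sequence[float]]) -> List[float]:
    """Build a flat critic input from the agent's own observation and
    action plus information about the other agents (assumed layout)."""
    flat = list(own_obs) + list(own_action)
    for info in others_info:
        flat.extend(info)
    return flat


def soft_update(target: Sequence[float],
                online: Sequence[float],
                tau: float = 0.01) -> List[float]:
    """Polyak averaging: move each target parameter a fraction tau
    toward the corresponding online parameter."""
    return [(1.0 - tau) * t + tau * o for t, o in zip(target, online)]


if __name__ == "__main__":
    x = critic_input([0.1, 0.2], [1.0], [[0.3, 0.0], [0.4, 1.0]])
    print(len(x))                                   # length of the critic input
    print(soft_update([0.0, 1.0], [1.0, 1.0], 0.5))
```

With parameter sharing, all agents would call the same actor and critic with their own inputs, so a single set of weights (and a single pair of target copies) is updated from every agent's experience.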

A Resource-Constrained and Privacy-Preserving Edge Computing Enabled Clinical Decision System: A Federated Reinforcement Learning Approach

IEEE Internet of Things Journal, DOI 10.1109/jiot.2021.3057653, 2021, pp. 1-1

Author(s): Zeyue Xue, Pan Zhou, Zichuan Xu, Xiumin Wang, Yulai Xie, ...

Keyword(s): Reinforcement Learning, Clinical Decision, Privacy Preserving, Edge Computing, Learning Approach, Resource Constrained, Decision System, System A, Clinical Decision System

ReLight-WCTM: Multi-Agent Reinforcement Learning Approach for Traffic Light Control within a Realistic Traffic Simulation

DOI 10.1109/tsp52935.2021.9522612, 2021

Author(s): Peter Palos, Arpad Huszak

Keyword(s): Reinforcement Learning, Traffic Simulation, Learning Approach, Light Control, Traffic Light, Traffic Light Control, Multi Agent

A Case Study in Hybrid Multi-threading and Hierarchical Reinforcement Learning Approach for Cooperative Multi-agent Systems

2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI), DOI 10.1109/micai.2015.20, 2015

Author(s): Hiram Ponce, Ricardo Padilla, Alan Davalos, Alvaro Herrasti, Cynthia Pichardo, ...

Keyword(s): Reinforcement Learning, Learning Approach, Multi Agent Systems, Agent Systems, Hierarchical Reinforcement Learning, Multi Agent
