2021 ◽  
Vol 2021 ◽  
pp. 1-10
Author(s):  
Chen Sun ◽  
Shiyi Wu ◽  
Bo Zhang

In future heterogeneous cellular networks with small cells, such as D2D and relay, interference coordination between macro cells and small cells should be addressed through effective resource allocation and power control. The two-step Stackelberg game is a widely used and feasible model for resource allocation and power control problem formulation. Both in the follower games for small cells and in the leader games for the macro cell, the cost parameters are a critical variable for the performance of Stackelberg game. Previous studies have failed to adequately address the optimization of cost parameters. This paper presents a reinforcement learning approach for effectively training cost parameters for better system performance. Furthermore, a two-stage pretraining plus ε -greedy algorithm is proposed to accelerate the convergence of reinforcement learning. The simulation results can demonstrate that compared with the three beachmarking algorithms, the proposed algorithm can enhance average throughput of all users and cellular users by up to 7% and 9.7%, respectively.


2021 ◽  
Vol 64 (8) ◽  
Author(s):  
Xinyi Wang ◽  
Zesong Fei ◽  
Jingxuan Huang ◽  
J. Andrew Zhang ◽  
Jinhong Yuan

Sign in / Sign up

Export Citation Format

Share Document