Deep Reinforcement Learning for Energy Saving in Radio Access Network

Author(s):  
Keran Zhang ◽  
Xiangming Wen ◽  
Yawen Chen ◽  
Zhaoming Lu
2020 ◽  
Author(s):  
hao jin ◽  
Wenzhe Pang ◽  
Chenglin Zhao

Abstract To support various service requirements such as massive Machine Type Communications, Ultra-Reliable and Low-Latency Communications in 5G scenario, Network Function Virtualization (NFV) plays an important role in the 5G network architecture to manage and orchestrate network services. As the key network function responsible for mobility management, Access and Mobility Management Function (AMF) can be deployed flexibly at the edge of the radio access network to improve the performance of mobility management based on NFV. In this paper, the optimal placement of AMF is addressed based on Deep Reinforcement Learning (DRL) in a heterogeneous radio access network, which aims to minimize the network utility including the average delay of mobility management requests at AMF, the average wired hops to relay the requests and the cost of AMF instances. By considering time-varying features including user mobility and the arrival rate of user mobility management requests, an AMF optimal placement approach is proposed for the long term optimization. Simulation results show that the performance of the proposed DRL based AMF optimal placement approach outperforms that of the baselines.


Author(s):  
Behnam Khodapanah ◽  
Ahmad Awada ◽  
Ingo Viering ◽  
Andre Noll Barreto ◽  
Meryem Simsek ◽  
...  

2021 ◽  
Author(s):  
Yi Shi ◽  
Parisa Rahimzadeh ◽  
Maice Costa ◽  
Tugba Erpek ◽  
Yalin E. Sagduyu

The paper presents a reinforcement learning solution to dynamic admission control and resource allocation for 5G radio access network (RAN) slicing requests, when the spectrum is potentially shared between 5G and an incumbent user such as in the Citizens Broadband Radio Service scenarios. Available communication resources (frequency-time resource blocks and transmit powers) and computational resources (processor power) not used by the incumbent user can be allocated to stochastic arrivals of network slicing requests. Each request arrives with priority (weight), throughput, computational resource, and latency (deadline) requirements. As online algorithms, the greedy and myopic solutions that do not consider heterogeneity of future requests and their arrival process become ineffective for network slicing. Therefore, reinforcement learning solutions (Q-learning and Deep Q-learning) are presented to maximize the network utility in terms of the total weight of granted network slicing requests over a time horizon, subject to communication and computational constraints. Results show that reinforcement learning provides improvements in the 5G network utility relative to myopic, greedy, random, and first come first served solutions. In particular, deep Q-learning reduces the complexity and allows practical implementation as the state-action space grows, and effectively admits/rejects requests when 5G needs to share the spectrum with incumbent users that may dynamically occupy some of the frequency-time blocks. Furthermore, the robustness of deep reinforcement learning is demonstrated in the presence of the misdetection/false alarm errors in detecting the incumbent user's activity.


IEEE Access ◽  
2020 ◽  
Vol 8 ◽  
pp. 68183-68198
Author(s):  
Yu Abiko ◽  
Takato Saito ◽  
Daizo Ikeda ◽  
Ken Ohta ◽  
Tadanori Mizuno ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document