Resource allocation and congestion control in clustered M2M communication using Q-learning

Machine-to-machine (M2M) communication has received increasing attention in recent years. An M2M network is characterized by a large number of machines/devices, low data rates, delay-tolerant or delay-sensitive traffic, small packets, energy constraints, and low or no mobility. A large number of M2M terminals may exist in a small area, with many simultaneously and randomly attempting to access channel resources, which results in overload and access problems. The increased signaling overhead and the diverse requirements of machine-type communication devices (MTCDs) call for flexible and efficient scheduling and random access techniques. In this thesis, we first review and compare various scheduling and random access techniques in LTE-based cellular networks for M2M communication, and discuss how well they fulfill the unique requirements of M2M communication and networking. Resource management in M2M networks with a large number of devices is also reviewed from the access point of view. We propose a multi-objective optimization-based solution to the problem of resource allocation in interference-limited M2M communication. We consider MTCDs in a clustered network structure, where the devices are divided into clusters and the devices belonging to a cluster communicate with a cluster head (or controller). We maximize the number of admitted MTCD controllers and the throughput while minimizing the interference caused to conventional primary users. We formulate the problem as a mixed-integer non-linear problem with multiple objectives and solve it using the mesh adaptive direct search (MADS) algorithm. Simulation results show the effects of varying different parameters on the cumulative throughput and the number of admitted MTCD controllers. We then formulate the slot selection problem in M2M networks with admitted MTCDs as an optimization problem.
We present a solution that uses the Q-learning algorithm to select conflict-free slot assignments in a random access network with MTCD controllers. The performance of the solution depends on parameters such as the learning rate and the reward. We thoroughly analyze the performance of the proposed algorithm with respect to the parameters that govern its operation. We also compare it with simple ALOHA and channel-based scheduled allocation, and show that the proposed Q-learning-based technique has a higher probability of assigning slots than these techniques. We then present a block-based Q-learning algorithm for scheduling MTCDs in clustered M2M communication networks. First, slot assignment is done centrally and an algorithm is proposed for minimizing the inter-cluster interference. Then we propose a Q-learning algorithm that assigns slots in a distributed manner, and compare the two schemes. Afterwards, we show how varying the signal-to-interference ratio (SIR) affects the convergence rate and convergence probability of distributed slot assignment. The cumulative distribution function is used to study the effect of various SIR threshold levels on the convergence probability. As the SIR threshold increases, the convergence time increases and the convergence probability decreases, because fewer block configurations fulfill the required threshold in the M2M network.
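The idea of learning a conflict-free slot assignment can be illustrated with a minimal sketch. This is not the thesis's algorithm: it assumes a stateless Q-learning update, a reward of +1 for a collision-free slot and -1 for a collision, and greedy slot selection with random tie-breaking. The function name `q_learning_slot_assignment` and all parameter names are hypothetical.

```python
import random


def q_learning_slot_assignment(num_controllers, num_slots, alpha=0.1,
                               reward=1.0, penalty=-1.0,
                               max_frames=5000, seed=0):
    """Illustrative stateless Q-learning for slot selection (assumed setup).

    Each controller keeps one Q-value per slot and, in every frame,
    transmits in the slot with the highest Q-value. Returns the tuple
    (assignment, frames), where assignment is None if no collision-free
    frame was observed within max_frames.
    """
    rng = random.Random(seed)
    # q[c][s] is controller c's current estimate of how good slot s is.
    q = [[0.0] * num_slots for _ in range(num_controllers)]

    for frame in range(1, max_frames + 1):
        # Greedy choice per controller, breaking ties at random.
        choices = []
        for c in range(num_controllers):
            best = max(q[c])
            candidates = [s for s in range(num_slots) if q[c][s] == best]
            choices.append(rng.choice(candidates))

        # Count how many controllers picked each slot in this frame.
        counts = [0] * num_slots
        for s in choices:
            counts[s] += 1

        # Assumed reward model: +1 if alone in the slot, -1 on collision.
        # Update rule: Q <- Q + alpha * (r - Q), with learning rate alpha.
        for c, s in enumerate(choices):
            r = reward if counts[s] == 1 else penalty
            q[c][s] += alpha * (r - q[c][s])

        # A frame without any collision is a conflict-free assignment.
        if all(counts[s] == 1 for s in choices):
            return choices, frame

    return None, max_frames


if __name__ == "__main__":
    assignment, frames = q_learning_slot_assignment(4, 8)
    print("assignment:", assignment, "frames used:", frames)
```

In this sketch, `alpha`, `reward`, and `penalty` stand in for the learning rate and reward parameters that the abstract says the performance depends on; changing them changes how quickly a controller abandons a contested slot.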


Resource allocation in clustered M2M networks: a Q-learning approach

10.32920/ryerson.14649174 ◽

2021 ◽

Author(s):

Fatima Hussain

Keyword(s):

Resource Allocation ◽

Learning Algorithm ◽

Random Access ◽

Cumulative Distribution ◽

Convergence Time ◽

Mixed Integer ◽

M2m Communication ◽

Q Learning ◽

Threshold Levels ◽

Slot Assignment



Combining scheduling and congestion control for fair resource allocation in WLAN

Journal of Computer Applications ◽

10.3724/sp.j.1087.2009.00487 ◽

2009 ◽

Vol 29 (2) ◽

pp. 487-490 ◽

Cited By ~ 1

Author(s):

Li YU ◽

Zi-bo SHI ◽

Yan-tai SHU ◽

Mao-de MA

Keyword(s):

Resource Allocation ◽

Congestion Control


A Q-learning based Resource Allocation for Downlink Non-Orthogonal Multiple Access Systems Considering QoS

IEEE Access ◽

10.1109/access.2021.3080283 ◽

2021 ◽

pp. 1-1

Author(s):

Qi Zhai ◽

Miodrag Bolic ◽

Yong Li ◽

Wei Cheng ◽

Chenxi Liu

Keyword(s):

Resource Allocation ◽

Multiple Access ◽

Q Learning


Priority-based Joint Resource Allocation with Deep Q-Learning for Heterogeneous NOMA Systems

IEEE Access ◽

10.1109/access.2021.3065314 ◽

2021 ◽

pp. 1-1

Author(s):

Sifat Rezwan ◽

Wooyeol Choi

Keyword(s):

Resource Allocation ◽

Q Learning ◽

Joint Resource Allocation


Dynamic Resource Allocation Based on Q-learning for VNE in Fiber-Wireless (FiWi) Access Network

Proceedings of the International Conference on Graphics and Signal Processing - ICGSP '17 ◽

10.1145/3121360.3121381 ◽

2017 ◽

Author(s):

QingHai Ou ◽

Honghao Zhao ◽

Yuepian Ye ◽

Xiaohui Yu ◽

Zhu Liu ◽

...

Keyword(s):

Resource Allocation ◽

Access Network ◽

Dynamic Resource Allocation ◽

Q Learning ◽

Dynamic Resource


Continuous Q-Learning Resource Allocation Network

ICANN 98 - Perspectives in Neural Computing ◽

10.1007/978-1-4471-1599-1_68 ◽

1998 ◽

pp. 455-460 ◽

Cited By ~ 1

Author(s):

W. Ilg ◽

K.-U. Scholl

Keyword(s):

Resource Allocation ◽

Learning Resource ◽

Q Learning


Energy Efficiency Optimization-Based Joint Resource Allocation and Clustering Algorithm for M2M Communication Networks (Workshop)

Communications and Networking - Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering ◽

10.1007/978-3-030-41117-6_29 ◽

2020 ◽

pp. 351-363

Author(s):

Changzhu Liu ◽

Ahmad Zubair ◽

Rong Chai ◽

Qianbin Chen

Keyword(s):

Resource Allocation ◽

Energy Efficiency ◽

Communication Networks ◽

Clustering Algorithm ◽

M2m Communication ◽

Efficiency Optimization ◽

Joint Resource Allocation


A Resource Allocation Algorithm for Ultra-Dense Networks Based on Deep Reinforcement Learning

International Journal of Computers Communications & Control ◽

10.15837/ijccc.2021.2.4189 ◽

2021 ◽

Vol 16 (2) ◽

Author(s):

Huashuai Zhang ◽

Tingmei Wang ◽

Haiwei Shen

Keyword(s):

Resource Allocation ◽

Reinforcement Learning ◽

Data Traffic ◽

Wireless Data ◽

Resource Allocation Algorithm ◽

Allocation Algorithm ◽

Q Learning ◽

Dense Networks ◽

Target Network ◽

Wireless Resource Allocation

Resource optimization in ultra-dense networks (UDNs) is critical to meeting users' huge demand for wireless data traffic. However, mainstream optimization algorithms suffer from problems such as poor optimization performance and high computational load. This paper proposes a wireless resource allocation algorithm based on deep reinforcement learning (DRL), which aims to maximize the total throughput of the entire network by transforming the resource allocation problem into a deep Q-learning process. To allocate wireless resources in UDNs more efficiently, the authors adopt a deep Q-network (DQN) resource allocation strategy and employ experience replay and a target network to overcome the instability and divergence of the results caused by the previous network state, and to address the overestimation of the Q value. Simulation results show that the proposed algorithm can maximize the total throughput of the network while making the network more energy-efficient and stable. Introducing DRL into UDN resource allocation research is therefore worthwhile.
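
For orientation, the textbook DQN training objective that a replay memory and a target network refer to can be written as follows. This is the standard form, not necessarily the exact loss used in the paper; the symbols $\theta$ (online network parameters), $\theta^{-}$ (target network parameters), $\gamma$ (discount factor), and $\mathcal{D}$ (replay memory) are notation assumed here.

```latex
y = r + \gamma \max_{a'} Q(s', a'; \theta^{-}),
\qquad
L(\theta) = \mathbb{E}_{(s, a, r, s') \sim \mathcal{D}}
  \left[ \left( y - Q(s, a; \theta) \right)^{2} \right]
```

Transitions $(s, a, r, s')$ are sampled from $\mathcal{D}$ rather than used in the order they were observed, and $\theta^{-}$ is held fixed between periodic copies from $\theta$, so the regression target $y$ does not shift with every gradient step.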
