Deep Reinforcement Learning-Based Long Short-Term Memory for Satellite IoT Channel Allocation

2022 ◽ Vol 33 (1) ◽ pp. 1-19
Author(s): S. Lakshmi Durga ◽ Ch. Rajeshwari ◽ Khalid Hamed Allehaibi ◽ Nishu Gupta ◽ Nasser Nammas Albaqami ◽ ...
2019 ◽ Vol 1 (2) ◽ pp. 74-84
Author(s): Evan Kusuma Susanto ◽ Yosi Kristian

Asynchronous Advantage Actor-Critic (A3C) is a deep reinforcement learning algorithm developed by Google DeepMind. It can be used to build an artificial intelligence architecture that masters a variety of different games through trial and error, learning from the game's screen output and the score resulting from its actions, without human intervention. An A3C network consists of a Convolutional Neural Network (CNN) at the front, a Long Short-Term Memory network (LSTM) in the middle, and an Actor-Critic network at the back. The CNN summarizes the screen output image by extracting its important features. The LSTM serves as a memory of previous game states. The Actor-Critic network determines the best action to take when faced with a given situation. Experimental results show that this method is quite effective and can beat novice players in the five games used as test cases.
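For orientation, below is a minimal PyTorch sketch of the CNN → LSTM → Actor-Critic layout described above: the CNN summarizes the screen, the LSTM cell carries the game state between frames, and separate heads output the policy and the state value. The `A3CNetwork` name, the 84×84 grayscale input, and all layer sizes are illustrative assumptions, not taken from the paper.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class A3CNetwork(nn.Module):
    """Illustrative A3C-style network: CNN front end -> LSTM memory -> actor/critic heads."""
    def __init__(self, num_actions, in_channels=1):
        super().__init__()
        # CNN summarizes the raw screen image into a feature vector
        self.conv1 = nn.Conv2d(in_channels, 32, kernel_size=8, stride=4)
        self.conv2 = nn.Conv2d(32, 64, kernel_size=4, stride=2)
        self.conv3 = nn.Conv2d(64, 64, kernel_size=3, stride=1)
        # With an 84x84 input, the flattened feature size is 64 * 7 * 7 = 3136
        self.lstm = nn.LSTMCell(64 * 7 * 7, 256)
        # Actor head: policy over actions; critic head: scalar state value
        self.policy = nn.Linear(256, num_actions)
        self.value = nn.Linear(256, 1)

    def forward(self, screen, hidden):
        x = F.relu(self.conv1(screen))
        x = F.relu(self.conv2(x))
        x = F.relu(self.conv3(x))
        x = x.flatten(start_dim=1)
        h, c = self.lstm(x, hidden)        # LSTM remembers previous game states
        return F.softmax(self.policy(h), dim=-1), self.value(h), (h, c)

# Example forward pass: one 84x84 grayscale frame, 6 possible actions
net = A3CNetwork(num_actions=6)
h0 = (torch.zeros(1, 256), torch.zeros(1, 256))
probs, value, h1 = net(torch.zeros(1, 1, 84, 84), h0)
```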


2020 ◽ Vol 271 ◽ pp. 114945
Author(s): Xiangyu Kong ◽ Deqian Kong ◽ Jingtao Yao ◽ Linquan Bai ◽ Jie Xiao

2021 ◽ Vol 9
Author(s): R. Lakshmana Kumar ◽ Firoz Khan ◽ Sadia Din ◽ Shahab S. Band ◽ Amir Mosavi ◽ ...

Detection and prediction of the novel Coronavirus present new challenges for the medical research community due to its spread across the globe. Methods driven by Artificial Intelligence can help predict specific parameters, hazards, and outcomes of such a pandemic. Recently, deep learning-based approaches have offered a novel opportunity to address various difficulties in prediction. In this work, two learning algorithms, namely deep learning and reinforcement learning, were developed to forecast COVID-19. This article constructs a model using Recurrent Neural Networks (RNN), particularly the Modified Long Short-Term Memory (MLSTM) model, to forecast the counts of newly affected individuals, deaths, and recoveries over the following few days. This study also applies deep reinforcement learning to optimize COVID-19's predictive outcome based on symptoms. Real-world data was used to evaluate the performance of the suggested system. The findings show that the proposed approach is promising for prognosticating outcomes of the current COVID-19 pandemic, and it outperformed the Long Short-Term Memory (LSTM) model and the machine learning model Logistic Regression (LR) in terms of error rate.
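The details of the paper's Modified LSTM are not given in this abstract, so the following is only a hedged sketch of a plain LSTM forecaster over daily counts of new cases, deaths, and recoveries. The `CaseCountForecaster` name, the 30-day history window, the 7-day forecast horizon, and all layer sizes are assumptions for illustration, not the authors' MLSTM.

```python
import torch
import torch.nn as nn

class CaseCountForecaster(nn.Module):
    """Plain LSTM forecaster for daily (new cases, deaths, recoveries);
    a stand-in for the paper's MLSTM, whose details are not specified here."""
    def __init__(self, n_features=3, hidden=64, horizon=7):
        super().__init__()
        self.lstm = nn.LSTM(n_features, hidden, batch_first=True)
        self.head = nn.Linear(hidden, horizon * n_features)
        self.horizon, self.n_features = horizon, n_features

    def forward(self, history):
        # history: (batch, past_days, n_features)
        _, (h, _) = self.lstm(history)
        out = self.head(h[-1])                       # forecast from the final hidden state
        return out.view(-1, self.horizon, self.n_features)

# Example: forecast 7 days ahead from 30 days of (cases, deaths, recoveries)
model = CaseCountForecaster()
forecast = model(torch.zeros(1, 30, 3))              # shape (1, 7, 3)
```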


Complexity ◽ 2020 ◽ Vol 2020 ◽ pp. 1-11
Author(s): Zifeng Ye ◽ Yonghua Wang ◽ Pin Wan

Efficient spectrum resource management in cognitive radio networks (CRNs) is a promising way to improve the utilization of spectrum resources. In particular, power control and channel allocation are top priorities in spectrum resource management. Nevertheless, the joint design of power control and channel allocation is an NP-hard problem, and research on it is still at a preliminary stage. In this paper, we propose a novel joint approach based on a long short-term memory deep Q network (LSTM-DQN). Our objective is to obtain the channel allocation schemes of the access points (APs) and the power control strategies of the secondary users (SUs). Specifically, the received signal strength indicator (RSSI) information collected by the microbase stations is used as the input of the LSTM-DQN. In this way, the collected RSSI can be shared between users. After training is completed, the APs are capable of selecting channels with little interference, while the SUs may access the authorized channels in an underlay operation mode without any knowledge of the primary users (PUs). Experimental results show that channels are allocated to the APs with a lower probability of collision. Moreover, the SUs can adjust their power control strategies quickly to avoid harmful interference to the PUs when the environment parameters change randomly. Consequently, the overall performance of CRNs and the utilization of spectrum resources are improved significantly compared to existing popular solutions.
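As a rough illustration of the LSTM-DQN idea, the sketch below encodes a sequence of RSSI reports with an LSTM and maps the final hidden state to Q-values over joint (channel, power level) actions. The `LSTMQNetwork` name, all dimensions, and the action encoding are assumptions for illustration, not the authors' design.

```python
import torch
import torch.nn as nn

class LSTMQNetwork(nn.Module):
    """Sketch of an LSTM-DQN: an RSSI time series is encoded by an LSTM and
    mapped to Q-values over joint (channel, power level) actions."""
    def __init__(self, rssi_dim=16, hidden=128, n_channels=8, n_power_levels=4):
        super().__init__()
        self.lstm = nn.LSTM(rssi_dim, hidden, batch_first=True)
        self.q_head = nn.Linear(hidden, n_channels * n_power_levels)

    def forward(self, rssi_seq):
        # rssi_seq: (batch, time_steps, rssi_dim) -- RSSI reports from microbase stations
        _, (h, _) = self.lstm(rssi_seq)
        return self.q_head(h[-1])          # one Q-value per joint channel/power action

# Greedy action selection over 8 channels x 4 power levels from 10 RSSI reports
q_net = LSTMQNetwork()
q_values = q_net(torch.zeros(1, 10, 16))
action = q_values.argmax(dim=-1)           # decode: channel = action // 4, power = action % 4
```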

