Actor-Critic Reinforcement Learning with Neural Networks in Continuous Games

Location- and Person-Independent Activity Recognition with WiFi, Deep Neural Networks, and Reinforcement Learning

ACM Transactions on Internet of Things ◽

10.1145/3424739 ◽

2021 ◽

Vol 2 (1) ◽

pp. 1-25

Author(s):

Yongsen Ma ◽

Sheheryar Arshad ◽

Swetha Muniraju ◽

Eric Torkildson ◽

Enrico Rantala ◽

...

Keyword(s):

Neural Network ◽

Neural Networks ◽

Reinforcement Learning ◽

Activity Recognition ◽

Deep Neural Networks ◽

State Machine ◽

Recognition Algorithm ◽

The State ◽

Neural Architecture ◽

Learning Agent

In recent years, Channel State Information (CSI) measured by WiFi is widely used for human activity recognition. In this article, we propose a deep learning design for location- and person-independent activity recognition with WiFi. The proposed design consists of three Deep Neural Networks (DNNs): a 2D Convolutional Neural Network (CNN) as the recognition algorithm, a 1D CNN as the state machine, and a reinforcement learning agent for neural architecture search. The recognition algorithm learns location- and person-independent features from different perspectives of CSI data. The state machine learns temporal dependency information from history classification results. The reinforcement learning agent optimizes the neural architecture of the recognition algorithm using a Recurrent Neural Network (RNN) with Long Short-Term Memory (LSTM). The proposed design is evaluated in a lab environment with different WiFi device locations, antenna orientations, sitting/standing/walking locations/orientations, and multiple persons. The proposed design has 97% average accuracy when testing devices and persons are not seen during training. The proposed design is also evaluated by two public datasets with accuracy of 80% and 83%. The proposed design needs very little human efforts for ground truth labeling, feature engineering, signal processing, and tuning of learning parameters and hyperparameters.

Download Full-text

Cascade Attribute Network: Decomposing Reinforcement Learning Control Policies using Hierarchical Neural Networks

IFAC-PapersOnLine ◽

10.1016/j.ifacol.2020.12.2317 ◽

2020 ◽

Vol 53 (2) ◽

pp. 8181-8186

Author(s):

Haonan Chang ◽

Zhuo Xu ◽

Masayoshi Tomizuka

Keyword(s):

Neural Networks ◽

Reinforcement Learning ◽

Learning Control ◽

Control Policies ◽

Hierarchical Neural Networks

Download Full-text

Cancer Diagnosis Based on Combination of Artificial Neural Networks and Reinforcement Learning

2020 6th Iranian Conference on Signal Processing and Intelligent Systems (ICSPIS) ◽

10.1109/icspis51611.2020.9349530 ◽

2020 ◽

Author(s):

Amir Toranj Simin ◽

Seyed Mohsen Ghorabi Baygi ◽

Amin Noori

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

Reinforcement Learning ◽

Cancer Diagnosis ◽

Artificial Neural

Download Full-text

R3L: Connecting Deep Reinforcement Learning To Recurrent Neural Networks For Image Denoising Via Residual Recovery

10.1109/icip42928.2021.9506323 ◽

2021 ◽

Author(s):

Rongkai Zhang ◽

Jiang Zhu ◽

Zhiyuan Zha ◽

Justin Dauwels ◽

Bihan Wen

Keyword(s):

Neural Networks ◽

Reinforcement Learning ◽

Image Denoising ◽

Recurrent Neural Networks

Download Full-text

Cognitive Control Using Adaptive RBF Neural Networks and Reinforcement Learning for Networked Control System Subject to Time-Varying Delay and Packet Losses

Arabian Journal for Science and Engineering ◽

10.1007/s13369-021-05752-y ◽

2021 ◽

Author(s):

Shuti Wang ◽

Xunhe Yin ◽

Peng Li ◽

Yanxin Zhang ◽

Xin Wang ◽

...

Keyword(s):

Neural Networks ◽

Control System ◽

Reinforcement Learning ◽

Cognitive Control ◽

Networked Control System ◽

Time Varying ◽

Rbf Neural Networks ◽

Packet Losses ◽

Time Varying Delay ◽

Varying Delay

Download Full-text

Diversity oriented Deep Reinforcement Learning for targeted molecule generation

Journal of Cheminformatics ◽

10.1186/s13321-021-00498-z ◽

2021 ◽

Vol 13 (1) ◽

Author(s):

Tiago Pereira ◽

Maryam Abbasi ◽

Bernardete Ribeiro ◽

Joel P. Arrais

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Reinforcement Learning ◽

Deep Neural Networks ◽

Chemical Space ◽

Biological Properties ◽

Training Process ◽

Training Strategy ◽

Inhibitory Power ◽

Exploratory Strategy

AbstractIn this work, we explore the potential of deep learning to streamline the process of identifying new potential drugs through the computational generation of molecules with interesting biological properties. Two deep neural networks compose our targeted generation framework: the Generator, which is trained to learn the building rules of valid molecules employing SMILES strings notation, and the Predictor which evaluates the newly generated compounds by predicting their affinity for the desired target. Then, the Generator is optimized through Reinforcement Learning to produce molecules with bespoken properties. The innovation of this approach is the exploratory strategy applied during the reinforcement training process that seeks to add novelty to the generated compounds. This training strategy employs two Generators interchangeably to sample new SMILES: the initially trained model that will remain fixed and a copy of the previous one that will be updated during the training to uncover the most promising molecules. The evolution of the reward assigned by the Predictor determines how often each one is employed to select the next token of the molecule. This strategy establishes a compromise between the need to acquire more information about the chemical space and the need to sample new molecules, with the experience gained so far. To demonstrate the effectiveness of the method, the Generator is trained to design molecules with an optimized coefficient of partition and also high inhibitory power against the Adenosine $$A_{2A}$$ A 2 A and $$\kappa$$ κ opioid receptors. The results reveal that the model can effectively adjust the newly generated molecules towards the wanted direction. More importantly, it was possible to find promising sets of unique and diverse molecules, which was the main purpose of the newly implemented strategy.

Download Full-text

Modular Neural Networks for Reinforcement Learning with Temporal Intrinsic Rewards

2007 International Joint Conference on Neural Networks ◽

10.1109/ijcnn.2007.4371120 ◽

2007 ◽

Cited By ~ 5

Author(s):

Johane Takeuchi ◽

Osamu Shouno ◽

Hiroshi Tsujino

Keyword(s):

Neural Networks ◽

Reinforcement Learning ◽

Intrinsic Rewards ◽

Modular Neural Networks

Download Full-text

Late Breaking Results: Reinforcement Learning for Scalable Logic Optimization with Graph Neural Networks

10.1109/dac18074.2021.9586206 ◽

2021 ◽

Author(s):

Xavier Timoneda ◽

Lukas Cavigelli

Keyword(s):

Neural Networks ◽

Reinforcement Learning ◽

Logic Optimization ◽

Graph Neural Networks

Download Full-text

Towards Anomaly-resistant Graph Neural Networks via Reinforcement Learning

10.1145/3459637.3482203 ◽

2021 ◽

Author(s):

Kaize Ding ◽

Xuan Shan ◽

Huan Liu

Keyword(s):

Neural Networks ◽

Reinforcement Learning ◽

Graph Neural Networks

Download Full-text

The synthesis method of regulators for multichannel systems using neural networks

Вычислительные технологии ◽

10.25743/ict.2020.25.3.012 ◽

2020 ◽

pp. 111-118

Author(s):

Александр Александрович Воевода ◽

Дмитрий Олегович Романников

Keyword(s):

Neural Networks ◽

Reinforcement Learning ◽

Automatic Control ◽

Control Systems ◽

Synthesis Method ◽

Neural Net ◽

Initial State ◽

Set Point ◽

Automatic Control Systems ◽

Policy Gradient

Синтез регуляторов для многоканальных систем - актуальная и сложная задача. Одним из возможных способов синтеза является применение нейронных сетей. Нейронный регулятор либо обучают на предварительно рассчитанных данных, либо используют для настройки параметров ПИД-регулятора из начального устойчивого положения замкнутой системы. Предложено использовать нейронные сети для регулирования двухканального объекта, при этом обучение будет выполняться из неустойчивого (произвольного) начального положения с применением методов обучения нейронных сетей с подкреплением. Предложена структура нейронной сети и замкнутой системы, в которой уставка задается при помощи входного параметра нейронной сети регулятора The problem for synthesis of automatic control systems is hard, especially for multichannel objects. One of the approaches is the use of neural networks. For the approaches that are based on the use of reinforcement learning, there is an additional issue - supporting of range of values for the set points. The method of synthesis of automatic control systems using neural networks and the process of its learning with reinforcement learning that allows neural networks learning for supporting regulation is proposed in the predefined range of set points. The main steps of the method are 1) to form a neural net input as a state of the object and system set point; 2) to perform modelling of the system with a set of randomly generated set points from the desired range; 3) to perform a one-step of the learning using the Deterministic Policy Gradient method. The originality of the proposed method is that, in contrast to existing methods of using a neural network to synthesize a controller, the proposed method allows training a controller from an unstable initial state in a closed system and set of a range of set points. The method was applied to the problem of stabilizing the outputs of a two-channel object, for which stabilization both outputs and the first near the input set point is required

Download Full-text