Attitude control of a nanosatellite system using reinforcement learning and neural networks

In recent years, Channel State Information (CSI) measured by WiFi is widely used for human activity recognition. In this article, we propose a deep learning design for location- and person-independent activity recognition with WiFi. The proposed design consists of three Deep Neural Networks (DNNs): a 2D Convolutional Neural Network (CNN) as the recognition algorithm, a 1D CNN as the state machine, and a reinforcement learning agent for neural architecture search. The recognition algorithm learns location- and person-independent features from different perspectives of CSI data. The state machine learns temporal dependency information from history classification results. The reinforcement learning agent optimizes the neural architecture of the recognition algorithm using a Recurrent Neural Network (RNN) with Long Short-Term Memory (LSTM). The proposed design is evaluated in a lab environment with different WiFi device locations, antenna orientations, sitting/standing/walking locations/orientations, and multiple persons. The proposed design has 97% average accuracy when testing devices and persons are not seen during training. The proposed design is also evaluated by two public datasets with accuracy of 80% and 83%. The proposed design needs very little human efforts for ground truth labeling, feature engineering, signal processing, and tuning of learning parameters and hyperparameters.

Download Full-text

Satellite Attitude Control with Deep Reinforcement Learning

2020 Chinese Automation Congress (CAC) ◽

10.1109/cac51589.2020.9326605 ◽

2020 ◽

Author(s):

Duozhi Gao ◽

Haibo Zhang ◽

Chuanjiang Li ◽

Xinzhou Gao

Keyword(s):

Reinforcement Learning ◽

Attitude Control ◽

Satellite Attitude ◽

Satellite Attitude Control

Download Full-text

Cascade Attribute Network: Decomposing Reinforcement Learning Control Policies using Hierarchical Neural Networks

IFAC-PapersOnLine ◽

10.1016/j.ifacol.2020.12.2317 ◽

2020 ◽

Vol 53 (2) ◽

pp. 8181-8186

Author(s):

Haonan Chang ◽

Zhuo Xu ◽

Masayoshi Tomizuka

Keyword(s):

Neural Networks ◽

Reinforcement Learning ◽

Learning Control ◽

Control Policies ◽

Hierarchical Neural Networks

Download Full-text

Cancer Diagnosis Based on Combination of Artificial Neural Networks and Reinforcement Learning

2020 6th Iranian Conference on Signal Processing and Intelligent Systems (ICSPIS) ◽

10.1109/icspis51611.2020.9349530 ◽

2020 ◽

Author(s):

Amir Toranj Simin ◽

Seyed Mohsen Ghorabi Baygi ◽

Amin Noori

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

Reinforcement Learning ◽

Cancer Diagnosis ◽

Artificial Neural

Download Full-text

R3L: Connecting Deep Reinforcement Learning To Recurrent Neural Networks For Image Denoising Via Residual Recovery

10.1109/icip42928.2021.9506323 ◽

2021 ◽

Author(s):

Rongkai Zhang ◽

Jiang Zhu ◽

Zhiyuan Zha ◽

Justin Dauwels ◽

Bihan Wen

Keyword(s):

Neural Networks ◽

Reinforcement Learning ◽

Image Denoising ◽

Recurrent Neural Networks

Download Full-text

Cognitive Control Using Adaptive RBF Neural Networks and Reinforcement Learning for Networked Control System Subject to Time-Varying Delay and Packet Losses

Arabian Journal for Science and Engineering ◽

10.1007/s13369-021-05752-y ◽

2021 ◽

Author(s):

Shuti Wang ◽

Xunhe Yin ◽

Peng Li ◽

Yanxin Zhang ◽

Xin Wang ◽

...

Keyword(s):

Neural Networks ◽

Control System ◽

Reinforcement Learning ◽

Cognitive Control ◽

Networked Control System ◽

Time Varying ◽

Rbf Neural Networks ◽

Packet Losses ◽

Time Varying Delay ◽

Varying Delay

Download Full-text

Diversity oriented Deep Reinforcement Learning for targeted molecule generation

Journal of Cheminformatics ◽

10.1186/s13321-021-00498-z ◽

2021 ◽

Vol 13 (1) ◽

Author(s):

Tiago Pereira ◽

Maryam Abbasi ◽

Bernardete Ribeiro ◽

Joel P. Arrais

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Reinforcement Learning ◽

Deep Neural Networks ◽

Chemical Space ◽

Biological Properties ◽

Training Process ◽

Training Strategy ◽

Inhibitory Power ◽

Exploratory Strategy

AbstractIn this work, we explore the potential of deep learning to streamline the process of identifying new potential drugs through the computational generation of molecules with interesting biological properties. Two deep neural networks compose our targeted generation framework: the Generator, which is trained to learn the building rules of valid molecules employing SMILES strings notation, and the Predictor which evaluates the newly generated compounds by predicting their affinity for the desired target. Then, the Generator is optimized through Reinforcement Learning to produce molecules with bespoken properties. The innovation of this approach is the exploratory strategy applied during the reinforcement training process that seeks to add novelty to the generated compounds. This training strategy employs two Generators interchangeably to sample new SMILES: the initially trained model that will remain fixed and a copy of the previous one that will be updated during the training to uncover the most promising molecules. The evolution of the reward assigned by the Predictor determines how often each one is employed to select the next token of the molecule. This strategy establishes a compromise between the need to acquire more information about the chemical space and the need to sample new molecules, with the experience gained so far. To demonstrate the effectiveness of the method, the Generator is trained to design molecules with an optimized coefficient of partition and also high inhibitory power against the Adenosine $$A_{2A}$$ A 2 A and $$\kappa$$ κ opioid receptors. The results reveal that the model can effectively adjust the newly generated molecules towards the wanted direction. More importantly, it was possible to find promising sets of unique and diverse molecules, which was the main purpose of the newly implemented strategy.

Download Full-text