Device Placement Optimization for Deep Neural Networks via One-shot Model and Reinforcement Learning

Author(s):  
Zixiang Ding ◽  
Yaran Chen ◽  
Nannan Li ◽  
Dongbin Zhao
2021 ◽  
Vol 2 (1) ◽  
pp. 1-25


Author(s):  
Yongsen Ma ◽  
Sheheryar Arshad ◽  
Swetha Muniraju ◽  
Eric Torkildson ◽  
Enrico Rantala ◽  
...  

In recent years, Channel State Information (CSI) measured by WiFi has been widely used for human activity recognition. In this article, we propose a deep learning design for location- and person-independent activity recognition with WiFi. The proposed design consists of three Deep Neural Networks (DNNs): a 2D Convolutional Neural Network (CNN) as the recognition algorithm, a 1D CNN as the state machine, and a reinforcement learning agent for neural architecture search. The recognition algorithm learns location- and person-independent features from different perspectives of the CSI data. The state machine learns temporal dependency information from the history of classification results. The reinforcement learning agent optimizes the neural architecture of the recognition algorithm using a Recurrent Neural Network (RNN) with Long Short-Term Memory (LSTM). The proposed design is evaluated in a lab environment with different WiFi device locations, antenna orientations, sitting/standing/walking locations/orientations, and multiple persons. It achieves 97% average accuracy when the test devices and persons are not seen during training, and accuracies of 80% and 83% on two public datasets. The proposed design requires very little human effort for ground truth labeling, feature engineering, signal processing, and tuning of learning parameters and hyperparameters.
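As a rough illustration of the recognition algorithm described above, the following is a minimal sketch of a 2D CNN classifier for CSI windows in PyTorch. The input layout (antenna pairs × subcarriers × time steps), the number of activity classes, and all layer widths are assumptions for illustration, not the architecture found by the search agent.

```python
# Minimal sketch of a 2D-CNN recognition network for CSI windows.
# Input layout and layer sizes are illustrative placeholders.
import torch
import torch.nn as nn

class CSIRecognizer(nn.Module):
    def __init__(self, in_channels: int = 3, num_classes: int = 6):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(in_channels, 16, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(16, 32, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.AdaptiveAvgPool2d((4, 4)),
        )
        self.classifier = nn.Linear(32 * 4 * 4, num_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, antenna pairs, subcarriers, time steps)
        h = self.features(x).flatten(1)
        return self.classifier(h)

# Example: a batch of 8 CSI windows, 3 antenna pairs, 30 subcarriers, 100 time steps.
logits = CSIRecognizer()(torch.randn(8, 3, 30, 100))
print(logits.shape)  # torch.Size([8, 6])
```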


2021 ◽  
Vol 13 (1) ◽  
Author(s):  
Tiago Pereira ◽  
Maryam Abbasi ◽  
Bernardete Ribeiro ◽  
Joel P. Arrais

In this work, we explore the potential of deep learning to streamline the process of identifying new potential drugs through the computational generation of molecules with interesting biological properties. Two deep neural networks compose our targeted generation framework: the Generator, which is trained to learn the building rules of valid molecules using the SMILES string notation, and the Predictor, which evaluates the newly generated compounds by predicting their affinity for the desired target. The Generator is then optimized through Reinforcement Learning to produce molecules with bespoke properties. The innovation of this approach is the exploratory strategy applied during the reinforcement training process, which seeks to add novelty to the generated compounds. This training strategy employs two Generators interchangeably to sample new SMILES: the initially trained model, which remains fixed, and a copy of it, which is updated during training to uncover the most promising molecules. The evolution of the reward assigned by the Predictor determines how often each one is employed to select the next token of the molecule. This strategy strikes a balance between acquiring more information about the chemical space and sampling new molecules using the experience gained so far. To demonstrate the effectiveness of the method, the Generator is trained to design molecules with an optimized partition coefficient and high inhibitory power against the Adenosine $A_{2A}$ and $\kappa$ opioid receptors. The results reveal that the model can effectively steer the newly generated molecules in the desired direction. More importantly, it was possible to find promising sets of unique and diverse molecules, which was the main purpose of the newly implemented strategy.
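A hedged sketch of the two-Generator sampling idea follows: a frozen pre-trained Generator and an updated copy take turns proposing the next SMILES token, with the choice probability driven by how the Predictor's reward has recently evolved. The names sample_token and mix_probability, and the threshold values, are illustrative assumptions, not the authors' implementation.

```python
# Sketch: alternate between a frozen and an updated Generator per token,
# weighted by the recent trend of the Predictor's reward.
import random

def mix_probability(reward_history, window=10):
    """Favour the updated Generator when the Predictor's recent rewards are improving."""
    if len(reward_history) < 2 * window:
        return 0.5                              # not enough history: use both evenly
    recent = sum(reward_history[-window:]) / window
    earlier = sum(reward_history[-2 * window:-window]) / window
    return 0.9 if recent > earlier else 0.1

def sample_smiles(frozen_gen, updated_gen, reward_history, max_len=80):
    """Build one SMILES string, choosing which Generator samples each token."""
    tokens = ["<start>"]
    p_updated = mix_probability(reward_history)
    while len(tokens) < max_len:
        generator = updated_gen if random.random() < p_updated else frozen_gen
        token = generator.sample_token(tokens)  # assumed interface: next-token sampler
        tokens.append(token)
        if token == "<end>":
            break
    return "".join(t for t in tokens if t not in ("<start>", "<end>"))
```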


IEEE Access ◽  
2020 ◽  
Vol 8 ◽  
pp. 195608-195621
Author(s):  
Aleksey Staroverov ◽  
Dmitry A. Yudin ◽  
Ilya Belkin ◽  
Vasily Adeshkin ◽  
Yaroslav K. Solomentsev ◽  
...  

Author(s):  
Yusuke Taguchi ◽  
Hideitsu Hino ◽  
Keisuke Kameyama

There are many situations in supervised learning where the acquisition of data is very expensive and sometimes constrained by a user's budget. One way to address this limitation is active learning. In this study, we focus on the fixed-budget regime and propose a novel algorithm for the pool-based active learning problem. The proposed method performs active learning with a pre-trained acquisition function so that maximum performance can be achieved when the number of samples that can be acquired is fixed. To implement this, the proposed method uses reinforcement learning based on deep neural networks as a pre-trained acquisition function tailored to the fixed-budget situation. Using the pre-trained deep Q-learning-based acquisition function, we can realize an active learner that selects samples for annotation from the pool of unlabeled samples while taking the fixed budget into account. The proposed method is experimentally shown to be comparable with or superior to existing active learning methods, suggesting the effectiveness of the proposed approach for fixed-budget active learning.
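The selection loop implied by this description could look roughly like the sketch below: a pre-trained Q-network scores every candidate in the pool, conditioned on the remaining budget, and the highest-scoring sample is sent for annotation until the budget is exhausted. The featurize and q_net.predict interfaces are assumptions made for illustration, not the paper's state encoding.

```python
# Sketch of fixed-budget pool-based selection with a pre-trained Q-network
# acting as the acquisition function.
import numpy as np

def select_with_budget(q_net, featurize, pool, labeled, budget):
    """Greedily annotate `budget` samples, one per step, ranked by learned Q-values."""
    pool = list(pool)
    for step in range(budget):
        remaining = budget - step
        # One feature vector per unlabeled candidate, conditioned on the remaining budget.
        features = np.stack([featurize(x, labeled, remaining) for x in pool])
        q_values = q_net.predict(features)      # assumed: one acquisition score per row
        best = int(np.argmax(q_values))
        labeled.append(pool.pop(best))          # query the oracle for this sample's label
    return labeled
```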


Author(s):  
Nicholas D. Kullman ◽  
Martin Cousineau ◽  
Justin C. Goodson ◽  
Jorge E. Mendoza

We consider the problem of an operator controlling a fleet of electric vehicles for use in a ride-hailing service. The operator, seeking to maximize profit, must assign vehicles to requests as they arise as well as recharge and reposition vehicles in anticipation of future requests. To solve this problem, we employ deep reinforcement learning, developing policies whose decision making uses value function approximations learned by deep neural networks. We compare these policies against a reoptimization-based policy and against dual bounds on the value of an optimal policy, including the value of an optimal policy with perfect information, which we establish using a Benders-based decomposition. We assess performance on instances derived from real data for the island of Manhattan in New York City. We find that, across instances of varying size, our best policy trained with deep reinforcement learning outperforms the reoptimization approach. We also provide evidence that this policy may be effectively scaled and deployed on larger instances without retraining.
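A minimal sketch of value-based dispatch in this spirit is shown below: each feasible (vehicle, action) pair for an incoming request is scored with a learned value approximation, and the fleet acts greedily on the best score. The encode and value_net.predict interfaces and the battery-feasibility check are illustrative assumptions, not the paper's implementation.

```python
# Sketch: greedy dispatch using a neural value approximation to score
# serve / recharge / reposition decisions for each vehicle.
import numpy as np

def dispatch(value_net, encode, vehicles, request):
    """Return the (vehicle, action) pair with the highest approximated value."""
    best, best_value = None, -np.inf
    for vehicle in vehicles:
        for action in ("serve", "recharge", "reposition"):
            # Skip assignments the vehicle cannot complete on its current charge
            # (battery and energy_needed are assumed vehicle attributes).
            if action == "serve" and vehicle.battery < vehicle.energy_needed(request):
                continue
            value = value_net.predict(encode(vehicle, request, action))  # assumed interface
            if value > best_value:
                best, best_value = (vehicle, action), value
    return best
```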

