RIFLING: A reinforcement learning‐based GPU scheduler for deep learning research and development platforms

We present RLCard, a Python platform for reinforcement learning research and development in card games. RLCard supports various card environments and several baseline algorithms with unified easy-to-use interfaces, aiming at bridging reinforcement learning and imperfect information games. The platform provides flexible configurations of state representation, action encoding, and reward design. RLCard also supports visualizations for algorithm debugging. In this demo, we showcase two representative environments and their visualization results. We conclude this demo with challenges and research opportunities brought by RLCard. A video is available on YouTube.

Download Full-text

Scalable reinforcement-learning-based neural architecture search for cancer deep learning research

Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis ◽

10.1145/3295500.3356202 ◽

2019 ◽

Cited By ~ 1

Author(s):

Prasanna Balaprakash ◽

Romain Egele ◽

Misha Salim ◽

Stefan Wild ◽

Venkatram Vishwanath ◽

...

Keyword(s):

Deep Learning ◽

Reinforcement Learning ◽

Neural Architecture ◽

Learning Research

Download Full-text

Author response for "Deep learning and reinforcement learning approach on microgrid"

10.1002/2050-7038.12531/v2/response1 ◽

2020 ◽

Author(s):

Kumar Chandrasekaran ◽

Prabaakaran Kandasamy ◽

Srividhya Ramanathan

Keyword(s):

Deep Learning ◽

Reinforcement Learning ◽

Author Response ◽

Learning Approach

Download Full-text

Enabling Rewards for Reinforcement Learning in Laser Beam Welding processes through Deep Learning

2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA) ◽

10.1109/icmla51294.2020.00221 ◽

2020 ◽

Author(s):

Markus Schmitz ◽

Florian Pinsker ◽

Alexander Ruhri ◽

Beibei Jiang ◽

Georgij Safronov

Keyword(s):

Deep Learning ◽

Reinforcement Learning ◽

Laser Beam ◽

Laser Beam Welding ◽

Welding Processes

Download Full-text

Towards a Distributed Framework for Multi-Agent Reinforcement Learning Research

2020 IEEE High Performance Extreme Computing Conference (HPEC) ◽

10.1109/hpec43674.2020.9286212 ◽

2020 ◽

Author(s):

Yutai Zhou ◽

Shawn Manuel ◽

Peter Morales ◽

Sheng Li ◽

Jaime Pena ◽

...

Keyword(s):

Reinforcement Learning ◽

Distributed Framework ◽

Multi Agent ◽

Learning Research

Download Full-text

Data augmentation and transfer learning strategies for reaction prediction in low chemical data regimes

Organic Chemistry Frontiers ◽

10.1039/d0qo01636e ◽

2021 ◽

Author(s):

Yun Zhang ◽

Ling Wang ◽

Xinqiao Wang ◽

Chengyun Zhang ◽

Jiamin Ge ◽

...

Keyword(s):

Organic Chemistry ◽

Deep Learning ◽

Drug Discovery ◽

Research And Development ◽

Learning Strategies ◽

Transfer Learning ◽

Chemical Reactions ◽

Data Augmentation ◽

Learning Method ◽

Reaction Prediction

An effective and rapid deep learning method to predict chemical reactions contributes to the research and development of organic chemistry and drug discovery.

Download Full-text

Deep Learning-Based Ground Vibration Monitoring: Reinforcement Learning and RNN-CNN Approach

IEEE Geoscience and Remote Sensing Letters ◽

10.1109/lgrs.2021.3067974 ◽

2021 ◽

pp. 1-5

Author(s):

Sangseok Yun ◽

Jae-Mo Kang ◽

Jeongseok Ha ◽

Sangho Lee ◽

Dong-Woo Ryu ◽

...

Keyword(s):

Deep Learning ◽

Reinforcement Learning ◽

Ground Vibration ◽

Vibration Monitoring

Download Full-text

A Hybrid Framework for Functional Verification using Reinforcement Learning and Deep Learning

Proceedings of the 2019 on Great Lakes Symposium on VLSI - GLSVLSI '19 ◽

10.1145/3299874.3318039 ◽

2019 ◽

Cited By ~ 2

Author(s):

Karunveer Singh ◽

Rishabh Gupta ◽

Vikram Gupta ◽

Arash Fayyazi ◽

Massoud Pedram ◽

...

Keyword(s):

Deep Learning ◽

Reinforcement Learning ◽

Functional Verification ◽

Hybrid Framework

Download Full-text

Diversity oriented Deep Reinforcement Learning for targeted molecule generation

Journal of Cheminformatics ◽

10.1186/s13321-021-00498-z ◽

2021 ◽

Vol 13 (1) ◽

Author(s):

Tiago Pereira ◽

Maryam Abbasi ◽

Bernardete Ribeiro ◽

Joel P. Arrais

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Reinforcement Learning ◽

Deep Neural Networks ◽

Chemical Space ◽

Biological Properties ◽

Training Process ◽

Training Strategy ◽

Inhibitory Power ◽

Exploratory Strategy

AbstractIn this work, we explore the potential of deep learning to streamline the process of identifying new potential drugs through the computational generation of molecules with interesting biological properties. Two deep neural networks compose our targeted generation framework: the Generator, which is trained to learn the building rules of valid molecules employing SMILES strings notation, and the Predictor which evaluates the newly generated compounds by predicting their affinity for the desired target. Then, the Generator is optimized through Reinforcement Learning to produce molecules with bespoken properties. The innovation of this approach is the exploratory strategy applied during the reinforcement training process that seeks to add novelty to the generated compounds. This training strategy employs two Generators interchangeably to sample new SMILES: the initially trained model that will remain fixed and a copy of the previous one that will be updated during the training to uncover the most promising molecules. The evolution of the reward assigned by the Predictor determines how often each one is employed to select the next token of the molecule. This strategy establishes a compromise between the need to acquire more information about the chemical space and the need to sample new molecules, with the experience gained so far. To demonstrate the effectiveness of the method, the Generator is trained to design molecules with an optimized coefficient of partition and also high inhibitory power against the Adenosine $$A_{2A}$$ A 2 A and $$\kappa$$ κ opioid receptors. The results reveal that the model can effectively adjust the newly generated molecules towards the wanted direction. More importantly, it was possible to find promising sets of unique and diverse molecules, which was the main purpose of the newly implemented strategy.

Download Full-text