Optimizing a Gamified Design Through Reinforcement Learning - a Case Study in Stack Overflow

<p>In medicinal chemistry programs it is key to design and make compounds that are efficacious and safe. This is a long, complex and difficult multi-parameter optimization process, often including several properties with orthogonal trends. New methods for the automated design of compounds against profiles of multiple properties are thus of great value. Here we present a fragment-based reinforcement learning approach based on an actor-critic model, for the generation of novel molecules with optimal properties. The actor and the critic are both modelled with bidirectional long short-term memory (LSTM) networks. The AI method learns how to generate new compounds with desired properties by starting from an initial set of lead molecules and then improve these by replacing some of their fragments. A balanced binary tree based on the similarity of fragments is used in the generative process to bias the output towards structurally similar molecules. The method is demonstrated by a case study showing that 93% of the generated molecules are chemically valid, and a third satisfy the targeted objectives, while there were none in the initial set.</p>

Download Full-text

Enhancing Energy Trading Between Different Islanded Microgrids A Reinforcement Learning Algorithm Case Study in Northern Kordofan State

2020 International Conference on Computer, Control, Electrical, and Electronics Engineering (ICCCEEE) ◽

10.1109/iccceee49695.2021.9429584 ◽

2021 ◽

Author(s):

Moayad ELamin ◽

Fay Elhassan ◽

Mahmoud A. Manzoul

Keyword(s):

Reinforcement Learning ◽

Learning Algorithm ◽

Energy Trading ◽

Reinforcement Learning Algorithm

Download Full-text

How are programming questions from women received on stack overflow? a case study of peer parity

Proceedings Companion of the 2017 ACM SIGPLAN International Conference on Systems, Programming, Languages, and Applications: Software for Humanity - SPLASH Companion 2017 ◽

10.1145/3135932.3135952 ◽

2017 ◽

Cited By ~ 4

Author(s):

Savannah Morgan

Keyword(s):

Stack Overflow

Download Full-text

How Do Users Revise Answers on Technical Q&A Websites? A Case Study on Stack Overflow

IEEE Transactions on Software Engineering ◽

10.1109/tse.2018.2874470 ◽

2020 ◽

Vol 46 (9) ◽

pp. 1024-1038 ◽

Cited By ~ 3

Author(s):

Shaowei Wang ◽

Tse-Hsun Chen ◽

Ahmed E. Hassan

Keyword(s):

Stack Overflow

Download Full-text

A Case Study in Hybrid Multi-threading and Hierarchical Reinforcement Learning Approach for Cooperative Multi-agent Systems

2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI) ◽

10.1109/micai.2015.20 ◽

2015 ◽

Author(s):

Hiram Ponce ◽

Ricardo Padilla ◽

Alan Davalos ◽

Alvaro Herrasti ◽

Cynthia Pichardo ◽

...

Keyword(s):

Reinforcement Learning ◽

Learning Approach ◽

Multi Agent Systems ◽

Agent Systems ◽

Hierarchical Reinforcement Learning ◽

Multi Agent

Download Full-text

Adaptive Multi-objective Reinforcement Learning for Pareto Frontier Approximation: A Case Study of Resource Allocation Network in Massive MIMO

10.23919/eusipco54536.2021.9615934 ◽

2021 ◽

Author(s):

Ruiqing Chen ◽

Fanglei Sun ◽

Liang Chen ◽

Kai Li ◽

Liantao Wu ◽

...

Keyword(s):

Resource Allocation ◽

Reinforcement Learning ◽

Massive Mimo ◽

Pareto Frontier ◽

Multi Objective

Download Full-text

A Reinforcement Learning - Great-Deluge Hyper-Heuristic for Examination Timetabling

International Journal of Applied Metaheuristic Computing ◽

10.4018/jamc.2010102603 ◽

2010 ◽

Vol 1 (1) ◽

pp. 39-59 ◽

Cited By ~ 70

Author(s):

Ender Özcan ◽

Mustafa Misir ◽

Gabriela Ochoa ◽

Edmund K. Burke

Keyword(s):

Reinforcement Learning ◽

Complete Solution ◽

Examination Timetabling ◽

Low Level ◽

Termination Criteria ◽

Candidate Solution ◽

Wide Range ◽

Finite Set ◽

Different Characteristics

Hyper-heuristics can be identified as methodologies that search the space generated by a finite set of low level heuristics for solving search problems. An iterative hyper-heuristic framework can be thought of as requiring a single candidate solution and multiple perturbation low level heuristics. An initially generated complete solution goes through two successive processes (heuristic selection and move acceptance) until a set of termination criteria is satisfied. A motivating goal of hyper-heuristic research is to create automated techniques that are applicable to a wide range of problems with different characteristics. Some previous studies show that different combinations of heuristic selection and move acceptance as hyper-heuristic components might yield different performances. This study investigates whether learning heuristic selection can improve the performance of a great deluge based hyper-heuristic using an examination timetabling problem as a case study.

Download Full-text

Accelerate Personalized IoT Service Provision by Cloud-Aided Edge Reinforcement Learning: A Case Study on Smart Lighting

Service-Oriented Computing - Lecture Notes in Computer Science ◽

10.1007/978-3-030-65310-1_6 ◽

2020 ◽

pp. 69-84

Author(s):

Jun Na ◽

Handuo Zhang ◽

Xin Deng ◽

Bin Zhang ◽

Ziyi Ye

Keyword(s):

Reinforcement Learning ◽

Service Provision ◽

Smart Lighting

Download Full-text

Multi-Context Generation in Virtual Reality Environments Using Deep Reinforcement Learning

Volume 9: 40th Computers and Information in Engineering Conference (CIE) ◽

10.1115/detc2020-22624 ◽

2020 ◽

Author(s):

James Cunningham ◽

Christian Lopez ◽

Omar Ashour ◽

Conrad S. Tucker

Keyword(s):

Virtual Reality ◽

Reinforcement Learning ◽

Virtual Environments ◽

Probability Distributions ◽

Automatic Generation ◽

Grocery Store ◽

Training Data ◽

Learning Approaches ◽

Common Concept

Abstract In this work, a Deep Reinforcement Learning (RL) approach is proposed for Procedural Content Generation (PCG) that seeks to automate the generation of multiple related virtual reality (VR) environments for enhanced personalized learning. This allows for the user to be exposed to multiple virtual scenarios that demonstrate a consistent theme, which is especially valuable in an educational context. RL approaches to PCG offer the advantage of not requiring training data, as opposed to other PCG approaches that employ supervised learning approaches. This work advances the state of the art in RL-based PCG by demonstrating the ability to generate a diversity of contexts in order to teach the same underlying concept. A case study is presented that demonstrates the feasibility of the proposed RL-based PCG method using examples of probability distributions in both manufacturing facility and grocery store virtual environments. The method demonstrated in this paper has the potential to enable the automatic generation of a variety of virtual environments that are connected by a common concept or theme.

Download Full-text