Optimal Design of Semi-Active Mid-Story Isolation System using Supervised Learning and Reinforcement Learning

2021
Vol 21 (4)
pp. 73-80
Author(s): Joo-Won Kang, Hyun-Su Kim
2020
Author(s): Thanh Cong Do, Hyung Jeong Yang, Seok Bong Yoo, In-Jae Oh

2015
Vol 25 (3)
pp. 471-482
Author(s): Bartłomiej Śnieżyński

In this paper we propose a strategy learning model for autonomous agents based on classification. In the literature, the most commonly used learning method in agent-based systems is reinforcement learning. In our opinion, classification can be considered a good alternative: this type of supervised learning can be used to generate a classifier that allows the agent to choose an appropriate action for execution. Experimental results show that this model can be successfully applied for strategy generation even if rewards are delayed. We compare the efficiency of the proposed model and reinforcement learning using the farmer-pest domain and configurations of various complexity. In complex environments, supervised learning can improve the performance of agents much faster than reinforcement learning. If an appropriate knowledge representation is used, the learned knowledge may be analyzed by humans, which allows tracking of the learning process.
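As a concrete illustration of the classification-based approach, the sketch below trains a decision-tree policy on labeled state-action examples and then queries it to act. The farmer-pest domain is not specified in detail in the abstract, so the environment, features, and labels here are hypothetical stand-ins.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)

# Hypothetical experience: each row is a state observation, labeled with the
# action that (eventually) led to a positive reward in that state.
states = rng.integers(0, 4, size=(500, 3))   # 3 discrete state features
actions = states.sum(axis=1) % 2             # stand-in "correct" action labels

# Supervised learning step: generate a classifier from the labeled examples.
policy = DecisionTreeClassifier(max_depth=5, random_state=0).fit(states, actions)

# Acting step: the agent queries the classifier to choose an action.
new_state = np.array([[1, 3, 0]])
print("chosen action:", policy.predict(new_state)[0])

# A tree representation also lets humans inspect the learned strategy,
# e.g. via sklearn.tree.export_text(policy).
```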


Author(s): Buvanesh Pandian V

Reinforcement learning is a mathematical framework for agents to interact intelligently with their environment. Unlike supervised learning, where a system learns with the help of labeled data, reinforcement learning agents learn how to act by trial and error, receiving only a reward signal from their environment. A field where reinforcement learning has been prominently successful is robotics [3]. However, real-world control problems are particularly challenging because of the noise and high dimensionality of input data (e.g., visual input). In recent years, in the field of supervised learning, deep neural networks have been used successfully to extract meaning from this kind of data. Building on these advances, deep reinforcement learning has been used to solve complex problems like Atari games and Go. Mnih et al. [1] built a system with fixed hyperparameters that learned to play 49 different Atari games from raw pixel inputs alone. However, to apply the same methods to real-world control problems, deep reinforcement learning must be able to deal with continuous action spaces. Discretizing a continuous action space scales poorly, since the number of discrete actions grows exponentially with the dimensionality of the action. Furthermore, a parametrized policy can be advantageous because it can generalize in the action space. In this thesis we therefore study a state-of-the-art deep reinforcement learning algorithm, Deep Deterministic Policy Gradients (DDPG). We provide a theoretical comparison to other popular methods, evaluate its performance, identify its limitations, and investigate future directions of research. The remainder of the thesis is organized as follows. We start by introducing the field of interest, machine learning, focusing our attention on deep learning and reinforcement learning. We continue by describing in detail the two main algorithms at the core of this study, namely Deep Q-Network (DQN) and Deep Deterministic Policy Gradients (DDPG). We then provide implementation details of DDPG and our test environment, followed by a description of benchmark test cases. Finally, we discuss the results of our evaluation, identifying limitations of the current approach and proposing future avenues of research.
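The discretization argument is easy to quantify: with 10 bins per action dimension, a 7-dimensional action space already yields 10^7 joint actions. Below is a minimal sketch of the core DDPG update described above, assuming small fully connected actor and critic networks; the replay buffer, target networks, and exploration noise of the full algorithm are elided, and all names and sizes are illustrative rather than the thesis implementation.

```python
import torch
import torch.nn as nn

state_dim, action_dim = 8, 2
actor = nn.Sequential(nn.Linear(state_dim, 64), nn.ReLU(),
                      nn.Linear(64, action_dim), nn.Tanh())
critic = nn.Sequential(nn.Linear(state_dim + action_dim, 64), nn.ReLU(),
                       nn.Linear(64, 1))
actor_opt = torch.optim.Adam(actor.parameters(), lr=1e-4)
critic_opt = torch.optim.Adam(critic.parameters(), lr=1e-3)

def update(s, a, r, s2, gamma=0.99):
    # Critic: regress Q(s, a) toward the bootstrapped target r + gamma * Q(s', pi(s')).
    with torch.no_grad():
        target = r + gamma * critic(torch.cat([s2, actor(s2)], dim=1))
    critic_loss = nn.functional.mse_loss(critic(torch.cat([s, a], dim=1)), target)
    critic_opt.zero_grad(); critic_loss.backward(); critic_opt.step()

    # Actor: deterministic policy gradient -- ascend Q(s, pi(s)).
    actor_loss = -critic(torch.cat([s, actor(s)], dim=1)).mean()
    actor_opt.zero_grad(); actor_loss.backward(); actor_opt.step()

# One dummy batch to show the call signature.
s = torch.randn(32, state_dim); a = torch.randn(32, action_dim)
r = torch.randn(32, 1);         s2 = torch.randn(32, state_dim)
update(s, a, r, s2)
```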


Author(s): Xufang Luo, Qi Meng, Di He, Wei Chen, Yunhong Wang

Learning expressive representations is crucial for well-performing policies in deep reinforcement learning (DRL). Unlike in supervised learning, accurate targets are not always available in DRL, and some inputs with different actions differ only slightly, which heightens the demand for expressive representations. In this paper, we first empirically compare the representations of DRL models with different performances. We observe that, when visualized, the representations of a better state extractor (SE) are more scattered than those of a worse one. We therefore investigate the singular values of the representation matrix and find that better SEs consistently correspond to smaller differences among these singular values. Based on these observations, we define an indicator of the representations of a DRL model: the Number of Significant Singular Values (NSSV) of a representation matrix. We then propose the I4R algorithm, which improves DRL algorithms by adding a corresponding regularization term to increase the NSSV. Finally, we apply I4R to both policy gradient and value-based algorithms on Atari games, and the results show the superiority of the proposed method.
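A minimal sketch of the NSSV indicator and a spread-style regularizer follows: representations for a batch are stacked into a matrix, its singular value spectrum is inspected, and spread among singular values is penalized. The significance threshold and the exact penalty form used by I4R are not given in the abstract, so both are assumptions here.

```python
import torch

def nssv(reps: torch.Tensor, rel_threshold: float = 0.01) -> int:
    """Number of Significant Singular Values of a (batch x features) matrix."""
    s = torch.linalg.svdvals(reps)                # singular values, descending
    return int((s > rel_threshold * s[0]).sum())

def spread_penalty(reps: torch.Tensor) -> torch.Tensor:
    # Penalize the ratio of largest to smallest singular value to push the
    # spectrum toward uniformity and hence raise the NSSV. This penalty form
    # is an assumption; the paper's exact I4R term may differ.
    s = torch.linalg.svdvals(reps)
    return s[0] / (s[-1] + 1e-8)

reps = torch.randn(64, 16, requires_grad=True)   # hypothetical SE outputs for a batch
base_loss = torch.tensor(0.0)                    # placeholder for the base DRL loss
total_loss = base_loss + 1e-3 * spread_penalty(reps)
total_loss.backward()                            # gradients flow into the representations
print("NSSV:", nssv(reps.detach()))
```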


2021
pp. 1-1
Author(s): Bin-Xiao Wang, Wen-Sheng Zhao, Da-Wei Wang, Junchao Wang, Wenjun Li, et al.

2019
Vol 196
pp. 109324
Author(s): Zhipeng Zhao, Qingjun Chen, Ruifu Zhang, Chao Pan, Yiyao Jiang

Computers
2019
Vol 8 (1)
pp. 8
Author(s): Marcus Lim, Azween Abdullah, NZ Jhanjhi, Mahadevan Supramaniam

Criminal network activities, which are usually secret and stealthy, present certain difficulties for criminal network analysis (CNA) because complete datasets are lacking. The collection of criminal activity data in these networks tends to be incomplete and inconsistent, which is reflected structurally in the criminal network as missing nodes (actors) and links (relationships). Criminal networks are commonly analyzed using social network analysis (SNA) models. Most machine learning techniques that rely on the metrics of SNA models to develop hidden or missing link prediction models use supervised learning. However, supervised learning usually requires a large dataset to train the link prediction model to an optimum performance level. This research therefore explores the application of deep reinforcement learning (DRL) to developing a hidden link prediction model for criminal networks by reconstructing a corrupted criminal network dataset. The experiments indicate that the dataset generated by the DRL model through self-play or self-simulation can be used to train the link prediction model. The DRL link prediction model exhibits better performance than a conventional supervised machine learning technique, such as a gradient boosting machine (GBM), trained with a relatively smaller domain dataset.
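The supervised baseline mentioned above can be sketched as a gradient boosting classifier trained on SNA-style node-pair metrics to predict links. The graph, features, and sampling below are synthetic stand-ins (the DRL self-play data generation is beyond a short example).

```python
import networkx as nx
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier

G = nx.barabasi_albert_graph(100, 3, seed=1)   # synthetic proxy network
deg = dict(G.degree())

def pair_features(u, v):
    # Simple SNA metrics for a candidate node pair. In a real study the
    # target edge would be removed before computing features to avoid leakage.
    cn = len(list(nx.common_neighbors(G, u, v)))
    jac = next(nx.jaccard_coefficient(G, [(u, v)]))[2]
    return [deg[u], deg[v], cn, jac]

rng = np.random.default_rng(0)
pos = list(G.edges())[:150]                    # known links (positive class)
nonedges = list(nx.non_edges(G))
neg = [nonedges[i] for i in rng.choice(len(nonedges), size=150, replace=False)]

X = np.array([pair_features(u, v) for u, v in pos + neg])
y = np.array([1] * len(pos) + [0] * len(neg))

gbm = GradientBoostingClassifier().fit(X, y)   # the supervised GBM baseline
print("training accuracy:", gbm.score(X, y))
```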

