Analysis of a Method Improving Reinforcement Learning Agents’ Policies

Author(s):  
Daisuke Kitakoshi ◽  
Hiroyuki Shioya ◽  
Masahito Kurihara ◽  

Reinforcement learning (RL) is a branch of machine learning that aims to optimize agents' policies by adapting the agents to an environment according to rewards. In this paper, we propose a method for improving policies by using the stochastic knowledge that reinforcement learning agents obtain. We use a Bayesian network (BN), a stochastic model, as an agent's knowledge. Its structure is selected by the minimum description length (MDL) criterion, using series of the agent's inputs, outputs, and rewards as sample data. The BN constructed in our study represents stochastic dependencies between the input-output pairs and the rewards. In the proposed method, policies are improved by supervised learning that uses the structure of the BN (i.e., the stochastic knowledge). This improvement mechanism enables RL agents to acquire more effective policies. We carry out simulations on the pursuit problem to demonstrate the effectiveness of the proposed method.
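The MDL-based structure selection described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: it scores one node of a Bayesian network under a candidate parent set as negative log-likelihood plus a (k/2) log N penalty on the free parameters, and the toy `action`/`reward` variables are assumed for illustration only.

```python
import math
from collections import Counter

def mdl_score(data, child, parents):
    """MDL of one node given its parent set: negative log-likelihood
    of the data plus a (k/2) * log(N) penalty on free parameters."""
    n = len(data)
    # Joint counts of (parent configuration, child value) and marginal
    # counts of each parent configuration.
    joint = Counter(tuple(row[p] for p in parents) + (row[child],) for row in data)
    parent_counts = Counter(tuple(row[p] for p in parents) for row in data)
    nll = -sum(c * math.log(c / parent_counts[key[:-1]]) for key, c in joint.items())
    child_states = len({row[child] for row in data})
    k = (child_states - 1) * len(parent_counts)  # free parameters
    return nll + 0.5 * k * math.log(n)

# Toy input-output/reward series in which the reward tracks the action:
# the structure with the action -> reward edge gets the lower (better) score.
data = [{"action": a, "reward": a} for a in (0, 1)] * 4
dep = mdl_score(data, "reward", ["action"])  # reward depends on action
ind = mdl_score(data, "reward", [])          # reward independent
```

Here `dep < ind`, so the criterion prefers the structure with the edge: the larger parameter penalty of the dependent model is outweighed by its better fit.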

Computers ◽  
2019 ◽  
Vol 8 (1) ◽  
pp. 8 ◽  
Author(s):  
Marcus Lim ◽  
Azween Abdullah ◽  
NZ Jhanjhi ◽  
Mahadevan Supramaniam

Criminal network activities, which are usually covert and stealthy, present certain difficulties for criminal network analysis (CNA) because complete datasets are rarely available. The data collected on activities in these networks tend to be incomplete and inconsistent, which is reflected structurally in the criminal network as missing nodes (actors) and links (relationships). Criminal networks are commonly analyzed using social network analysis (SNA) models. Most machine learning techniques that rely on the metrics of SNA models to develop hidden or missing link prediction models use supervised learning. However, supervised learning usually requires a large dataset to train the link prediction model to an optimal level of performance. This research therefore explores the application of deep reinforcement learning (DRL) to developing a hidden link prediction model for criminal networks from the reconstruction of a corrupted criminal network dataset. The experiment conducted on the model indicates that the dataset generated by the DRL model through self-play, or self-simulation, can be used to train the link prediction model. The DRL link prediction model outperforms a conventional supervised machine learning technique, such as the gradient boosting machine (GBM), trained with a relatively smaller domain dataset.
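The SNA-model metrics mentioned above can be illustrated with a classical baseline. The sketch below scores non-adjacent node pairs by their number of common neighbours, a standard link-prediction feature; it is not the paper's DRL method, and the toy network and node names are invented for illustration.

```python
from itertools import combinations

def common_neighbor_scores(adj):
    """Score every non-adjacent node pair by the number of shared
    neighbours -- a classic SNA metric used as a link-prediction feature."""
    scores = {}
    for u, v in combinations(sorted(adj), 2):
        if v not in adj[u]:
            scores[(u, v)] = len(adj[u] & adj[v])
    return scores

# Toy network in which the A-D link is "missing" from the observed data.
adj = {
    "A": {"B", "C"},
    "B": {"A", "C", "D"},
    "C": {"A", "B", "D"},
    "D": {"B", "C", "E"},
    "E": {"D"},
}
ranked = sorted(common_neighbor_scores(adj).items(), key=lambda kv: -kv[1])
```

The pair A-D shares two neighbours (B and C) and ranks first, so a predictor built on this metric would propose restoring that link before any other.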


2012 ◽  
pp. 695-703
Author(s):  
George Tzanis ◽  
Christos Berberidis ◽  
Ioannis Vlahavas

Machine learning is one of the oldest subfields of artificial intelligence and is concerned with the design and development of computational systems that can adapt themselves and learn. The most common machine learning algorithms can be either supervised or unsupervised. Supervised learning algorithms generate a function that maps inputs to desired outputs, based on a set of examples with known output (labeled examples). Unsupervised learning algorithms find patterns and relationships over a given set of inputs (unlabeled examples). Other categories of machine learning are semi-supervised learning, where an algorithm uses both labeled and unlabeled examples, and reinforcement learning, where an algorithm learns a policy of how to act given an observation of the world.
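The supervised/unsupervised contrast described above can be made concrete with a small sketch (assumed toy data, not drawn from any of the works listed here): a supervised learner uses the labels to compute class centroids, while unsupervised 2-means recovers similar centroids from the same inputs without ever seeing a label.

```python
import numpy as np

rng = np.random.default_rng(0)
# Two well-separated 1-D groups of points.
x = np.concatenate([rng.normal(0.0, 0.3, 50), rng.normal(5.0, 0.3, 50)])
y = np.array([0] * 50 + [1] * 50)  # labels, used only by the supervised learner

# Supervised: learn a centroid per class from labeled examples.
centroids_sup = np.array([x[y == k].mean() for k in (0, 1)])

# Unsupervised: 2-means finds similar centroids from unlabeled inputs.
c = np.array([x.min(), x.max()])  # initial guesses at the extremes
for _ in range(10):
    assign = np.abs(x[:, None] - c).argmin(axis=1)   # nearest-centroid step
    c = np.array([x[assign == k].mean() for k in (0, 1)])  # update step
```

Both procedures end up near the true group centers (0 and 5); the difference is only in whether the labels were available during learning.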



Author(s):  
Ahmad Roihan ◽  
Po Abas Sunarya ◽  
Ageng Setiani Rafika

Abstract - Machine learning is a part of artificial intelligence that is widely used to solve a variety of problems. This article reviews problem solving in recent studies by classifying machine learning into three categories: supervised learning, unsupervised learning, and reinforcement learning. The review shows that all three categories remain applicable in recent cases and can be improved to reduce computational cost and accelerate performance in order to achieve high accuracy and precision. This review is intended to identify gaps and to serve as a guideline for future research.

Keywords: machine learning, reinforcement learning, supervised learning, unsupervised learning


Author(s):  
Antonio Serrano-Muñoz ◽  
Nestor Arana-Arexolaleiba ◽  
Dimitrios Chrysostomou ◽  
Simon Bøgh

Abstract: Remanufacturing automation must be designed to be flexible and robust enough to overcome the uncertainties, the condition of the products, and the complexities in the planning and operation of the processes. Machine learning methods, in particular reinforcement learning, are presented as techniques to learn, improve, and generalise the automation of many robotic manipulation tasks (most of them related to grasping, picking, or assembly). However, they have not been widely exploited in remanufacturing, in particular for disassembly tasks. This work presents the state of the art of contact-rich disassembly using reinforcement learning algorithms, together with a study of how object extraction skills generalise when applied to contact-rich disassembly tasks. The generalisation capabilities of two state-of-the-art reinforcement learning agents (trained in simulation) are tested and evaluated in simulation and in the real world while performing a disassembly task. Results show that at least one of the agents can generalise the contact-rich extraction skill. In addition, this work identifies key concepts and gaps for research on, and application of, reinforcement learning algorithms in disassembly tasks.


Improving the performance of link prediction plays a significant role in the evaluation of social networks. Link prediction is one of the primary tasks in recommender systems, bioinformatics, and the web. Most machine learning methods that depend on the metrics of SNA models use supervised learning to develop link prediction models. Supervised learning typically requires a large dataset to train the link prediction model in order to reach an optimal level of performance. In recent years, deep reinforcement learning (DRL) has achieved notable success in various domains, including SNA. In this paper, we present the use of DRL to improve the performance and accuracy of a link prediction model on the applied datasets. The experiments show that the dataset created by the DRL model through self-play, or auto-simulation, can be utilized to improve the link prediction model. We use three different datasets: JUNANES, MAMBO, and JAKE. Experimental results show that the proposed DRL method achieves an accuracy of 85% on JUNANES, 87% on MAMBO, and 78% on JAKE, outperforming the GBM's next-highest accuracies of 75% on JUNANES, 79% on MAMBO, and 71% on JAKE, each trained with 2,500 iterations, and also outperforms it in terms of AUC measures. The DRL model shows better efficiency than traditional machine learning strategies such as random forest and the gradient boosting machine (GBM).


2021 ◽  
Author(s):  
Antonio Serrano Muñoz ◽  
Nestor Arana-Arexolaleiba ◽  
Dimitrios Chrysostomou ◽  
Simon Bøgh

Abstract: Remanufacturing automation must be designed to be flexible and robust enough to overcome the uncertainties, the condition of the products, and the complexities in the process's planning and operation. Machine learning methods, particularly reinforcement learning, are presented as techniques to learn, improve, and generalise the automation of many robotic manipulation tasks (most of them related to grasping, picking, or assembly). However, they have not been widely exploited in remanufacturing, in particular for disassembly tasks. This work presents the state of the art of contact-rich disassembly using reinforcement learning algorithms and a study of the generalisation of object extraction skills when applied to contact-rich disassembly tasks. The generalisation capabilities of two state-of-the-art reinforcement learning agents (trained in simulation) are tested and evaluated in simulation and in the real world while performing a disassembly task. Results show that at least one of the agents can generalise the contact-rich extraction skill. This work also identifies key concepts and gaps for research on, and application of, reinforcement learning algorithms in disassembly tasks.

