Control of an Inverted Pendulum by Reinforcement Learning Method in PLC Environment

Gomoku is a two-player board game that originated in ancient China. There are various cases of developing Gomoku using artificial intelligence, such as a genetic algorithm and a tree search algorithm. Alpha-Gomoku, Gomoku AI built with Alpha-Go’s algorithm, defines all possible situations in the Gomoku board using Monte-Carlo tree search (MCTS), and minimizes the probability of learning other correct answers in the duplicated Gomoku board situation. However, in the tree search algorithm, the accuracy drops, because the classification criteria are manually set. In this paper, we propose an improved reinforcement learning-based high-level decision approach using convolutional neural networks (CNN). The proposed algorithm expresses each state as One-Hot Encoding based vectors and determines the state of the Gomoku board by combining the similar state of One-Hot Encoding based vectors. Thus, in a case where a stone that is determined by CNN has already been placed or cannot be placed, we suggest a method for selecting an alternative. We verify the proposed method of Gomoku AI in GuPyEngine, a Python-based 3D simulation platform.

Download Full-text

A Deep Reinforcement Learning Method for Solving Task Mapping Problems with Dynamic Traffic on Parallel Systems

The International Conference on High Performance Computing in Asia-Pacific Region ◽

10.1145/3432261.3432262 ◽

2021 ◽

Author(s):

Yu-Cheng Wang ◽

Jerry Chou ◽

I-Hsin Chung

Keyword(s):

Reinforcement Learning ◽

Parallel Systems ◽

Task Mapping ◽

Learning Method ◽

Dynamic Traffic

Download Full-text

A real-time HIL control system on rotary inverted pendulum hardware platform based on double deep Q-network

Measurement and Control ◽

10.1177/00202940211000380 ◽

2021 ◽

Vol 54 (3-4) ◽

pp. 417-428

Author(s):

Yanyan Dai ◽

KiDong Lee ◽

SukGyu Lee

Keyword(s):

Control System ◽

Reinforcement Learning ◽

Inverted Pendulum ◽

Learning Algorithm ◽

Deep Understanding ◽

Control Engineering ◽

Experience Replay ◽

Real Hardware ◽

Rotary Inverted Pendulum ◽

Reinforcement Learning Algorithm

For real applications, rotary inverted pendulum systems have been known as the basic model in nonlinear control systems. If researchers have no deep understanding of control, it is difficult to control a rotary inverted pendulum platform using classic control engineering models, as shown in section 2.1. Therefore, without classic control theory, this paper controls the platform by training and testing reinforcement learning algorithm. Many recent achievements in reinforcement learning (RL) have become possible, but there is a lack of research to quickly test high-frequency RL algorithms using real hardware environment. In this paper, we propose a real-time Hardware-in-the-loop (HIL) control system to train and test the deep reinforcement learning algorithm from simulation to real hardware implementation. The Double Deep Q-Network (DDQN) with prioritized experience replay reinforcement learning algorithm, without a deep understanding of classical control engineering, is used to implement the agent. For the real experiment, to swing up the rotary inverted pendulum and make the pendulum smoothly move, we define 21 actions to swing up and balance the pendulum. Comparing Deep Q-Network (DQN), the DDQN with prioritized experience replay algorithm removes the overestimate of Q value and decreases the training time. Finally, this paper shows the experiment results with comparisons of classic control theory and different reinforcement learning algorithms.

Download Full-text

A Model-Free Distributed Cooperative Frequency Control Strategy for MT-HVDC Systems Using Reinforcement Learning Method

Journal of the Franklin Institute ◽

10.1016/j.jfranklin.2021.06.011 ◽

2021 ◽

Author(s):

Zhong-Jie Hu ◽

Zhi-Wei Liu ◽

Chaojie Li ◽

Tingwen Huang ◽

Xiong Hu

Keyword(s):

Reinforcement Learning ◽

Control Strategy ◽

Frequency Control ◽

Learning Method ◽

Model Free

Download Full-text

A reinforcement learning method with closed-loop stability guarantee for systems with unknown parameters

IFAC-PapersOnLine ◽

10.1016/j.ifacol.2020.12.2303 ◽

2020 ◽

Vol 53 (2) ◽

pp. 8157-8162

Author(s):

Thomas Göhrt ◽

Fritjof Griesing-Scheiwe ◽

Pavel Osinenko ◽

Stefan Streif

Keyword(s):

Reinforcement Learning ◽

Closed Loop ◽

Learning Method ◽

Unknown Parameters ◽

Loop Stability

Download Full-text

Control of an Inverted Pendulum by Reinforcement Learning Method in PLC Environment

A Plant Control Technology Using Reinforcement Learning Method with Automatic Reward Adjustment

Multi-index Evaluation based Reinforcement Learning Method for Cyclic Optimization of Multiple Energy Utilization in Steel Industry

Medium and Long-Term Stochastic Optimization of Hybrid Pumped Storage Reservoir via Reinforcement Learning Method

Collision Avoidance in IEEE 802.11 DCF using a Reinforcement Learning Method

AoI-Energy-Aware UAV-assisted Data Collection for IoT Networks: A Deep Reinforcement Learning Method

Enhanced Reinforcement Learning Method Combining One-Hot Encoding-Based Vectors for CNN-Based Alternative High-Level Decisions

A Deep Reinforcement Learning Method for Solving Task Mapping Problems with Dynamic Traffic on Parallel Systems

A real-time HIL control system on rotary inverted pendulum hardware platform based on double deep Q-network

A Model-Free Distributed Cooperative Frequency Control Strategy for MT-HVDC Systems Using Reinforcement Learning Method

A reinforcement learning method with closed-loop stability guarantee for systems with unknown parameters

Export Citation Format