A3C for drone autonomous driving using Airsim

Recently, such a direction of machine learning as reinforcement learning has been actively developing. As a consequence, attempts are being made to use reinforcement learning for solving computer vision problems, in particular for solving the problem of image classification. The tasks of computer vision are currently one of the most urgent tasks of artificial intelligence. The article proposes a method for image classification in the form of a deep neural network using reinforcement learning. The idea of the developed method comes down to solving the problem of a contextual multi-armed bandit using various strategies for achieving a compromise between exploitation and research and reinforcement learning algorithms. Strategies such as -greedy, -softmax, -decay-softmax, and the UCB1 method, and reinforcement learning algorithms such as DQN, REINFORCE, and A2C are considered. The analysis of the influence of various parameters on the efficiency of the method is carried out, and options for further development of the method are proposed.

Download Full-text

Developing an Artificial Intelligence Project in your Radiology Department

Indian Journal of Musculoskeletal Radiology ◽

10.25259/ijmsr_50_2019 ◽

2020 ◽

Vol 2 ◽

pp. 58-61 ◽

Cited By ~ 1

Author(s):

Syed Junaid ◽

Asad Saeed ◽

Zeili Yang ◽

Thomas Micic ◽

Rajesh Botchu

Keyword(s):

Neural Network ◽

Artificial Intelligence ◽

Health Care ◽

Deep Learning ◽

Learning Algorithms ◽

Radiology Department ◽

Short Article ◽

Computing Power ◽

Road Map ◽

Key Concepts

The advances in deep learning algorithms, exponential computing power, and availability of digital patient data like never before have led to the wave of interest and investment in artificial intelligence in health care. No radiology conference is complete without a substantial dedication to AI. Many radiology departments are keen to get involved but are unsure of where and how to begin. This short article provides a simple road map to aid departments to get involved with the technology, demystify key concepts, and pique an interest in the field. We have broken down the journey into seven steps; problem, team, data, kit, neural network, validation, and governance.

Download Full-text

Reinforcement Learning for Cloud Computing Digital Library

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.571-572.105 ◽

2014 ◽

Vol 571-572 ◽

pp. 105-108

Author(s):

Lin Xu

Keyword(s):

Artificial Intelligence ◽

Cloud Computing ◽

Reinforcement Learning ◽

Digital Library ◽

Learning Algorithms ◽

Experimental Results ◽

Current Status ◽

Self Learning ◽

New Framework

This paper proposes a new framework of combining reinforcement learning with cloud computing digital library. Unified self-learning algorithms, which includes reinforcement learning, artificial intelligence and etc, have led to many essential advances. Given the current status of highly-available models, analysts urgently desire the deployment of write-ahead logging. In this paper we examine how DNS can be applied to the investigation of superblocks, and introduce the reinforcement learning to improve the quality of current cloud computing digital library. The experimental results show that the method works more efficiency.

Download Full-text

Pemanfaatan Asynchronous Advantage Actor-Critic Dalam Pembuatan AI Game Bot Pada Game Arcade

Journal of Intelligent System and Computation ◽

10.52985/insyst.v1i2.82 ◽

2019 ◽

Vol 1 (2) ◽

pp. 74-84

Author(s):

Evan Kusuma Susanto ◽

Yosi Kristian

Keyword(s):

Neural Network ◽

Artificial Intelligence ◽

Reinforcement Learning ◽

Convolutional Neural Network ◽

Short Term Memory ◽

Trial And Error ◽

Short Term ◽

Term Memory ◽

Memory Network ◽

Long Short Term Memory

Asynchronous Advantage Actor-Critic (A3C) adalah sebuah algoritma deep reinforcement learning yang dikembangkan oleh Google DeepMind. Algoritma ini dapat digunakan untuk menciptakan sebuah arsitektur artificial intelligence yang dapat menguasai berbagai jenis game yang berbeda melalui trial and error dengan mempelajari tempilan layar game dan skor yang diperoleh dari hasil tindakannya tanpa campur tangan manusia. Sebuah network A3C terdiri dari Convolutional Neural Network (CNN) di bagian depan, Long Short-Term Memory Network (LSTM) di tengah, dan sebuah Actor-Critic network di bagian belakang. CNN berguna sebagai perangkum dari citra output layar dengan mengekstrak fitur-fitur yang penting yang terdapat pada layar. LSTM berguna sebagai pengingat keadaan game sebelumnya. Actor-Critic Network berguna untuk menentukan tindakan terbaik untuk dilakukan ketika dihadapkan dengan suatu kondisi tertentu. Dari hasil percobaan yang dilakukan, metode ini cukup efektif dan dapat mengalahkan pemain pemula dalam memainkan 5 game yang digunakan sebagai bahan uji coba.

Download Full-text

Comprehensive Overview of Neural Networks and Its Applications in Autonomous Vehicles

Computational Intelligence in the Internet of Things - Advances in Computational Intelligence and Robotics ◽

10.4018/978-1-5225-7955-7.ch007 ◽

2019 ◽

pp. 159-173

Author(s):

Jay Rodge ◽

Swati Jaiswal

Keyword(s):

Neural Network ◽

Artificial Intelligence ◽

Neural Networks ◽

Deep Learning ◽

Autonomous Vehicles ◽

Autonomous Vehicle ◽

Learning Algorithms ◽

Activation Functions ◽

Comprehensive Overview ◽

Unit Component

Deep learning and Artificial intelligence (AI) have been trending these days due to the capability and state-of-the-art results that they provide. They have replaced some highly skilled professionals with neural network-powered AI, also known as deep learning algorithms. Deep learning majorly works on neural networks. This chapter discusses about the working of a neuron, which is a unit component of neural network. There are numerous techniques that can be incorporated while designing a neural network, such as activation functions, training, etc. to improve its features, which will be explained in detail. It has some challenges such as overfitting, which are difficult to neglect but can be overcome using proper techniques and steps that have been discussed. The chapter will help the academician, researchers, and practitioners to further investigate the associated area of deep learning and its applications in the autonomous vehicle industry.

Download Full-text

Machine Learning and Data Mining in Bioinformatics

Machine Learning ◽

10.4018/978-1-60960-818-7.ch401 ◽

2012 ◽

pp. 695-703

Author(s):

George Tzanis ◽

Christos Berberidis ◽

Ioannis Vlahavas

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Data Mining ◽

Reinforcement Learning ◽

Supervised Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

The World ◽

Supervised Learning Algorithms ◽

Computational Systems

Machine learning is one of the oldest subfields of artificial intelligence and is concerned with the design and development of computational systems that can adapt themselves and learn. The most common machine learning algorithms can be either supervised or unsupervised. Supervised learning algorithms generate a function that maps inputs to desired outputs, based on a set of examples with known output (labeled examples). Unsupervised learning algorithms find patterns and relationships over a given set of inputs (unlabeled examples). Other categories of machine learning are semi-supervised learning, where an algorithm uses both labeled and unlabeled examples, and reinforcement learning, where an algorithm learns a policy of how to act given an observation of the world.

Download Full-text

Machine Learning and Data Mining in Bioinformatics

Handbook of Research on Innovations in Database Technologies and Applications ◽

10.4018/978-1-60566-242-8.ch066 ◽

2009 ◽

pp. 612-621 ◽

Cited By ~ 2

Author(s):

George Tzanis ◽

Christos Berberidis ◽

Ioannis Vlahavas

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Data Mining ◽

Reinforcement Learning ◽

Supervised Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

The World ◽

Supervised Learning Algorithms ◽

Computational Systems

Machine learning is one of the oldest subfields of artificial intelligence and is concerned with the design and development of computational systems that can adapt themselves and learn. The most common machine learning algorithms can be either supervised or unsupervised. Supervised learning algorithms generate a function that maps inputs to desired outputs, based on a set of examples with known output (labeled examples). Unsupervised learning algorithms find patterns and relationships over a given set of inputs (unlabeled examples). Other categories of machine learning are semi-supervised learning, where an algorithm uses both labeled and unlabeled examples, and reinforcement learning, where an algorithm learns a policy of how to act given an observation of the world.

Download Full-text

Evaluation of Deep Reinforcement Learning Algorithms for Autonomous Driving

2020 IEEE Intelligent Vehicles Symposium (IV) ◽

10.1109/iv47402.2020.9304792 ◽

2020 ◽

Author(s):

Marco Stang ◽

Daniel Grimm ◽

Moritz Gaiser ◽

Eric Sax

Keyword(s):

Reinforcement Learning ◽

Learning Algorithms ◽

Autonomous Driving

Download Full-text

Actor–critic-based decision-making method for the artificial intelligence commander in tactical wargames

The Journal of Defense Modeling and Simulation Applications Methodology Technology ◽

10.1177/1548512920954542 ◽

2020 ◽

pp. 154851292095454

Author(s):

Junfeng Zhang ◽

Qing Xue

Keyword(s):

Neural Network ◽

Artificial Intelligence ◽

Decision Making ◽

Reinforcement Learning ◽

Convolutional Neural Network ◽

Difficult Problem ◽

Learning Method ◽

Rule Based ◽

Autonomous Decision ◽

Decision Making Problem

In a tactical wargame, the decisions of the artificial intelligence (AI) commander are critical to the final combat result. Due to the existence of fog-of-war, AI commanders are faced with unknown and invisible information on the battlefield and lack of understanding of the situation, and it is difficult to make appropriate tactical strategies. The traditional knowledge rule-based decision-making method lacks flexibility and autonomy. How to make flexible and autonomous decision-making when facing complex battlefield situations is a difficult problem. This paper aims to solve the decision-making problem of the AI commander by using the deep reinforcement learning (DRL) method. We develop a tactical wargame as the research environment, which contains built-in script AI and supports the machine–machine combat mode. On this basis, an end-to-end actor–critic framework for commander decision making based on the convolutional neural network is designed to represent the battlefield situation and the reinforcement learning method is used to try different tactical strategies. Finally, we carry out a combat experiment between a DRL-based agent and a rule-based agent in a jungle terrain scenario. The result shows that the AI commander who adopts the actor–critic method successfully learns how to get a higher score in the tactical wargame, and the DRL-based agent has a higher winning ratio than the rule-based agent.

Download Full-text

RLXSS: Optimizing XSS Detection Model to Defend Against Adversarial Attacks Based on Reinforcement Learning

Future Internet ◽

10.3390/fi11080177 ◽

2019 ◽

Vol 11 (8) ◽

pp. 177

Author(s):

Yong Fang ◽

Cheng Huang ◽

Yijia Xu ◽

Yang Li

Keyword(s):

Artificial Intelligence ◽

Reinforcement Learning ◽

Learning Algorithms ◽

Attack Detection ◽

Machine Learning Algorithms ◽

Detection Model ◽

Attack Model ◽

Intelligence Models ◽

Artificial Intelligence Models ◽

Adversarial Model

With the development of artificial intelligence, machine learning algorithms and deep learning algorithms are widely applied to attack detection models. Adversarial attacks against artificial intelligence models become inevitable problems when there is a lack of research on the cross-site scripting (XSS) attack detection model for defense against attacks. It is extremely important to design a method that can effectively improve the detection model against attack. In this paper, we present a method based on reinforcement learning (called RLXSS), which aims to optimize the XSS detection model to defend against adversarial attacks. First, the adversarial samples of the detection model are mined by the adversarial attack model based on reinforcement learning. Secondly, the detection model and the adversarial model are alternately trained. After each round, the newly-excavated adversarial samples are marked as a malicious sample and are used to retrain the detection model. Experimental results show that the proposed RLXSS model can successfully mine adversarial samples that escape black-box and white-box detection and retain aggressive features. What is more, by alternately training the detection model and the confrontation attack model, the escape rate of the detection model is continuously reduced, which indicates that the model can improve the ability of the detection model to defend against attacks.

Download Full-text