Implementation of modified SARSA learning technique in EMCAP

2017 ◽  
Vol 7 (1.5) ◽  
pp. 274
Author(s):  
D. Ganesha ◽  
Vijayakumar Maragal Venkatamuni

This research work presents an analysis of a modified SARSA learning algorithm. State-Action-Reward-State-Action (SARSA) is a technique for learning a Markov decision process (MDP) policy, used for reinforcement learning in the fields of artificial intelligence (AI) and machine learning (ML). The modified SARSA algorithm selects better actions to obtain better rewards. Experiments were conducted to evaluate the performance of each agent individually, and the same statistics were collected to compare results across agents. This work considered varied kinds of agents at different levels of the architecture for the experimental analysis. The Fungus World testbed, implemented in SWI-Prolog 5.4.6, was used for the experiments. Fixed obstacles make the environment more versatile, creating locations specific to the Fungus World testbed. Various parameters are introduced into the environment to test an agent's performance. The modified SARSA learning algorithm is well suited to the EMCAP architecture. The experiments show that the modified SARSA learning system obtains more rewards than the existing SARSA algorithm.
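As a point of reference, the on-policy SARSA update the abstract builds on can be sketched as follows. The environment interface, hyperparameters, and tabular representation here are illustrative assumptions for a generic episodic task; they are not the EMCAP/Fungus World implementation, which the paper states was written in SWI-Prolog.

```python
import random

def sarsa(env_step, n_states, n_actions, episodes=500,
          alpha=0.1, gamma=0.9, epsilon=0.1):
    """Tabular SARSA with an epsilon-greedy policy.

    env_step(state, action) must return (next_state, reward, done).
    All names and defaults are illustrative, not from the paper.
    """
    Q = [[0.0] * n_actions for _ in range(n_states)]

    def policy(s):
        if random.random() < epsilon:
            return random.randrange(n_actions)
        return max(range(n_actions), key=lambda a: Q[s][a])

    for _ in range(episodes):
        s = 0                      # assume episodes start in state 0
        a = policy(s)
        done = False
        while not done:
            s2, r, done = env_step(s, a)
            a2 = policy(s2)
            # On-policy update: bootstraps on the action actually chosen next,
            # which is what distinguishes SARSA from Q-learning.
            Q[s][a] += alpha * (r + gamma * Q[s2][a2] * (not done) - Q[s][a])
            s, a = s2, a2
    return Q
```

A "modified" SARSA in the paper's sense would alter the action-selection or update step to favor higher-reward actions; the skeleton above shows only the standard rule.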

2020 ◽  
Vol 17 (4A) ◽  
pp. 677-682
Author(s):  
Adnan Shaout ◽  
Brennan Crispin

This paper presents a method using neural networks and a Markov decision process (MDP) to identify the source and class of video streaming services. The paper presents the design and implementation of an end-to-end pipeline for training and classifying a machine learning system that can take in packets collected over a network interface and classify the data stream as belonging to one of five streaming video services: YouTube, YouTube TV, Netflix, Amazon Prime, or HBO.
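The classification step can be illustrated with a deliberately minimal sketch: reduce a captured flow to summary features and assign it to the nearest service profile. The features (mean packet size, mean inter-arrival gap), the centroid classifier, and all values are assumptions for illustration; the paper itself uses trained neural networks, not this rule.

```python
# Hypothetical per-flow features and a nearest-centroid assignment.
SERVICES = ["YouTube", "YouTube TV", "Netflix", "Amazon Prime", "HBO"]

def extract_features(packet_sizes, inter_arrivals):
    """Reduce one captured flow to (mean packet size, mean gap)."""
    mean_size = sum(packet_sizes) / len(packet_sizes)
    mean_gap = sum(inter_arrivals) / max(len(inter_arrivals), 1)
    return (mean_size, mean_gap)

def classify(features, centroids):
    """Assign the flow to the service with the nearest feature centroid."""
    best, best_d = None, float("inf")
    for name, c in centroids.items():
        d = sum((f - x) ** 2 for f, x in zip(features, c))
        if d < best_d:
            best, best_d = name, d
    return best
```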


Author(s):  
Abdelghafour Harraz ◽  
Mostapha Zbakh

Artificial intelligence makes it possible to build engines that explore and learn environments, and thereby create policies to control them in real time with no human intervention. Through its reinforcement learning component, using frameworks such as temporal differences, State-Action-Reward-State-Action (SARSA), and Q-learning, to name a few, it can be applied to any system that can be perceived as a Markov decision process. This opens the door to applying reinforcement learning to cloud load balancing, so that load can be dispatched dynamically to a given cloud system. The authors describe different techniques that can be used to implement a reinforcement learning based engine in a cloud system.
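One of the frameworks named above, Q-learning, can be sketched for the load-dispatch setting as follows. The state encoding (a discretised load level), the reward interface, and all hyperparameters are illustrative assumptions, not the authors' engine; the point is only the off-policy update, which contrasts with SARSA's on-policy one.

```python
import random

def q_learning_dispatch(load_levels, n_servers, step, episodes=300,
                        alpha=0.2, gamma=0.9, epsilon=0.1):
    """Q-learning sketch for dispatching incoming load to one of n_servers.

    State: discretised system load level; action: target server.
    step(state, action) -> (next_state, reward), e.g. reward could be
    negative response time. All of this is an illustrative assumption.
    """
    Q = {(s, a): 0.0 for s in range(load_levels) for a in range(n_servers)}
    s = 0
    for _ in range(episodes):
        if random.random() < epsilon:
            a = random.randrange(n_servers)
        else:
            a = max(range(n_servers), key=lambda x: Q[(s, x)])
        s2, r = step(s, a)
        # Off-policy update: bootstraps on the greedy action in s2,
        # regardless of which action will actually be taken there.
        best_next = max(Q[(s2, x)] for x in range(n_servers))
        Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])
        s = s2
    return Q
```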


2021 ◽  
Vol 2021 ◽  
pp. 1-7
Author(s):  
Sulaiman Khan ◽  
Habib Ullah Khan ◽  
Shah Nazir

In computer vision and artificial intelligence, text recognition and analysis based on images play a key role in the text retrieval process. Enabling a machine learning technique to recognize handwritten characters of a specific language requires a standard dataset. Acceptable handwritten character datasets are available for many languages, including English and Arabic. However, the lack of a dataset for handwritten Pashto characters hinders the application of suitable machine learning algorithms for extracting useful insights. To address this issue, this study presents the first handwritten Pashto characters image dataset (HPCID) for scientific research. The dataset consists of 14,784 samples: 336 samples for each of the 44 characters in the Pashto character set. The handwritten character samples were collected on A4-sized paper from students of the Pashto Department at the University of Peshawar, Khyber Pakhtunkhwa, Pakistan. In total, 336 students and faculty members contributed to the database accumulation phase. The dataset contains characters of multiple sizes, fonts, styles, and structures.
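The stated layout (44 characters, 336 samples each, 14,784 images total) is easy to sanity-check programmatically. The directory structure assumed below (`root/<character_id>/<sample>` with one folder per character) is a hypothetical convention for illustration; the paper does not specify how the released files are organised.

```python
import os

# Figures taken from the abstract above.
N_CHARACTERS = 44
SAMPLES_PER_CHARACTER = 336

def expected_total():
    """Total image count implied by the abstract: 44 * 336."""
    return N_CHARACTERS * SAMPLES_PER_CHARACTER

def verify_layout(root):
    """Check a hypothetical root/<char_id>/ layout: 44 character folders,
    each holding exactly 336 sample files."""
    char_dirs = sorted(os.listdir(root))
    if len(char_dirs) != N_CHARACTERS:
        return False
    return all(
        len(os.listdir(os.path.join(root, d))) == SAMPLES_PER_CHARACTER
        for d in char_dirs
    )
```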


Author(s):  
Marek Laskowski

Science is on the verge of practical agent-based modeling decision support systems capable of machine learning for healthcare policy decision support. This paper presents the details of integrating an agent-based model of a hospital emergency department with a genetic programming (GP) machine learning system. A novel GP heuristic, or extension, is introduced to better represent the Markov decision process that underlies agent decision making in an unknown environment. The capabilities of the resulting prototype for automated hypothesis generation in the context of healthcare policy decision support are demonstrated by automatically generating patient flow and infection spread prevention policies. Finally, some observations are made regarding moving forward from the prototype stage.


2019 ◽  
Vol 16 (1) ◽  
pp. 172988141982891
Author(s):  
Mao Zheng ◽  
Fangqing Yang ◽  
Zaopeng Dong ◽  
Shuo Xie ◽  
Xiumin Chu

Efficiency and safety are vital for aviation operations in order to improve the combat capacity of an aircraft carrier. In this article, the theory of apprenticeship learning, a kind of artificial intelligence technology, is applied to constructing an automated scheduling method. First, a simulation model of aircraft launch and recovery was established within a Markov decision process framework. Second, the multiplicative weights apprenticeship learning algorithm was applied to create an optimized scheduling policy. When an expert is available to learn from, the learned policy matches the expert's demonstration quite well, and the total deviations can be limited to within 3%. Finally, without an expert's demonstration, the policy generated by the multiplicative weights apprenticeship learning algorithm shows an obvious superiority over the three human experts. The results for different operating situations show that the method is highly robust and functions well.
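Assuming the "multiplicative weights apprenticeship learning algorithm" is MWAL in the sense of Syed and Schapire, its core step can be sketched as below: the reward is a weighted mix of features, and after each round the weights are multiplicatively re-scaled by how the current policy's feature expectations compare to the expert's. The feature values, `beta`, and `gamma` here are illustrative, not the article's scheduling model.

```python
def mwal_weight_update(weights, mu_policy, mu_expert, beta=0.9, gamma=0.9):
    """One multiplicative-weights round (MWAL-style sketch).

    weights: current distribution over k reward features
    mu_policy, mu_expert: feature expectations of the learned policy
    and the expert demonstration.

    With beta < 1, weight shrinks on features where the policy already
    matches or beats the expert, focusing the next reward on features
    where it still falls short.
    """
    def g(mp, me):
        # Map the per-feature gap into [0, 1], as in MWAL.
        return ((1 - gamma) * (mp - me) + 2) / 4

    new = [w * beta ** g(mp, me)
           for w, mp, me in zip(weights, mu_policy, mu_expert)]
    total = sum(new)
    return [w / total for w in new]   # renormalise to a distribution
```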


2021 ◽  
Vol 10 (2) ◽  
pp. 110
Author(s):  
Ruy Lopez-Rios

The paper deals with a discrete-time consumption-investment problem with an infinite horizon. The problem is formulated as a Markov decision process with expected total discounted utility as the objective function. The paper presents a procedure to approximate the solution via machine learning, specifically a Q-learning technique. Numerical results for the problem are provided.
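A Q-learning approximation of such a problem can be sketched as follows. The discretisation (wealth levels as states, consumption fractions as actions), the log utility, and the deterministic savings-growth dynamics are all illustrative assumptions; they stand in for the paper's actual model, which is not reproduced here.

```python
import math
import random

def qlearn_consumption(wealth_levels=10, actions=(0.2, 0.5, 0.8),
                       episodes=2000, alpha=0.1, gamma=0.95, epsilon=0.1):
    """Q-learning sketch for an infinite-horizon consumption-investment MDP.

    State: discretised wealth level; action: fraction of wealth consumed;
    reward: log utility of consumption, discounted by gamma. The dynamics
    (savings grow by a fixed factor) are an illustrative assumption.
    """
    Q = [[0.0] * len(actions) for _ in range(wealth_levels)]
    w = wealth_levels - 1                     # start at the highest level
    for _ in range(episodes):
        if random.random() < epsilon:
            a = random.randrange(len(actions))
        else:
            a = max(range(len(actions)), key=lambda i: Q[w][i])
        consume = actions[a] * (w + 1)
        reward = math.log(1 + consume)        # utility of consumption
        # Remaining wealth grows by 30%, then is mapped back to a level.
        w2 = min(wealth_levels - 1,
                 max(0, round((w + 1 - consume) * 1.3) - 1))
        Q[w][a] += alpha * (reward + gamma * max(Q[w2]) - Q[w][a])
        w = w2
    return Q
```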


Author(s):  
Md Mahmudul Hasan ◽  
Md Shahinur Rahman ◽  
Adrian Bell

Deep reinforcement learning (DRL) has transformed the field of artificial intelligence (AI), especially after the success of Google DeepMind. This branch of machine learning represents a step toward building autonomous systems with an understanding of the visual world. DRL is currently applied to various sorts of problems that were previously intractable. In this chapter, the authors start with an introduction to the general field of reinforcement learning (RL) and the Markov decision process (MDP). They then clarify the common DRL framework and the necessary components of RL settings. Moreover, they analyze stochastic gradient descent (SGD) based optimizers such as ADAM and a non-specific multi-policy selection mechanism in a multi-objective Markov decision process. The chapter also includes a comparison of different deep Q-networks. In conclusion, the authors describe several challenges and trends in research within the deep reinforcement learning field.
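The ADAM optimizer mentioned above maintains, per parameter, exponential moving averages of the gradient and the squared gradient, with bias correction for the early steps. A minimal sketch (scalar parameter lists rather than tensors, default constants from the original method) follows; the training loop and function names are illustrative.

```python
import math

def adam_init(n):
    """State for n parameters: step count and first/second moment averages."""
    return {"t": 0, "m": [0.0] * n, "v": [0.0] * n}

def adam_step(params, grads, state, lr=0.001, b1=0.9, b2=0.999, eps=1e-8):
    """One ADAM update over a flat list of parameters."""
    state["t"] += 1
    t = state["t"]
    out = []
    for i, (p, g) in enumerate(zip(params, grads)):
        state["m"][i] = b1 * state["m"][i] + (1 - b1) * g        # 1st moment
        state["v"][i] = b2 * state["v"][i] + (1 - b2) * g * g    # 2nd moment
        m_hat = state["m"][i] / (1 - b1 ** t)    # bias correction
        v_hat = state["v"][i] / (1 - b2 ** t)
        out.append(p - lr * m_hat / (math.sqrt(v_hat) + eps))
    return out
```

Minimizing a simple quadratic with this step (a few hundred iterations with a larger learning rate) drives the parameter toward zero, which is a quick way to check the update is wired correctly.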


1993 ◽  
Vol 18 (2-4) ◽  
pp. 209-220
Author(s):  
Michael Hadjimichael ◽  
Anita Wasilewska

We present here an application of Rough Set formalism to Machine Learning. The resulting Inductive Learning algorithm is described, and its application to a set of real data is examined. The data consists of a survey of voter preferences taken during the 1988 presidential election in the U.S.A. Results include an analysis of the predictive accuracy of the generated rules, and an analysis of the semantic content of the rules.
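The rough set machinery behind such rule induction rests on indiscernibility classes: objects with identical values on the chosen attributes. A concept's lower approximation collects the classes that lie entirely inside it; rules built from those classes are certain. The toy data below is illustrative, not the 1988 survey data.

```python
from collections import defaultdict

def lower_approximation(objects, attrs, target):
    """Rough-set lower approximation of a concept.

    objects: dict name -> {attribute: value}
    attrs:   attributes defining the indiscernibility relation
    target:  set of object names forming the concept
    Returns the objects that certainly belong to the concept.
    """
    classes = defaultdict(set)
    for name, vals in objects.items():
        # Objects with the same attribute values are indiscernible.
        classes[tuple(vals[a] for a in attrs)].add(name)
    lower = set()
    for cls in classes.values():
        if cls <= target:          # whole class inside the concept: certain
            lower |= cls
    return lower
```

An induced rule then reads off the attribute values of each class in the lower approximation as a certain decision rule for the concept.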
