markovian decision processes
Recently Published Documents


TOTAL DOCUMENTS

85
(FIVE YEARS 1)

H-INDEX

18
(FIVE YEARS 0)



2020 ◽  
Vol 12 (5) ◽  
pp. 15-27
Author(s):  
Fenjiro Youssef ◽  
◽  
Benbrahim Houda

Self-driving car is one of the most amazing applications and most active research of artificial intelligence. It uses end-to-end deep learning models to take orientation and speed decisions, using mainly Convolutional Neural Networks for computer vision, plugged to a fully connected network to output control commands. In this paper, we introduce the Self-driving car domain and the CARLA simulation environment with a focus on the lane-keeping task, then we present the two main end-to-end models, used to solve this problematic, beginning by Deep imitation learning (IL) and specifically the Conditional Imitation Learning (COIL) algorithm, that learns through expert labeled demonstrations, trying to mimic their behaviors, and thereafter, describing Deep Reinforcement Learning (DRL), and precisely DQN and DDPG (respectively Deep Q learning and deep deterministic policy gradient), that uses the concepts of learning by trial and error, while adopting the Markovian decision processes (MDP), to get the best policy for the driver agent. In the last chapter, we compare the two algorithms IL and DRL based on a new approach, with metrics used in deep learning (Loss during training phase) and Self-driving car (the episode's duration before a crash and Average distance from the road center during the testing phase). The results of the training and testing on CARLA simulator reveals that the IL algorithm performs better than DRL algorithm when the agents are already trained on a given circuit, but DRL agents show better adaptability when they are on new roads.



Author(s):  
Karl Hinderer ◽  
Ulrich Rieder ◽  
Michael Stieglitz


Author(s):  
Karl Hinderer ◽  
Ulrich Rieder ◽  
Michael Stieglitz


Author(s):  
Karl Hinderer ◽  
Ulrich Rieder ◽  
Michael Stieglitz


2016 ◽  
Vol 49 (12) ◽  
pp. 35-40 ◽  
Author(s):  
E. Moser ◽  
N. Stricker ◽  
C. Liebrecht ◽  
A. Hiller ◽  
M. Ziegler ◽  
...  


Author(s):  
Karl Hinderer ◽  
Ulrich Rieder ◽  
Michael Stieglitz


Author(s):  
Karl Hinderer ◽  
Ulrich Rieder ◽  
Michael Stieglitz


2014 ◽  
Vol 2014 ◽  
pp. 1-14 ◽  
Author(s):  
Ianire Taboada ◽  
Fidel Liberal

This paper deals with the resource allocation problem aimed at maximizing users’ perception of quality in wireless channels with time-varying capacity. First of all, we model the subjective quality-aware scheduling problem in the framework of Markovian decision processes. Then, given that the obtaining of the optimal solution of this model is unachievable, we propose a simple scheduling index rule with closed-form expression by using a methodology based on Whittle approach. Finally, we analyze the performance of the achieved scheduling proposal in several relevant scenarios, concluding that it outperforms the most popular existing resource allocation strategies.



Sign in / Sign up

Export Citation Format

Share Document