scholarly journals A Fast Integrated Planning and Control Framework for Autonomous Driving via Imitation Learning

Author(s):  
Liting Sun ◽  
Cheng Peng ◽  
Wei Zhan ◽  
Masayoshi Tomizuka

Safety and efficiency are two key elements for planning and control in autonomous driving. Theoretically, model-based optimization methods, such as Model Predictive Control (MPC), can provide such optimal driving policies. Their computational complexity, however, grows exponentially with horizon length and number of surrounding vehicles. This makes them impractical for real-time implementation, particularly when nonlinear models are considered. To enable a fast and approximately optimal driving policy, we propose a safe imitation framework, which contains two hierarchical layers. The first layer, defined as the policy layer, is represented by a neural network that imitates a long-term expert driving policy via imitation learning. The second layer, called the execution layer, is a short-term model-based optimal controller that tracks and further fine-tunes the reference trajectories proposed by the policy layer with guaranteed short-term collision avoidance. Moreover, to reduce the distribution mismatch between the training set and the real world, Dataset Aggregation is utilized so that the performance of the policy layer can be improved from iteration to iteration. Several highway driving scenarios are demonstrated in simulations, and the results show that the proposed framework can achieve similar performance as sophisticated long-term optimization approaches but with significantly improved computational efficiency.

2021 ◽  
Vol 44 ◽  
Author(s):  
Peter Dayan

Abstract We use neural reinforcement learning concepts including Pavlovian versus instrumental control, liking versus wanting, model-based versus model-free control, online versus offline learning and planning, and internal versus external actions and control to reflect on putative conflicts between short-term temptations and long-term goals.


2017 ◽  
Vol 40 ◽  
Author(s):  
Nisheeth Srivastava ◽  
Narayanan Srinivasan

AbstractWe suggest that steep intertemporal discounting in individuals of low socioeconomic status (SES) may arise as a rational metacognitive adaptation to experiencing planning and control failures in long-term plans. Low SES individuals' plans fail more frequently because they operate close to budgetary boundaries, in turn because they consistently operate with limited budgets of money, status, trust, or other forms of social utility.


2016 ◽  
Vol 138 (6) ◽  
Author(s):  
Yi Ren ◽  
Alparslan Emrah Bayrak ◽  
Panos Y. Papalambros

We compare the performance of human players against that of the efficient global optimization (EGO) algorithm for an NP-complete powertrain design and control problem. Specifically, we cast this optimization problem as an online competition and received 2391 game plays by 124 anonymous players during the first month from launch. We found that while only a small portion of human players can outperform the algorithm in the long term, players tend to formulate good heuristics early on that can be used to constrain the solution space. Such constraining of the search enhances algorithm efficiency, even for different game settings. These findings indicate that human-assisted computational searches are promising in solving comprehensible yet computationally hard optimal design and control problems, when human players can outperform the algorithm in a short term.


2015 ◽  
Vol 56 (7) ◽  
pp. 835-844 ◽  
Author(s):  
Yehong Wang ◽  
Rui Zhu ◽  
Jim Xiao ◽  
John C. Davis ◽  
Jaap W. Mandema ◽  
...  

Temida ◽  
2003 ◽  
Vol 6 (4) ◽  
pp. 3-13 ◽  
Author(s):  
Vesna Nikolic-Ristanovic

In this paper the author explores, focusing largely on the example of the Balkans, the connection between the expansion of neoliberal market economy and war, and related to it the growth of illegal markets and the shadow economy, on one hand, and the victimisation by human trafficking, on the other. By locating human trade within expanding local and global illegal markets, the author is arguing that, without taking into consideration wider social contexts, which create structural incentives for illegal markets and transnational organised crime, we can hardly understand the causes, let alone build effective strategies to combat and prevent it. Consequently, on the basis of the analyses of human trade as a form of both transnational organised crime and illegal markets, some strategies (short-term and long-term) for the prevention and control of human trafficking on both the micro and macro level are suggested.


Sign in / Sign up

Export Citation Format

Share Document