SHIV: Reducing supervisor burden in DAgger using support vectors for efficient learning from demonstrations in high dimensional state spaces

Improving the accuracy and robustness of deep neural nets (DNNs) and adapting them to small training sets are primary tasks in deep learning (DL) research. In this paper, we replace the output activation function of DNNs, typically the data-agnostic softmax function, with a graph Laplacian-based high-dimensional interpolating function which, in the continuum limit, converges to the solution of a Laplace–Beltrami equation on a high-dimensional manifold. Furthermore, we propose end-to-end training and testing algorithms for this new architecture. The proposed DNN with graph interpolating activation integrates the advantages of both deep learning and manifold learning. Compared to conventional DNNs with the softmax function as output activation, the new framework demonstrates the following major advantages: First, it is better suited to data-efficient learning, in which we train high-capacity DNNs without a large amount of training data. Second, it remarkably improves both natural accuracy on clean images and robust accuracy on adversarial images crafted by both white-box and black-box adversarial attacks. Third, it is a natural choice for semi-supervised learning. This paper is a significant extension of our earlier work published at NeurIPS 2018. For reproducibility, the code is available at https://github.com/BaoWangMath/DNN-DataDependentActivation.
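To make the idea of a graph Laplacian-based interpolating function concrete, here is a minimal sketch of plain harmonic label interpolation on a similarity graph. It is not the algorithm from the paper or its repository: the function name `harmonic_interpolate`, the Gaussian edge weights, the `sigma` bandwidth, and the toy data are all assumptions chosen for illustration. Each unlabeled point's class scores are repeatedly replaced by the weighted average of its neighbors' scores, which is a Gauss-Seidel iteration for the discrete Laplace equation with the labeled points held fixed.

```python
import math


def harmonic_interpolate(points, labels, sigma=1.0, iters=500):
    """Interpolate class scores over a graph (illustrative sketch only).

    points: list of feature tuples.
    labels: dict mapping a point index to its fixed list of class scores.
    Returns a list with one class-score list per point.
    """
    n = len(points)
    k = len(next(iter(labels.values())))

    def weight(i, j):
        # Gaussian similarity; an assumed choice of edge weight.
        d2 = sum((a - b) ** 2 for a, b in zip(points[i], points[j]))
        return math.exp(-d2 / (2.0 * sigma ** 2))

    # Dense weight matrix with zero diagonal (no self-loops).
    W = [[0.0 if i == j else weight(i, j) for j in range(n)] for i in range(n)]

    # Labeled points keep their scores; unlabeled ones start uniform.
    u = [list(labels[i]) if i in labels else [1.0 / k] * k for i in range(n)]

    for _ in range(iters):
        for i in range(n):
            if i in labels:
                continue
            total = sum(W[i])
            # Harmonic condition: u_i equals the weighted mean of its neighbors.
            u[i] = [
                sum(W[i][j] * u[j][c] for j in range(n)) / total
                for c in range(k)
            ]
    return u


# Toy data: four points on a line, the two endpoints labeled.
points = [(0.0,), (0.2,), (0.9,), (1.0,)]
labels = {0: [1.0, 0.0], 3: [0.0, 1.0]}
scores = harmonic_interpolate(points, labels, sigma=0.3)
predicted = [row.index(max(row)) for row in scores]
```

In this toy setup the unlabeled point near 0.0 takes the class of the left endpoint and the one near 1.0 takes the class of the right endpoint. Unlike softmax, the output for a point depends on the other points in the graph, which is what "data-dependent activation" refers to.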

Efficient learning methods for high dimensional visual data

DOI: 10.32657/10356/65624
Year: 2015
Author(s): Marcus Caixing Chen
Keyword(s): High Dimensional, Visual Data, Learning Methods, Efficient Learning

Efficient learning algorithms for episodic tasks with acyclic state spaces

Venue: 2006 IEEE International Conference on Automation Science and Engineering
DOI: 10.1109/coase.2006.326917
Year: 2006
Author(s): Spyros Reveliotis, Theologos Bountourelis
Keyword(s): Learning Algorithms, State Spaces, Efficient Learning

Particle algorithms for filtering in high dimensional state spaces: A case study in group object tracking

Venue: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI: 10.1109/icassp.2011.5947712
Year: 2011
Cited By ~ 3
Author(s): Lyudmila Mihaylova, Avishy Carmi
Keyword(s): Object Tracking, High Dimensional, State Spaces, Group Object

Bayesian Experience Reuse for Learning from Multiple Demonstrators

Venue: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence
DOI: 10.24963/ijcai.2021/334
Year: 2021
Author(s): Mike Gimelfarb, Scott Sanner, Chi-Guhn Lee
Keyword(s): Neural Networks, Dynamic Decision Making, Quadratic Program, High Dimensional, Inverse Gamma, Learning From Demonstrations, Powerful Approach, Experience Reuse, Task Goals, Conflicting Goals

Learning from Demonstrations (LfD) is a powerful approach for incorporating advice from experts in the form of demonstrations. However, demonstrations often come from multiple sub-optimal experts with conflicting goals, which makes them difficult to incorporate effectively in online settings. To address this, we formulate a quadratic program whose solution yields an adaptive weighting over experts that can be used to sample experts with relevant goals. To compare different source and target task goals safely, we model their uncertainty using normal-inverse-gamma priors, whose posteriors are learned from demonstrations using Bayesian neural networks with a shared encoder. Our resulting approach, which we call Bayesian Experience Reuse, can be applied for LfD in static and dynamic decision-making settings. We demonstrate its effectiveness for minimizing multi-modal functions and optimizing a high-dimensional supply chain with cost uncertainty, where it is also shown to improve upon the performance of the demonstrators' policies.
