Better Approximations of High Dimensional Smooth Functions by Deep Neural Networks with Rectified Power Units

We demonstrate that deep neural networks with the ReLU activation function can efficiently approximate the solutions of various types of parametric linear transport equations. For non-smooth initial conditions, the solutions of these PDEs are high-dimensional and non-smooth. Therefore, approximation of these functions suffers from a curse of dimension. We demonstrate that through their inherent compositionality deep neural networks can resolve the characteristic flow underlying the transport equations and thereby allow approximation rates independent of the parameter dimension.

A Black-Box Approach to Generate Adversarial Examples Against Deep Neural Networks for High Dimensional Input

2019 IEEE Fourth International Conference on Data Science in Cyberspace (DSC) ◽ 10.1109/dsc.2019.00078 ◽ 2019

Author(s): Chengru Song ◽ Changqiao Xu ◽ Shujie Yang ◽ Zan Zhou ◽ Changhui Gong

Keyword(s): Neural Networks ◽ Deep Neural Networks ◽ Black Box ◽ High Dimensional ◽ Adversarial Examples

Intrinsic motivation and episodic memories for robot exploration of high-dimensional sensory spaces

Adaptive Behavior ◽ 10.1177/1059712320922916 ◽ 2020 ◽ pp. 105971232092291

Author(s): Guido Schillaci ◽ Antonio Pico Villalpando ◽ Verena V Hafner ◽ Peter Hanappe ◽ David Colliaux ◽ ...

Keyword(s): Neural Networks ◽ Episodic Memory ◽ Intrinsic Motivation ◽ Computational Models ◽ Deep Neural Networks ◽ Image Sensor ◽ Forward Kinematics ◽ High Dimensional ◽ Episodic Memories ◽ Low Dimensional

This work presents an architecture that generates curiosity-driven goal-directed exploration behaviours for an image sensor of a microfarming robot. A combination of deep neural networks for offline unsupervised learning of low-dimensional features from images and of online learning of shallow neural networks representing the inverse and forward kinematics of the system has been used. The artificial curiosity system assigns interest values to a set of pre-defined goals and drives the exploration towards those that are expected to maximise the learning progress. We propose the integration of an episodic memory in intrinsic motivation systems to face catastrophic forgetting issues, typically experienced when performing online updates of artificial neural networks. Our results show that adopting an episodic memory system not only prevents the computational models from quickly forgetting knowledge that has been previously acquired but also provides new avenues for modulating the balance between plasticity and stability of the models.

Simulator-free solution of high-dimensional stochastic elliptic partial differential equations using deep neural networks

Journal of Computational Physics ◽ 10.1016/j.jcp.2019.109120 ◽ 2020 ◽ Vol 404 ◽ pp. 109120 ◽ Cited By ~ 9

Author(s): Sharmila Karumuri ◽ Rohit Tripathy ◽ Ilias Bilionis ◽ Jitesh Panchal

Keyword(s): Neural Networks ◽ Partial Differential Equations ◽ Differential Equations ◽ Deep Neural Networks ◽ Elliptic Partial Differential Equations ◽ High Dimensional ◽ Free Solution ◽ Partial Differential

Deep Neural Networks for High Dimension, Low Sample Size Data

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽ 10.24963/ijcai.2017/318 ◽ 2017 ◽ Cited By ~ 24

Author(s): Bo Liu ◽ Ying Wei ◽ Yu Zhang ◽ Qiang Yang

Keyword(s): Neural Networks ◽ Sample Size ◽ High Dimension ◽ Deep Neural Networks ◽ Genetic Data ◽ High Dimensional ◽ Large Sample Size ◽ Prediction Problem ◽ The Stability ◽ Size Data

Deep neural networks (DNN) have achieved breakthroughs in applications with large sample size. However, when facing high dimension, low sample size (HDLSS) data, such as the phenotype prediction problem using genetic data in bioinformatics, DNN suffers from overfitting and high-variance gradients. In this paper, we propose a DNN model tailored for the HDLSS data, named Deep Neural Pursuit (DNP). DNP selects a subset of high dimensional features for the alleviation of overfitting and takes the average over multiple dropouts to calculate gradients with low variance. As the first DNN method applied on the HDLSS data, DNP enjoys the advantages of the high nonlinearity, the robustness to high dimensionality, the capability of learning from a small number of samples, the stability in feature selection, and the end-to-end training. We demonstrate these advantages of DNP via empirical results on both synthetic and real-world biological datasets.

Application of deep neural networks for high-dimensional large BWR core neutronics

Nuclear Engineering and Technology ◽ 10.1016/j.net.2020.05.010 ◽ 2020 ◽ Vol 52 (12) ◽ pp. 2709-2716

Author(s): Rabie Abu Saleem ◽ Majdi I. Radaideh ◽ Tomasz Kozlowski

Keyword(s): Neural Networks ◽ Deep Neural Networks ◽ High Dimensional

Archetypal landscapes for deep neural networks

Proceedings of the National Academy of Sciences ◽ 10.1073/pnas.1919995117 ◽ 2020 ◽ Vol 117 (36) ◽ pp. 21857-21864

Author(s): Philipp C. Verpoort ◽ Alpha A. Lee ◽ David J. Wales

Keyword(s): Neural Networks ◽ Learning Community ◽ Gradient Descent ◽ Deep Neural Networks ◽ Loss Functions ◽ Stochastic Gradient Descent ◽ High Dimensional ◽ Local Minima ◽ High Loss ◽ Optimization Schemes

The predictive capabilities of deep neural networks (DNNs) continue to evolve to increasingly impressive levels. However, it is still unclear how training procedures for DNNs succeed in finding parameters that produce good results for such high-dimensional and nonconvex loss functions. In particular, we wish to understand why simple optimization schemes, such as stochastic gradient descent, do not end up trapped in local minima with high loss values that would not yield useful predictions. We explain the optimizability of DNNs by characterizing the local minima and transition states of the loss-function landscape (LFL) along with their connectivity. We show that the LFL of a DNN in the shallow network or data-abundant limit is funneled, and thus easy to optimize. Crucially, in the opposite low-data/deep limit, although the number of minima increases, the landscape is characterized by many minima with similar loss values separated by low barriers. This organization is different from the hierarchical landscapes of structural glass formers and explains why minimization procedures commonly employed by the machine-learning community can navigate the LFL successfully and reach low-lying solutions.
