scholarly journals Empirical Bounds on Linear Regions of Deep Rectifier Networks

2020 ◽  
Vol 34 (04) ◽  
pp. 5628-5635 ◽  
Author(s):  
Thiago Serra ◽  
Srikumar Ramalingam

We can compare the expressiveness of neural networks that use rectified linear units (ReLUs) by the number of linear regions, which reflect the number of pieces of the piecewise linear functions modeled by such networks. However, enumerating these regions is prohibitive and the known analytical bounds are identical for networks with same dimensions. In this work, we approximate the number of linear regions through empirical bounds based on features of the trained network and probabilistic inference. Our first contribution is a method to sample the activation patterns defined by ReLUs using universal hash functions. This method is based on a Mixed-Integer Linear Programming (MILP) formulation of the network and an algorithm for probabilistic lower bounds of MILP solution sets that we call MIPBound, which is considerably faster than exact counting and reaches values in similar orders of magnitude. Our second contribution is a tighter activation-based bound for the maximum number of linear regions, which is particularly stronger in networks with narrow layers. Combined, these bounds yield a fast proxy for the number of linear regions of a deep neural network.

Author(s):  
Arturo Sarmiento-Reyes ◽  
Luis Hernandez-Martinez ◽  
Miguel Angel Gutierrez de Anda ◽  
Francisco Javier Castro Gonzalez

We describe a sense in which mesh duality is equivalent to Legendre duality. That is, a general pair of meshes, which satisfy a definition of duality for meshes, are shown to be the projection of a pair of piecewise linear functions that are dual to each other in the sense of a Legendre dual transformation. In applications the latter functions can be a tangent plane approximation to a smoother function, and a chordal plane approximation to its Legendre dual. Convex examples include one from meteorology, and also the relation between the Delaunay mesh and the Voronoi tessellation. The latter are shown to be the projections of tangent plane and chordal approximations to the same paraboloid.


Sign in / Sign up

Export Citation Format

Share Document