New Error Function for Single Hidden Layer Feedforward Neural Networks

Author(s):  
Leong Kwan Li ◽  
Richard Chak Hong Lee

Symmetry ◽  
2018 ◽  
Vol 10 (10) ◽  
pp. 525 ◽  
Author(s):  
Habtamu Alemu ◽  
Wei Wu ◽  
Junhong Zhao

In this paper, we propose a group Lasso regularization term as a hidden-layer regularization method for feedforward neural networks. Adding a group Lasso term to the standard error function is an effective way to eliminate redundant or unnecessary hidden-layer neurons from the network structure. For comparison, the popular Lasso regularization method is also introduced into the standard error function of the network. Our hidden-layer regularization method forces each group of outgoing weights to shrink during training, so that the corresponding neurons can be removed once training is complete. This simplifies the network structure and reduces the computational cost. Numerical simulations use K-fold cross-validation with K = 5 to avoid overtraining and to select the best learning parameters. The results show that the proposed method consistently prunes more redundant hidden-layer neurons on each benchmark dataset without loss of accuracy. In contrast, the existing Lasso regularization method prunes only redundant weights of the network and cannot prune any redundant hidden-layer neurons.
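The core idea above can be sketched in a few lines: group the outgoing weights of each hidden neuron into one row of the output weight matrix, and penalize the sum of the Euclidean norms of these rows, so that whole rows (and hence whole neurons) are driven toward zero. This is a minimal illustrative sketch, not the authors' implementation; the function names and the pruning threshold are assumptions.

```python
import numpy as np

def group_lasso_penalty(W_out, lam):
    # W_out: (n_hidden, n_output) outgoing weights, one row per hidden neuron.
    # Group Lasso sums the Euclidean norm of each row, so the penalty is
    # minimized by zeroing entire rows rather than scattered single weights.
    return lam * np.sum(np.linalg.norm(W_out, axis=1))

def regularized_error(y_true, y_pred, W_out, lam):
    # Standard squared error plus the group Lasso hidden-layer term.
    mse = 0.5 * np.mean((y_true - y_pred) ** 2)
    return mse + group_lasso_penalty(W_out, lam)

# After training, a neuron whose whole outgoing row is (near) zero is redundant
# and can be pruned from the network.  The 1e-6 threshold is illustrative.
W = np.array([[0.0, 0.0],   # neuron 0: all outgoing weights zero -> prunable
              [3.0, 4.0]])  # neuron 1: active
prunable = np.linalg.norm(W, axis=1) < 1e-6
```

By contrast, the plain Lasso penalty `lam * np.sum(np.abs(W_out))` can zero individual weights, but a neuron is only removable when *all* of its outgoing weights vanish at once, which the group norm encourages and the element-wise norm does not.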


IEEE Access ◽  
2019 ◽  
Vol 7 ◽  
pp. 9540-9557 ◽  
Author(s):  
Habtamu Zegeye Alemu ◽  
Junhong Zhao ◽  
Feng Li ◽  
Wei Wu

1994 ◽  
Vol 6 (2) ◽  
pp. 319-333 ◽  
Author(s):  
Michel Benaim

Feedforward neural networks with a single hidden layer of normalized Gaussian units are studied. It is proved that such networks are capable of universal approximation in a satisfactory sense. A hybrid learning rule, following Moody and Darken, that combines unsupervised learning of the hidden units with supervised learning of the output units is then considered. Using the method of ordinary differential equations for adaptive algorithms (the ODE method), it is shown that the asymptotic properties of the learning rule can be studied in terms of an autonomous cascade of dynamical systems. Recent results of Hirsch on cascades are then used to establish the asymptotic stability of the learning rule.
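A normalized Gaussian hidden layer can be sketched as follows: each unit responds with a Gaussian bump around its center, and the activations are normalized to sum to one, so the hidden layer computes a softmax-like partition of the input space. This is a minimal sketch under assumed shapes (one center and width per hidden unit); the function name is illustrative and not from the paper.

```python
import numpy as np

def normalized_gaussian_hidden(x, centers, widths):
    # x:       (dim,) input vector
    # centers: (n_hidden, dim) Gaussian centers, one per hidden unit
    # widths:  (n_hidden,) Gaussian widths s_i
    # Activation g_i(x) = exp(-||x - c_i||^2 / (2 s_i^2)), then normalized
    # so the hidden activations sum to one.
    d2 = np.sum((x - centers) ** 2, axis=1)
    g = np.exp(-d2 / (2.0 * widths ** 2))
    return g / np.sum(g)
```

In the Moody–Darken style hybrid rule the abstract describes, the centers and widths would be adapted by an unsupervised rule (e.g. a k-means-like update), while the linear output weights on top of these activations are trained by supervised least squares; the two phases form the cascade whose stability the paper analyzes.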

