ReLU Network with Bounded Width Is a Universal Approximator in View of an Approximate Identity

Background: In making the deep neural network, activation functions play an important role. But the choice of activation functions also affects the network in term of optimization and to retrieve the better results. Several activation functions have been introduced in machine learning for many practical applications. But which activation function should use at hidden layer of deep neural networks was not identified. Objective: The primary objective of this analysis was to describe which activation function must be used at hidden layers for deep neural networks to solve complex non-linear problems. Methods: The configuration for this comparative model was used by using the datasets of 2 classes (Cat/Dog). The number of Convolutional layer used in this network was 3 and the pooling layer was also introduced after each layer of CNN layer. The total of the dataset was divided into the two parts. The first 8000 images were mainly used for training the network and the next 2000 images were used for testing the network. Results: The experimental comparison was done by analyzing the network by taking different activation functions on each layer of CNN network. The validation error and accuracy on Cat/Dog dataset were analyzed using activation functions (ReLU, Tanh, Selu, PRelu, Elu) at number of hidden layers. Overall the Relu gave best performance with the validation loss at 25th Epoch 0.3912 and validation accuracy at 25th Epoch 0.8320. Conclusion: It is found that a CNN model with ReLU hidden layers (3 hidden layers here) gives best results and improve overall performance better in term of accuracy and speed. These advantages of ReLU in CNN at number of hidden layers are helpful to effectively and fast retrieval of images from the databases.

Download Full-text

Hardware implementation of radial-basis neural networks with Gaussian activation functions on FPGA

Neural Computing and Applications ◽

10.1007/s00521-021-05706-3 ◽

2021 ◽

Author(s):

Volodymyr Shymkovych ◽

Sergii Telenyk ◽

Petro Kravets

Keyword(s):

Neural Networks ◽

Hardware Implementation ◽

Gaussian Function ◽

Activation Function ◽

Rbf Neural Networks ◽

Activation Functions ◽

Rbf Network ◽

Combination Scheme ◽

Radial Basis ◽

Hidden Layer

AbstractThis article introduces a method for realizing the Gaussian activation function of radial-basis (RBF) neural networks with their hardware implementation on field-programmable gaits area (FPGAs). The results of modeling of the Gaussian function on FPGA chips of different families have been presented. RBF neural networks of various topologies have been synthesized and investigated. The hardware component implemented by this algorithm is an RBF neural network with four neurons of the latent layer and one neuron with a sigmoid activation function on an FPGA using 16-bit numbers with a fixed point, which took 1193 logic matrix gate (LUTs—LookUpTable). Each hidden layer neuron of the RBF network is designed on an FPGA as a separate computing unit. The speed as a total delay of the combination scheme of the block RBF network was 101.579 ns. The implementation of the Gaussian activation functions of the hidden layer of the RBF network occupies 106 LUTs, and the speed of the Gaussian activation functions is 29.33 ns. The absolute error is ± 0.005. The Spartan 3 family of chips for modeling has been used to get these results. Modeling on chips of other series has been also introduced in the article. RBF neural networks of various topologies have been synthesized and investigated. Hardware implementation of RBF neural networks with such speed allows them to be used in real-time control systems for high-speed objects.

Download Full-text

Evaluation of Parameter Settings for Training Neural Networks Using Backpropagation Algorithms

10.4018/978-1-6684-2408-7.ch009 ◽

2022 ◽

pp. 202-226

Author(s):

Leema N. ◽

Khanna H. Nehemiah ◽

Elgin Christo V. R. ◽

Kannan A.

Keyword(s):

Neural Network ◽

Neural Networks ◽

Activation Function ◽

Neural Network Training ◽

Network Parameter ◽

Network Parameters ◽

Network Training ◽

Rate Minimum ◽

Hidden Layer ◽

Function Number

Artificial neural networks (ANN) are widely used for classification, and the training algorithm commonly used is the backpropagation (BP) algorithm. The major bottleneck faced in the backpropagation neural network training is in fixing the appropriate values for network parameters. The network parameters are initial weights, biases, activation function, number of hidden layers and the number of neurons per hidden layer, number of training epochs, learning rate, minimum error, and momentum term for the classification task. The objective of this work is to investigate the performance of 12 different BP algorithms with the impact of variations in network parameter values for the neural network training. The algorithms were evaluated with different training and testing samples taken from the three benchmark clinical datasets, namely, Pima Indian Diabetes (PID), Hepatitis, and Wisconsin Breast Cancer (WBC) dataset obtained from the University of California Irvine (UCI) machine learning repository.

Download Full-text

Application of Artificial Neural Networks to Assess the Mycological State of Bulk Stored Rapeseeds

Agriculture ◽

10.3390/agriculture10110567 ◽

2020 ◽

Vol 10 (11) ◽

pp. 567

Author(s):

Jolanta Wawrzyniak

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

Control Systems ◽

Multilayer Perceptron ◽

Activation Function ◽

Fungal Population ◽

Support Tool ◽

Artificial Neural ◽

Hidden Layer ◽

And Storage

Artificial neural networks (ANNs) constitute a promising modeling approach that may be used in control systems for postharvest preservation and storage processes. The study investigated the ability of multilayer perceptron and radial-basis function ANNs to predict fungal population levels in bulk stored rapeseeds with various temperatures (T = 12–30 °C) and water activity in seeds (aw = 0.75–0.90). The neural network model input included aw, temperature, and time, whilst the fungal population level was the model output. During the model construction, networks with a different number of hidden layer neurons and different configurations of activation functions in neurons of the hidden and output layers were examined. The best architecture was the multilayer perceptron ANN, in which the hyperbolic tangent function acted as an activation function in the hidden layer neurons, while the linear function was the activation function in the output layer neuron. The developed structure exhibits high prediction accuracy and high generalization capability. The model provided in the research may be readily incorporated into control systems for postharvest rapeseed preservation and storage as a support tool, which based on easily measurable on-line parameters can estimate the risk of fungal development and thus mycotoxin accumulation.

Download Full-text

Neural Networks for Optimal Approximation of Smooth and Analytic Functions

Neural Computation ◽

10.1162/neco.1996.8.1.164 ◽

1996 ◽

Vol 8 (1) ◽

pp. 164-177 ◽

Cited By ~ 105

Author(s):

H. N. Mhaskar

Keyword(s):

Neural Networks ◽

Target Function ◽

Activation Function ◽

Basis Functions ◽

Optimal Order ◽

Order Of Approximation ◽

Continuous Linear ◽

Linear Functionals ◽

Principal Element ◽

Hidden Layer

We prove that neural networks with a single hidden layer are capable of providing an optimal order of approximation for functions assumed to possess a given number of derivatives, if the activation function evaluated by each principal element satisfies certain technical conditions. Under these conditions, it is also possible to construct networks that provide a geometric order of approximation for analytic target functions. The permissible activation functions include the squashing function (1 − e−x)−1 as well as a variety of radial basis functions. Our proofs are constructive. The weights and thresholds of our networks are chosen independently of the target function; we give explicit formulas for the coefficients as simple, continuous, linear functionals of the target function.

Download Full-text

NEW METHOD OF NEURON DESIGN BASED ON DISCRETE Z-FUNCTION TO ADAPT THE CHANGE OF INTEGRATED VEHICLE STABILITY CONTROL ORDER

International Journal of Computational Intelligence and Applications ◽

10.1142/s146902680900259x ◽

2009 ◽

Vol 08 (03) ◽

pp. 253-285 ◽

Cited By ~ 1

Author(s):

M. HARLY ◽

I. N. SUTANTRA ◽

H. P. MAURIDHI

Keyword(s):

Neural Network ◽

Neural Networks ◽

Back Propagation ◽

Activation Function ◽

Zero Order ◽

Vehicle Stability ◽

Adaptive Control System ◽

Control Performance ◽

Fixed Order ◽

Hidden Layer

Fixed order neural networks (FONN), such as high order neural network (HONN), in which its architecture is developed from zero order of activation function and joint weight, regulates only the number of weight and their value. As a result, this network only produces a fixed order model or control level. These obstacles, which affect preceeding architectures, have been performing finite ability to adapt uncertainty character of real world plant, such as driving dynamics and its desired control performance. This paper introduces a new concept of neural network neuron. In this matter, exploiting discrete z-function builds new neuron activation. Instead of zero order joint weight matrices, the discrete z-function weight matrix will be provided to realize uncertainty or undetermined real word plant and desired adaptive control system that their order has probably been changing. Instead of using bias, an initial condition value is developed. Neural networks using new neurons is called Varied Order Neural Network (VONN). For optimization process, updating order, coefficient and initial value of node activation function uses GA; while updating joint weight, it applies both back propagation (combined LSE-gauss Newton) and NPSO. To estimate the number of hidden layer, constructive back propagation (CBP) was also applied. Thorough simulation was conducted to compare the control performance between FONN and MONN. In order to control, vehicle stability was equipped by electronics stability program (ESP), electronics four wheel steering (4-EWS), and active suspension (AS). 2000, 4000, 6000, 8000 data that are from TODS, a hidden layer, 3 input nodes, 3 output nodes were provided to train and test the network of both the uncertainty model and its adaptive control system. The result of simulation, therefore, shows that stability parameter such as yaw rate error, vehicle side slip error, and rolling angle error produces better performance control in the form of smaller performance index using FDNN than those using MONN.

Download Full-text

Evaluation of Parameter Settings for Training Neural Networks Using Backpropagation Algorithms

International Journal of Operations Research and Information Systems ◽

10.4018/ijoris.2020100104 ◽

2020 ◽

Vol 11 (4) ◽

pp. 62-85

Author(s):

Leema N. ◽

Khanna H. Nehemiah ◽

Elgin Christo V. R. ◽

Kannan A.

Keyword(s):

Neural Network ◽

Neural Networks ◽

Activation Function ◽

Neural Network Training ◽

Network Parameter ◽

Network Parameters ◽

Network Training ◽

Hidden Layer ◽

Function Number ◽

The Impact

Artificial neural networks (ANN) are widely used for classification, and the training algorithm commonly used is the backpropagation (BP) algorithm. The major bottleneck faced in the backpropagation neural network training is in fixing the appropriate values for network parameters. The network parameters are initial weights, biases, activation function, number of hidden layers and the number of neurons per hidden layer, number of training epochs, learning rate, minimum error, and momentum term for the classification task. The objective of this work is to investigate the performance of 12 different BP algorithms with the impact of variations in network parameter values for the neural network training. The algorithms were evaluated with different training and testing samples taken from the three benchmark clinical datasets, namely, Pima Indian Diabetes (PID), Hepatitis, and Wisconsin Breast Cancer (WBC) dataset obtained from the University of California Irvine (UCI) machine learning repository.

Download Full-text

Lp APPROXIMATION CAPABILITIES OF SUM-OF-PRODUCT AND SIGMA-PI-SIGMA NEURAL NETWORKS

International Journal of Neural Systems ◽

10.1142/s0129065707001251 ◽

2007 ◽

Vol 17 (05) ◽

pp. 419-424 ◽

Cited By ~ 10

Author(s):

JINLING LONG ◽

WEI WU ◽

DONG NAN

Keyword(s):

Neural Networks ◽

Activation Function ◽

Compact Set ◽

Almost Everywhere ◽

Lp Approximation

This paper studies the Lp approximation capabilities of sum-of-product (SOPNN) and sigma-pi-sigma (SPSNN) neural networks. It is proved that the set of functions that are generated by the SOPNN with its activation function in [Formula: see text] is dense in [Formula: see text] for any compact set [Formula: see text], if and only if the activation function is not a polynomial almost everywhere. It is also shown that if the activation function of the SPSNN is in [Formula: see text], then the functions generated by the SPSNN are dense in [Formula: see text] if and only if the activation function is not a constant (a.e.).

Download Full-text

Aspects of the numerical analysis of neural networks

Acta Numerica ◽

10.1017/s0962492900002439 ◽

1994 ◽

Vol 3 ◽

pp. 145-202 ◽

Cited By ~ 22

Author(s):

S.W. Ellacott

Keyword(s):

Neural Networks ◽

Degree Of Approximation ◽

Compact Set ◽

Backpropagation Algorithm ◽

Feedforward Network ◽

Delta Rule ◽

Open Questions ◽

Numerical Process ◽

The Subject ◽

Hidden Layer

This article starts with a brief introduction to neural networks for those unfamiliar with the basic concepts, together with a very brief overview of mathematical approaches to the subject. This is followed by a more detailed look at three areas of research which are of particular interest to numerical analysts.The first area is approximation theory. IfKis a compact set in ℝn, for somen, then it is proved that a semilinear feedforward network with one hidden layer can uniformly approximate any continuous function inC(K) to any required accuracy. A discussion of known results and open questions on the degree of approximation is included. We also consider the relevance of radial basis functions to neural networks.The second area considered is that of learning algorithms. A detailed analysis of one popular algorithm (the delta rule) will be given, indicating why one implementation leads to a stable numerical process, whereas an initially attractive variant (essentially a form of steepest descent) does not. Similar considerations apply to the backpropagation algorithm. The effect of filtering and other preprocessing of the input data will also be discussed systematically.Finally some applications of neural networks to numerical computation are considered.

Download Full-text

Heart Attack Prediction Using Neural Networks

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.35935 ◽

2021 ◽

Vol 9 (VI) ◽

pp. 4501-4504

Author(s):

Chandu Nereeksha

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Early Stage ◽

Heart Diseases ◽

Back Propagation ◽

Data Mining Algorithms ◽

Wide Range ◽

Enormous Amount ◽

Health Situation ◽

Mining Algorithms

Today, Heart disease seems to be a great cause for the increasing rate of immortality especially taken the current health situation under consideration. Improving the health conditions using the latest technology makes an enormous amount of contribution to the healthcare industry. One such, improvement is the use of Machine Learning in determining the heart diseases. Machine learning has a wide range of advancement in Neural Networks (NN). Artificial Neural Networks are basically inspired by the working of neural network inside a human brain. Our study aims to use the different algorithms and technologies to predict heart diseases at an early stage. Different data mining algorithms namely, Decision Tree, K-means clustering, Back-propagation and Random Forest are being used. The system classifies data into different stages such as normal and mild or extreme.

Download Full-text