Neural Networks for Optimal Approximation of Smooth and Analytic Functions

1996 ◽  
Vol 8 (1) ◽  
pp. 164-177 ◽  
Author(s):  
H. N. Mhaskar

We prove that neural networks with a single hidden layer are capable of providing an optimal order of approximation for functions assumed to possess a given number of derivatives, if the activation function evaluated by each principal element satisfies certain technical conditions. Under these conditions, it is also possible to construct networks that provide a geometric order of approximation for analytic target functions. The permissible activation functions include the squashing function (1 + e^{-x})^{-1} as well as a variety of radial basis functions. Our proofs are constructive. The weights and thresholds of our networks are chosen independently of the target function; we give explicit formulas for the coefficients as simple, continuous, linear functionals of the target function.
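
For reference, a minimal sketch (not taken from the paper) of the two activation-function families named above, each used in a single-hidden-layer network of the form sum_k c_k * phi_k(x); the weights and coefficients here are arbitrary illustrative values, whereas the paper determines the coefficients by explicit linear functionals of the target function.

```python
# Sigmoidal "squashing" and Gaussian radial basis activations in single-hidden-layer
# networks. Weights, thresholds, and coefficients are arbitrary illustrative values.
import numpy as np

def squashing(t):
    return 1.0 / (1.0 + np.exp(-t))                     # (1 + e^{-t})^{-1}

def sigmoidal_net(x, W, b, c):
    """Single hidden layer: sum_k c_k * sigma(w_k . x + b_k)."""
    return c @ squashing(W @ x + b)

def rbf_net(x, centers, widths, c):
    """Single hidden layer of Gaussian radial basis functions."""
    r2 = np.sum((centers - x) ** 2, axis=1)
    return c @ np.exp(-r2 / (2.0 * widths**2))

x = np.array([0.3, -0.7])
W = np.array([[1.0, -2.0], [0.5, 0.8], [-1.2, 0.1]])    # hidden-unit weights
b = np.array([0.1, -0.3, 0.2])                          # thresholds
c = np.array([0.6, -1.1, 0.4])                          # output coefficients
print(sigmoidal_net(x, W, b, c))
print(rbf_net(x, centers=W, widths=np.array([1.0, 0.5, 2.0]), c=c))
```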

2019 ◽  
Vol 12 (3) ◽  
pp. 156-161 ◽  
Author(s):  
Aman Dureja ◽  
Payal Pahwa

Background: Activation functions play an important role in building deep neural networks, and their choice affects both optimization and the quality of the results. Several activation functions have been introduced in machine learning for practical applications, but which activation function should be used in the hidden layers of deep neural networks has not been clearly established. Objective: The primary objective of this analysis was to determine which activation function should be used in the hidden layers of deep neural networks to solve complex non-linear problems. Methods: The comparative model was configured on a two-class (Cat/Dog) image dataset. The network contained three convolutional layers, each followed by a pooling layer. The dataset was split into two parts: the first 8000 images were used for training the network and the remaining 2000 images for testing it. Results: The experimental comparison was performed by evaluating the network with different activation functions (ReLU, Tanh, SELU, PReLU, ELU) in the hidden layers and analyzing the validation error and accuracy on the Cat/Dog dataset. Overall, ReLU gave the best performance, with a validation loss of 0.3912 and a validation accuracy of 0.8320 at the 25th epoch. Conclusion: A CNN model with ReLU in its hidden layers (three hidden layers here) gives the best results and improves overall performance in terms of both accuracy and speed. These advantages of ReLU across the hidden layers of a CNN support effective and fast retrieval of images from databases.
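
As an illustration of the setup described (three convolutional layers, each followed by pooling, a binary Cat/Dog output, and a configurable hidden-layer activation), a minimal Keras sketch follows; the input size, filter counts, and optimizer are assumptions, since the abstract does not state them.

```python
# Minimal Keras sketch of the comparison setup: three Conv2D blocks, each followed
# by max pooling, with a configurable hidden-layer activation. The 64x64 RGB input,
# filter counts, and Adam optimizer are assumptions not stated in the abstract.
import tensorflow as tf

def build_cnn(activation: str = "relu") -> tf.keras.Model:
    model = tf.keras.Sequential([
        tf.keras.layers.Input(shape=(64, 64, 3)),
        tf.keras.layers.Conv2D(32, 3, activation=activation),
        tf.keras.layers.MaxPooling2D(),
        tf.keras.layers.Conv2D(64, 3, activation=activation),
        tf.keras.layers.MaxPooling2D(),
        tf.keras.layers.Conv2D(128, 3, activation=activation),
        tf.keras.layers.MaxPooling2D(),
        tf.keras.layers.Flatten(),
        tf.keras.layers.Dense(128, activation=activation),
        tf.keras.layers.Dense(1, activation="sigmoid"),   # binary Cat/Dog output
    ])
    model.compile(optimizer="adam", loss="binary_crossentropy",
                  metrics=["accuracy"])
    return model

# Activations can then be compared as in the study (PReLU is not a string
# activation in Keras and would need the tf.keras.layers.PReLU layer instead):
# for act in ["relu", "tanh", "selu", "elu"]:
#     build_cnn(act).fit(train_ds, validation_data=val_ds, epochs=25)
```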


Author(s):  
Volodymyr Shymkovych ◽  
Sergii Telenyk ◽  
Petro Kravets

This article introduces a method for realizing the Gaussian activation function of radial basis function (RBF) neural networks in hardware on field-programmable gate arrays (FPGAs). Results of modeling the Gaussian function on FPGA chips of different families are presented, and RBF neural networks of various topologies have been synthesized and investigated. The hardware component implemented with this algorithm is an RBF neural network with four hidden-layer neurons and one output neuron with a sigmoid activation function, realized on an FPGA with 16-bit fixed-point numbers and occupying 1193 look-up tables (LUTs). Each hidden-layer neuron of the RBF network is implemented on the FPGA as a separate computing unit. The total delay of the combinational circuit of the RBF network block was 101.579 ns. The implementation of the Gaussian activation functions of the hidden layer occupies 106 LUTs, with a delay of 29.33 ns and an absolute error of ±0.005. The Spartan-3 family of chips was used to obtain these results; modeling on chips of other series is also presented in the article. Hardware implementation of RBF neural networks at this speed allows them to be used in real-time control systems for high-speed objects.
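
A rough software model of a fixed-point Gaussian activation, in the spirit of the hardware block described above; the look-up-table approach and the 16-bit Q2.14 format are assumptions for illustration, and the article's actual FPGA architecture may differ.

```python
# Rough software model of a fixed-point Gaussian activation using a look-up table.
# The table-based approach and the Q2.14 format (2 integer bits, 14 fractional bits)
# are illustrative assumptions, not the article's circuit.
import numpy as np

FRAC_BITS = 14
SCALE = 1 << FRAC_BITS

def to_fixed(x):
    return np.round(np.asarray(x) * SCALE).astype(np.int32)

def from_fixed(q):
    return np.asarray(q, dtype=np.float64) / SCALE

# Pre-computed table of exp(-t) for t in [0, 4), addressed by the top bits of t.
TABLE_SIZE = 256
T_MAX = 4.0
EXP_TABLE = to_fixed(np.exp(-np.linspace(0.0, T_MAX, TABLE_SIZE, endpoint=False)))

def gaussian_fixed(x_q, c_q, inv_two_sigma2_q):
    """exp(-(x - c)^2 / (2 sigma^2)) using only integer arithmetic."""
    d = x_q - c_q
    t = ((d * d) >> FRAC_BITS) * inv_two_sigma2_q >> FRAC_BITS
    idx = np.clip(t * TABLE_SIZE // int(T_MAX * SCALE), 0, TABLE_SIZE - 1)
    return EXP_TABLE[idx]

# Quick accuracy check against the floating-point Gaussian (sigma = 1, c = 0).
x = np.linspace(-1.5, 1.5, 2001)
approx = from_fixed(gaussian_fixed(to_fixed(x), to_fixed(0.0), to_fixed(0.5)))
print("max abs error:", np.max(np.abs(approx - np.exp(-0.5 * x**2))))
```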


2022 ◽  
pp. 202-226
Author(s):  
Leema N. ◽  
Khanna H. Nehemiah ◽  
Elgin Christo V. R. ◽  
Kannan A.

Artificial neural networks (ANNs) are widely used for classification, and the training algorithm most commonly used is the backpropagation (BP) algorithm. The major bottleneck in backpropagation neural network training is fixing appropriate values for the network parameters: the initial weights, biases, activation function, number of hidden layers, number of neurons per hidden layer, number of training epochs, learning rate, minimum error, and momentum term for the classification task. The objective of this work is to investigate the performance of 12 different BP algorithms and the impact of variations in the network parameter values on neural network training. The algorithms were evaluated with different training and testing samples taken from three benchmark clinical datasets, namely the Pima Indian Diabetes (PID), Hepatitis, and Wisconsin Breast Cancer (WBC) datasets obtained from the University of California Irvine (UCI) machine learning repository.
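
A minimal sketch of the kind of parameter configuration the study varies, using scikit-learn's MLPClassifier and its built-in breast-cancer data as stand-ins; the chapter itself does not tie its experiments to any particular library.

```python
# Minimal sketch of the network parameters the study varies, with scikit-learn's
# MLPClassifier as a stand-in; load_breast_cancer serves as a proxy for the WBC data.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)

clf = make_pipeline(
    StandardScaler(),
    MLPClassifier(
        hidden_layer_sizes=(10,),   # number of hidden layers / neurons per layer
        activation="logistic",      # activation function
        solver="sgd",               # gradient-descent backpropagation variant
        learning_rate_init=0.01,    # learning rate
        momentum=0.9,               # momentum term
        max_iter=500,               # number of training epochs
        tol=1e-4,                   # minimum error (stopping tolerance)
        random_state=0,             # fixes the random initial weights and biases
    ),
)
clf.fit(X_tr, y_tr)
print("test accuracy:", clf.score(X_te, y_te))
```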


Agriculture ◽  
2020 ◽  
Vol 10 (11) ◽  
pp. 567
Author(s):  
Jolanta Wawrzyniak

Artificial neural networks (ANNs) constitute a promising modeling approach that may be used in control systems for postharvest preservation and storage processes. The study investigated the ability of multilayer perceptron and radial basis function ANNs to predict fungal population levels in bulk-stored rapeseed at various temperatures (T = 12–30 °C) and seed water activities (aw = 0.75–0.90). The model inputs were aw, temperature, and time, whilst the fungal population level was the model output. During model construction, networks with different numbers of hidden-layer neurons and different configurations of activation functions in the hidden- and output-layer neurons were examined. The best architecture was a multilayer perceptron ANN in which the hyperbolic tangent served as the activation function in the hidden-layer neurons and the linear function as the activation function in the output-layer neuron. The developed structure exhibits high prediction accuracy and high generalization capability. The model provided in this research may readily be incorporated into control systems for postharvest rapeseed preservation and storage as a support tool which, based on easily measurable on-line parameters, can estimate the risk of fungal development and thus mycotoxin accumulation.
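
A minimal sketch of the best-performing structure described (tanh activation in the hidden layer, linear output neuron, inputs aw, temperature, and time), assuming scikit-learn and synthetic placeholder data rather than the study's measurements.

```python
# Sketch of the model structure described above: an MLP with a tanh hidden layer and
# a linear output, mapping (water activity, temperature, time) to a fungal population
# level. The data below are synthetic placeholders, not the study's measurements.
import numpy as np
from sklearn.neural_network import MLPRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
n = 500
aw = rng.uniform(0.75, 0.90, n)          # water activity in seeds
temp = rng.uniform(12.0, 30.0, n)        # storage temperature, degrees C
days = rng.uniform(0.0, 30.0, n)         # storage time
X = np.column_stack([aw, temp, days])
y = 2.0 + 3.0 * aw + 0.1 * temp + 0.05 * days + rng.normal(0, 0.1, n)  # placeholder response

model = make_pipeline(
    StandardScaler(),
    MLPRegressor(hidden_layer_sizes=(8,), activation="tanh",  # tanh hidden layer
                 max_iter=5000, random_state=0),              # output is linear by default
)
model.fit(X, y)
print("R^2 on the training data:", model.score(X, y))
```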


1995 ◽  
Vol 7 (2) ◽  
pp. 219-269 ◽  
Author(s):  
Federico Girosi ◽  
Michael Jones ◽  
Tomaso Poggio

We had previously shown that regularization principles lead to approximation schemes that are equivalent to networks with one layer of hidden units, called regularization networks. In particular, standard smoothness functionals lead to a subclass of regularization networks, the well known radial basis functions approximation schemes. This paper shows that regularization networks encompass a much broader range of approximation schemes, including many of the popular general additive models and some of the neural networks. In particular, we introduce new classes of smoothness functionals that lead to different classes of basis functions. Additive splines as well as some tensor product splines can be obtained from appropriate classes of smoothness functionals. Furthermore, the same generalization that extends radial basis functions (RBF) to hyper basis functions (HBF) also leads from additive models to ridge approximation models, containing as special cases Breiman's hinge functions, some forms of projection pursuit regression, and several types of neural networks. We propose to use the term generalized regularization networks for this broad class of approximation schemes that follow from an extension of regularization. In the probabilistic interpretation of regularization, the different classes of basis functions correspond to different classes of prior probabilities on the approximating function spaces, and therefore to different types of smoothness assumptions. In summary, different multilayer networks with one hidden layer, which we collectively call generalized regularization networks, correspond to different classes of priors and associated smoothness functionals in a classical regularization principle. Three broad classes are (1) radial basis functions that can be generalized to hyper basis functions, (2) some tensor product splines, and (3) additive splines that can be generalized to schemes of the type of ridge approximation, hinge functions, and several perceptron-like neural networks with one hidden layer.
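
A minimal sketch of the simplest regularization network mentioned above: a Gaussian radial basis function scheme with one hidden unit per data point, whose coefficients solve the regularized linear system (G + lambda*I)c = y; the data and kernel width are illustrative choices, not from the paper.

```python
# Regularization network with a Gaussian (RBF) kernel: f(x) = sum_i c_i G(x - x_i),
# with coefficients from the regularized system (G + lambda*I) c = y.
# Data and kernel width are illustrative choices.
import numpy as np

def gaussian_kernel(x, centers, sigma=0.3):
    return np.exp(-((x[:, None] - centers[None, :]) ** 2) / (2.0 * sigma**2))

rng = np.random.default_rng(0)
x_train = np.sort(rng.uniform(0.0, 1.0, 30))
y_train = np.sin(2.0 * np.pi * x_train) + rng.normal(0.0, 0.1, 30)

lam = 1e-3                                          # regularization parameter
G = gaussian_kernel(x_train, x_train)
c = np.linalg.solve(G + lam * np.eye(len(x_train)), y_train)

x_test = np.linspace(0.0, 1.0, 200)
y_pred = gaussian_kernel(x_test, x_train) @ c       # one hidden unit per data point
print("max training residual:", np.max(np.abs(G @ c - y_train)))
```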


Author(s):  
M. HARLY ◽  
I. N. SUTANTRA ◽  
H. P. MAURIDHI

Fixed-order neural networks (FONN), such as high-order neural networks (HONN), whose architecture is built from zero-order activation functions and joint weights, adjust only the number of weights and their values. As a result, such a network produces only a fixed-order model or control level. This limitation gives the preceding architectures only a finite ability to adapt to the uncertain character of real-world plants, such as driving dynamics and their desired control performance. This paper introduces a new concept of a neural-network neuron in which a discrete z-function is exploited to build the neuron activation. Instead of zero-order joint weight matrices, discrete z-function weight matrices are provided to represent uncertain or undetermined real-world plants and the desired adaptive control system, whose order may change. Instead of a bias, an initial-condition value is used. Neural networks built from these new neurons are called varied-order neural networks (VONN). In the optimization process, the order, coefficients, and initial values of the node activation functions are updated with a genetic algorithm (GA), while the joint weights are updated with both backpropagation (combined LSE-Gauss-Newton) and NPSO. Constructive backpropagation (CBP) is also applied to estimate the number of hidden layers. A thorough simulation was conducted to compare the control performance of FONN and VONN. For the control task, vehicle stability was equipped with an electronic stability program (ESP), electronic four-wheel steering (4-EWS), and active suspension (AS). Sets of 2000, 4000, 6000, and 8000 data points from TODS, with one hidden layer, 3 input nodes, and 3 output nodes, were used to train and test the networks of both the uncertainty model and its adaptive control system. The simulation results show that stability parameters such as the yaw-rate error, vehicle side-slip error, and rolling-angle error achieve better control performance, in the form of a smaller performance index, with the proposed VONN than with the fixed-order network.


Author(s):  
JIANJUN WANG ◽  
WEIHUA XU ◽  
BIN ZOU

For three-layer artificial neural networks with trigonometric weight coefficients, upper and lower bounds for the approximation of 2π-periodic pth-order Lebesgue integrable functions [Formula: see text] are obtained in this paper. The theorems obtained provide explicit equational representations of these approximating networks, a specification of the number of hidden-layer units, a lower bound estimate of the approximation, and the essential order of approximation. The results not only characterize the intrinsic approximation properties of these neural networks but also reveal the implicit relationship between the precision (speed) of approximation and the number of hidden neurons.
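
For intuition only, a sketch of how the sup-norm error of a network with trigonometric hidden units shrinks as the number of hidden units grows for a 2π-periodic target; here the network is simply a truncated Fourier sum, which is an illustrative stand-in and not the specific construction analyzed in the paper.

```python
# Truncated Fourier sum as a stand-in for a network with trigonometric hidden units:
# the sup-norm error on a 2*pi-periodic target shrinks as the number of units n grows.
import numpy as np

def fourier_net(x, f, n):
    """Approximate a 2*pi-periodic function f with n cosine/sine hidden units."""
    t = np.linspace(0.0, 2.0 * np.pi, 4096, endpoint=False)
    y = f(t)
    approx = np.full_like(x, y.mean())
    for k in range(1, n + 1):
        a_k = 2.0 * np.mean(y * np.cos(k * t))   # Fourier coefficients estimated
        b_k = 2.0 * np.mean(y * np.sin(k * t))   # by averaging on a uniform grid
        approx += a_k * np.cos(k * x) + b_k * np.sin(k * x)
    return approx

f = lambda x: np.abs(np.sin(x)) ** 3
x = np.linspace(0.0, 2.0 * np.pi, 1000)
for n in (2, 4, 8, 16):
    err = np.max(np.abs(fourier_net(x, f, n) - f(x)))
    print(f"n = {n:2d} hidden units, sup-norm error = {err:.4f}")
```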


2012 ◽  
Vol 42 (4) ◽  
pp. 295-311 ◽  
Author(s):  
Viliam Šimor ◽  
Kamila Hlavčová ◽  
Silvia Kohnová ◽  
Ján Szolgay

This article presents an application of artificial neural networks (ANNs) and multiple regression models for estimating the mean annual maximum discharge (index flood) at ungauged sites. Both approaches were tested for 145 small basins in Slovakia with areas ranging from 20 to 300 km². Using an objective clustering method, the catchments were divided into ten homogeneous pooling groups, and mutually independent predictors (catchment characteristics) were selected for both models in each pooling group. The neural network was applied as a simple multilayer perceptron with one hidden layer and a backpropagation learning algorithm; the hyperbolic tangent was used as the activation function in the hidden layer. Estimation of the index floods by the multiple regression models was based on deriving relationships between the index floods and the catchment predictors. The efficiency of both approaches was tested with the Nash-Sutcliffe and correlation coefficients. The results showed the comparable applicability of both models, with slightly better results for the index floods achieved using the ANN methodology.
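
For reference, the Nash-Sutcliffe efficiency used to compare the two approaches; the index-flood values below are placeholders, not data from the study.

```python
# Nash-Sutcliffe efficiency: 1 is a perfect fit, 0 means the model is no better than
# predicting the observed mean. Values below are placeholders, not study data.
import numpy as np

def nash_sutcliffe(observed, simulated):
    observed = np.asarray(observed, dtype=float)
    simulated = np.asarray(simulated, dtype=float)
    return 1.0 - np.sum((observed - simulated) ** 2) / np.sum((observed - observed.mean()) ** 2)

obs = [12.0, 35.0, 20.0, 50.0, 8.0]   # observed index floods (m^3/s)
sim = [14.0, 30.0, 22.0, 47.0, 10.0]  # model estimates
print("NSE =", round(nash_sutcliffe(obs, sim), 3))
```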


1994 ◽  
Vol 6 (3) ◽  
pp. 543-558 ◽  
Author(s):  
Věra Kůrková ◽  
Paul C. Kainen

For a feedforward perceptron type architecture with a single hidden layer but with a quite general activation function, we characterize the relation between pairs of weight vectors determining networks with the same input-output function.
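
Two weight-space transformations of this kind for a one-hidden-layer tanh network, permutation of hidden units and a sign flip of a unit's input weights, bias, and output weight, are sketched below; this only illustrates the sort of equivalence the paper characterizes, not its full result.

```python
# Two transformations that leave the input-output function of a one-hidden-layer tanh
# network unchanged: permuting hidden units, and flipping the signs of one unit's
# input weights, bias, and output weight (tanh is odd). Illustrative only.
import numpy as np

rng = np.random.default_rng(0)
W = rng.normal(size=(4, 3))     # input-to-hidden weights
b = rng.normal(size=4)          # hidden biases
v = rng.normal(size=4)          # hidden-to-output weights

def net(x, W, b, v):
    return v @ np.tanh(W @ x + b)

x = rng.normal(size=3)

perm = rng.permutation(4)                       # interchange hidden units
print(np.isclose(net(x, W, b, v), net(x, W[perm], b[perm], v[perm])))

W2, b2, v2 = W.copy(), b.copy(), v.copy()       # sign-flip one hidden unit
W2[0], b2[0], v2[0] = -W2[0], -b2[0], -v2[0]
print(np.isclose(net(x, W, b, v), net(x, W2, b2, v2)))
```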

