Classification and Prediction with Neural Networks

Data Mining and Medical Knowledge Management ◽

10.4018/978-1-60566-218-3.ch004 ◽

2011 ◽

pp. 76-107

Author(s):

Arnošt Veselý

Keyword(s):

Neural Networks ◽

Error Function ◽

Descent Method ◽

Cross Entropy ◽

Gradient Descent Method ◽

Classification Problems ◽

Network Training ◽

The Neural Network ◽

Gradient Calculation ◽

Entropy Error

This chapter deals with applications of artificial neural networks in classification and regression problems. Based on theoretical analysis it demonstrates that in classification problems one should use cross-entropy error function rather than the usual sum-of-square error function. Using gradient descent method for finding the minimum of the cross entropy error function, leads to the well-known backpropagation of error scheme of gradient calculation if at the output layer of the neural network the neurons with logistic or softmax output functions are used. The author believes that understanding the underlying theory presented in this chapter will help researchers in medical informatics to choose more suitable network architectures for medical applications and that it helps them to carry out the network training more effectively.

Download Full-text

Infrared Face Recognition System Using Cross Entropy Error Function Based Ensemble Backpropagation Neural Networks

International Journal of Computer Theory and Engineering ◽

10.7763/ijcte.2016.v8.1037 ◽

2016 ◽

Vol 8 (2) ◽

pp. 161-166 ◽

Cited By ~ 3

Author(s):

Benyamin Kusumoputro ◽

◽

Lina Lina

Keyword(s):

Neural Networks ◽

Face Recognition ◽

Error Function ◽

Recognition System ◽

Cross Entropy ◽

Backpropagation Neural Networks ◽

Face Recognition System ◽

Entropy Error

Download Full-text

Convergence Analysis of Inverse Iterative Neural Networks with L2 Penalty

Journal of Applied Computer Science Methods ◽

10.1515/jacsm-2016-0006 ◽

2016 ◽

Vol 8 (2) ◽

pp. 85-98

Author(s):

Yanqing Wen ◽

Jian Wang ◽

Bingjia Huang ◽

Jacek M. Zurada

Keyword(s):

Neural Networks ◽

Strong Convergence ◽

Error Function ◽

Descent Method ◽

Iterative Sequence ◽

Gradient Descent Method ◽

Optimal Point ◽

Penalty Term ◽

Iterative Inversion ◽

Feasible Solutions

Abstract The iterative inversion of neural networks has been used in solving problems of adaptive control due to its good performance of information processing. In this paper an iterative inversion neural network with L2 penalty term has been presented trained by using the classical gradient descent method. We mainly focus on the theoretical analysis of this proposed algorithm such as monotonicity of error function, boundedness of input sequences and weak (strong) convergence behavior. For bounded property of inputs, we rigorously proved that the feasible solutions of input are restricted in a measurable field. The weak convergence means that the gradient of error function with respect to input tends to zero as the iterations go to infinity while the strong convergence stands for the iterative sequence of input vectors convergence to a fixed optimal point.

Download Full-text

Estimation of Navigation Mark Floating Based on Fractional-Order Gradient Descent with Momentum for RBF Neural Network

Mathematical Problems in Engineering ◽

10.1155/2021/6681651 ◽

2021 ◽

Vol 2021 ◽

pp. 1-10

Author(s):

Qionglin Fang

Keyword(s):

Neural Network ◽

Fractional Order ◽

Gradient Descent ◽

Rbf Neural Network ◽

Error Function ◽

Initial Position ◽

Descent Method ◽

Local Optimum ◽

Gradient Descent Method ◽

The Neural Network

To address the difficulty of estimating the drift of the navigation marks, a fractional-order gradient with the momentum RBF neural network (FOGDM-RBF) is designed. The convergence is proved, and it is used to estimate the drifting trajectory of the navigation marks with different geographical locations. First, the weight of the neural network is set. The navigation mark’s meteorological, hydrological, and initial position data are taken as the input of the neural network. The neural network is trained and used to estimate the mark’s position. The navigation mark’s position is taken at a later time as the output of the neural network. The difference between the later position and the estimated position obtained from the neural network is the error function of the neural network. The influence of sea conditions and months are analyzed. The experimental results and error analysis show that FOGDM-RBF is better than other algorithms at trajectory estimation and interpolation, has better accuracy and generalization, and does not easily fall into the local optimum. It is effective at accelerating convergence speed and improving the performance of a gradient descent method.

Download Full-text

Why Does Large Batch Training Result in Poor Generalization? A Comprehensive Explanation and a Better Strategy from the Viewpoint of Stochastic Optimization

Neural Computation ◽

10.1162/neco_a_01089 ◽

2018 ◽

Vol 30 (7) ◽

pp. 2005-2023 ◽

Cited By ~ 3

Author(s):

Tomoumi Takase ◽

Satoshi Oyama ◽

Masahito Kurihara

Keyword(s):

Gradient Descent ◽

Optimization Problems ◽

Descent Method ◽

Batch Size ◽

Gradient Descent Method ◽

Neural Network Training ◽

Nonconvex Optimization Problems ◽

Large Batch ◽

Network Training ◽

Comprehensive Framework

We present a comprehensive framework of search methods, such as simulated annealing and batch training, for solving nonconvex optimization problems. These methods search a wider range by gradually decreasing the randomness added to the standard gradient descent method. The formulation that we define on the basis of this framework can be directly applied to neural network training. This produces an effective approach that gradually increases batch size during training. We also explain why large batch training degrades generalization performance, which previous studies have not clarified.

Download Full-text

SOME METHODS OF ADAPTIVE MULTILAYER NEURAL NETWORKS TRAINING

International Journal of Computing ◽

10.47839/ijc.3.1.259 ◽

2014 ◽

pp. 99-106

Author(s):

Leonid Makhnist ◽

Nikolaj Maniakov ◽

Nikolaj Maniakov

Keyword(s):

Neural Networks ◽

Basic Concept ◽

Gradient Descent ◽

Descent Method ◽

Gradient Descent Method ◽

New Techniques ◽

Adaptive Training ◽

Multilayer Neural Networks

Is proposed two new techniques for multilayer neural networks training. Its basic concept is based on the gradient descent method. For every methodic are showed formulas for calculation of the adaptive training steps. Presented matrix algorithmizations for all of these techniques are very helpful in its program realization.

Download Full-text

Meteorological Data Forecast using RNN

Deep Learning and Neural Networks ◽

10.4018/978-1-7998-0414-7.ch050 ◽

2020 ◽

pp. 905-920

Author(s):

Stefan Balluff ◽

Jörg Bendfeld ◽

Stefan Krauter

Keyword(s):

Neural Networks ◽

Wind Speed ◽

Linear Prediction ◽

Learning Algorithm ◽

Meteorological Data ◽

System Modeling ◽

Descent Method ◽

Gradient Descent Method ◽

Earth System Modeling ◽

Set Up

Gathering knowledge not only of the current but also the upcoming wind speed is getting more and more important as the experience of operating and maintaining wind turbines is increasing. Not only with regards to operation and maintenance tasks such as gearbox and generator checks but moreover due to the fact that energy providers have to sell the right amount of their converted energy at the European energy markets, the knowledge of the wind and hence electrical power of the next day is of key importance. Selling more energy as has been offered is penalized as well as offering less energy as contractually promised. In addition to that the price per offered kWh decreases in case of a surplus of energy. Achieving a forecast there are various methods in computer science: fuzzy logic, linear prediction or neural networks. This paper presents current results of wind speed forecasts using recurrent neural networks (RNN) and the gradient descent method plus a backpropagation learning algorithm. Data used has been extracted from NASA's Modern Era-Retrospective analysis for Research and Applications (MERRA) which is calculated by a GEOS-5 Earth System Modeling and Data Assimilation system. The presented results show that wind speed data can be forecasted using historical data for training the RNN. Nevertheless, the current set up system lacks robustness and can be improved further with regards to accuracy.

Download Full-text

Modeling a Thermochemical Reactor of a Solar Refrigerator by BaCl2-NH3 Sorption Using Artificial Neural Networks and Mathematical Symmetry Groups

Mathematical Problems in Engineering ◽

10.1155/2020/9098709 ◽

2020 ◽

Vol 2020 ◽

pp. 1-11

Author(s):

Onesimo Meza-Cruz ◽

Isaac Pilatowsky ◽

Agustín Pérez-Ramírez ◽

Carlos Rivera-Blanco ◽

Youness El Hamzaoui ◽

...

Keyword(s):

Neural Network ◽

Experimental Data ◽

Neural Networks ◽

Heating Temperature ◽

Barium Chloride ◽

Symmetry Groups ◽

Solar Cooling ◽

Network Training ◽

The Neural Network ◽

Marquardt Algorithm

The aim of this work is to present a model for heat transfer, desorbed refrigerant, and pressure of an intermittent solar cooling system’s thermochemical reactor based on backpropagation neural networks and mathematical symmetry groups. In order to achieve this, a reactor was designed and built based on the reaction of BaCl2-NH3. Experimental data from this reactor were collected, where barium chloride was used as a solid absorbent and ammonia as a refrigerant. The neural network was trained using the Levenberg–Marquardt algorithm. The correlation coefficient between experimental data and data simulated by the neural network was r = 0.9957. In the neural network’s sensitivity analysis, it was found that the inputs, reactor’s heating temperature and sorption time, influence neural network’s learning by 35% and 20%, respectively. It was also found that, by applying permutations to experimental data and using multibase mathematical symmetry groups, the neural network training algorithm converges faster.

Download Full-text

Convergence of Batch Gradient Method Based on the Entropy Error Function for Feedforward Neural Networks

Neural Processing Letters ◽

10.1007/s11063-020-10374-w ◽

2020 ◽

Vol 52 (3) ◽

pp. 2687-2695

Author(s):

Yan Xiong ◽

Xin Tong

Keyword(s):

Neural Networks ◽

Gradient Method ◽

Error Function ◽

Feedforward Neural Networks ◽

Entropy Error

Download Full-text

FEATURE EXTRACTION BASED ON DIRECT CALCULATION OF MUTUAL INFORMATION

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001407005892 ◽

2007 ◽

Vol 21 (07) ◽

pp. 1213-1231 ◽

Cited By ~ 9

Author(s):

NOJUN KWAK

Keyword(s):

Feature Extraction ◽

Mutual Information ◽

Direct Calculation ◽

Extraction Methods ◽

Descent Method ◽

Gradient Descent Method ◽

Probability Density Estimation ◽

Classification Problems ◽

Feature Extraction Method ◽

Window Method

In many pattern recognition problems, it is desirable to reduce the number of input features by extracting important features related to the problems. By focusing on only the problem-relevant features, the dimension of features can be greatly reduced and thereby can result in a better generalization performance with less computational complexity. In this paper, we propose a feature extraction method for handling classification problems. The proposed algorithm is used to search for a set of linear combinations of the original features, whose mutual information with the output class can be maximized. The mutual information between the extracted features and the output class is calculated by using the probability density estimation based on the Parzen window method. A greedy algorithm using the gradient descent method is used to determine the new features. The computational load is proportional to the square of the number of samples. The proposed method was applied to several classification problems, which showed better or comparable performances than the conventional feature extraction methods.

Download Full-text

THE APPLICATION OF FEEDFORWARD NEURAL NETWORKS IN VLSI FABRICATION PROCESS OPTIMIZATION

International Journal of Computational Intelligence and Applications ◽

10.1142/s1469026801000032 ◽

2001 ◽

Vol 01 (01) ◽

pp. 83-90 ◽

Cited By ~ 3

Author(s):

WANG XIANGDONG ◽

WANG SHOUJUE

Keyword(s):

Neural Networks ◽

Manufacturing Process ◽

Large Scale ◽

Maximum Yield ◽

Feedforward Neural Networks ◽

Descent Method ◽

Gradient Descent Method ◽

Manufacturing Process Control ◽

Wafer Probing ◽

Process Learning

In this paper, we present a neural-based manufacturing process control system for semiconductor factories to improve the die yield. A model based on neural networks is proposed to simulate Very Large-Scale Integrated (VLSI) manufacturing process. Learning from the historical processing lists with Radial Basis Function (RBF), we simulate the functional relationship between the wafer probing parameters and the die yield. Then we use a gradient-descent method to search a set of 'optimal' parameters that lead to the maximum yield of the model. At last, we adjust the specification in the practical semiconductor manufacturing process. The average die yield increased from 51.7% to 57.5% after the system had been applied in Huajing Corporation.

Download Full-text