Learning in Feedforward Neural Networks Accelerated by Transfer Entropy

Current neural networks architectures are many times harder to train because of the increasing size and complexity of the used datasets. Our objective is to design more efficient training algorithms utilizing causal relationships inferred from neural networks. The transfer entropy (TE) was initially introduced as an information transfer measure used to quantify the statistical coherence between events (time series). Later, it was related to causality, even if they are not the same. There are only few papers reporting applications of causality or TE in neural networks. Our contribution is an information-theoretical method for analyzing information transfer between the nodes of feedforward neural networks. The information transfer is measured by the TE of feedback neural connections. Intuitively, TE measures the relevance of a connection in the network and the feedback amplifies this connection. We introduce a backpropagation type training algorithm that uses TE feedback connections to improve its performance.

Download Full-text

A Novel Fast Feedforward Neural Networks Training Algorithm

Journal of Artificial Intelligence and Soft Computing Research ◽

10.2478/jaiscr-2021-0017 ◽

2021 ◽

Vol 11 (4) ◽

pp. 287-306

Author(s):

Jarosław Bilski ◽

Bartosz Kowalczyk ◽

Andrzej Marjański ◽

Michał Gandor ◽

Jacek Zurada

Keyword(s):

Neural Networks ◽

Feedforward Neural Networks ◽

Recursive Least Squares ◽

Convergence Time ◽

Normal Equation ◽

Training Algorithms ◽

Training Algorithm ◽

Training Time ◽

Satisfactory Outcome ◽

Accelerated Gradient

Abstract In this paper1 a new neural networks training algorithm is presented. The algorithm originates from the Recursive Least Squares (RLS) method commonly used in adaptive filtering. It uses the QR decomposition in conjunction with the Givens rotations for solving a normal equation - resulting from minimization of the loss function. An important parameter in neural networks is training time. Many commonly used algorithms require a big number of iterations in order to achieve a satisfactory outcome while other algorithms are effective only for small neural networks. The proposed solution is characterized by a very short convergence time compared to the well-known backpropagation method and its variants. The paper contains a complete mathematical derivation of the proposed algorithm. There are presented extensive simulation results using various benchmarks including function approximation, classification, encoder, and parity problems. Obtained results show the advantages of the featured algorithm which outperforms commonly used recent state-of-the-art neural networks training algorithms, including the Adam optimizer and the Nesterov’s accelerated gradient.

Download Full-text

Optimizing the Learning Process of Feedforward Neural Networks Using Lightning Search Algorithm

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213016500330 ◽

2016 ◽

Vol 25 (06) ◽

pp. 1650033 ◽

Cited By ~ 26

Author(s):

Hossam Faris ◽

Ibrahim Aljarah ◽

Nailah Al-Madi ◽

Seyedali Mirjalili

Keyword(s):

Neural Network ◽

Neural Networks ◽

Optimization Problems ◽

Search Algorithm ◽

Optimization Technique ◽

Back Propagation ◽

Feedforward Neural Networks ◽

Training Algorithms ◽

Local Optima ◽

Local Solutions

Evolutionary Neural Networks are proven to be beneficial in solving challenging datasets mainly due to the high local optima avoidance. Stochastic operators in such techniques reduce the probability of stagnation in local solutions and assist them to supersede conventional training algorithms such as Back Propagation (BP) and Levenberg-Marquardt (LM). According to the No-Free-Lunch (NFL), however, there is no optimization technique for solving all optimization problems. This means that a Neural Network trained by a new algorithm has the potential to solve a new set of problems or outperform the current techniques in solving existing problems. This motivates our attempts to investigate the efficiency of the recently proposed Evolutionary Algorithm called Lightning Search Algorithm (LSA) in training Neural Network for the first time in the literature. The LSA-based trainer is benchmarked on 16 popular medical diagnosis problems and compared to BP, LM, and 6 other evolutionary trainers. The quantitative and qualitative results show that the LSA algorithm is able to show not only better local solutions avoidance but also faster convergence speed compared to the other algorithms employed. In addition, the statistical test conducted proves that the LSA-based trainer is significantly superior in comparison with the current algorithms on the majority of datasets.

Download Full-text

Extended and Unscented Kalman filtering based feedforward neural networks for time series prediction

Applied Mathematical Modelling ◽

10.1016/j.apm.2011.07.052 ◽

2012 ◽

Vol 36 (3) ◽

pp. 1123-1131 ◽

Cited By ~ 37

Author(s):

Xuedong Wu ◽

Yaonan Wang

Keyword(s):

Neural Networks ◽

Time Series ◽

Kalman Filtering ◽

Time Series Prediction ◽

Feedforward Neural Networks

Download Full-text

Step acceleration based training algorithm for feedforward neural networks

Object recognition supported by user interaction for service robots ◽

10.1109/icpr.2002.1048243 ◽

2003 ◽

Author(s):

Yanlai Li ◽

Kuanquan Wang ◽

D. Zhang

Keyword(s):

Neural Networks ◽

Feedforward Neural Networks ◽

Training Algorithm

Download Full-text

Optical Recognition of Handwritten Logic Formulas Using Neural Networks

Electronics ◽

10.3390/electronics10222761 ◽

2021 ◽

Vol 10 (22) ◽

pp. 2761

Author(s):

Vaios Ampelakiotis ◽

Isidoros Perikos ◽

Ioannis Hatzilygeroudis ◽

George Tsihrintzis

Keyword(s):

Neural Networks ◽

Character Recognition ◽

Gradient Descent ◽

Feedforward Neural Networks ◽

Stochastic Gradient ◽

Stochastic Gradient Descent ◽

Training Algorithms ◽

Gradient Descent Algorithm ◽

Two Stages ◽

And Training

In this paper, we present a handwritten character recognition (HCR) system that aims to recognize first-order logic handwritten formulas and create editable text files of the recognized formulas. Dense feedforward neural networks (NNs) are utilized, and their performance is examined under various training conditions and methods. More specifically, after three training algorithms (backpropagation, resilient propagation and stochastic gradient descent) had been tested, we created and trained an NN with the stochastic gradient descent algorithm, optimized by the Adam update rule, which was proved to be the best, using a trainset of 16,750 handwritten image samples of 28 × 28 each and a testset of 7947 samples. The final accuracy achieved is 90.13%. The general methodology followed consists of two stages: the image processing and the NN design and training. Finally, an application has been created that implements the methodology and automatically recognizes handwritten logic formulas. An interesting feature of the application is that it allows for creating new, user-oriented training sets and parameter settings, and thus new NN models.

Download Full-text

ODE-LM: A Hybrid Training Algorithm for Feedforward Neural Networks

Advances in Intelligent Systems and Computing - Foundations and Practical Applications of Cognitive Systems and Information Processing ◽

10.1007/978-3-642-37835-5_17 ◽

2013 ◽

pp. 187-198 ◽

Cited By ~ 1

Author(s):

Li Zhang ◽

Hong Li ◽

Dazheng Feng

Keyword(s):

Neural Networks ◽

Feedforward Neural Networks ◽

Training Algorithm

Download Full-text

Time series forecasting with feedforward neural networks trained using particle swarm optimizers for dynamic environments

Neural Computing and Applications ◽

10.1007/s00521-020-05163-4 ◽

2020 ◽

Author(s):

Salihu A. Abdulkarim ◽

Andries P. Engelbrecht

Keyword(s):

Neural Networks ◽

Time Series ◽

Particle Swarm ◽

Feedforward Neural Networks ◽

Dynamic Environments ◽

Time Series Forecasting

Download Full-text

A New Layer by Layer training algorithm for multilayer feedforward neural networks

2011 3rd International Conference on Advanced Computer Control ◽

10.1109/icacc.2011.6016485 ◽

2011 ◽

Author(s):

Yanlai Li ◽

Tao Li ◽

Kuanquan Wang

Keyword(s):

Neural Networks ◽

Feedforward Neural Networks ◽

Layer By Layer ◽

Training Algorithm

Download Full-text

The HJPS Training Algorithm for Multilayer Feedforward Neural Networks

Journal of Computer Research and Development ◽

10.1360/crad20051023 ◽

2005 ◽

Vol 42 (10) ◽

pp. 1790

Author(s):

Yanlai Li

Keyword(s):

Neural Networks ◽

Feedforward Neural Networks ◽

Training Algorithm

Download Full-text

Learning in Convolutional Neural Networks Accelerated by Transfer Entropy

Entropy ◽

10.3390/e23091218 ◽

2021 ◽

Vol 23 (9) ◽

pp. 1218

Author(s):

Adrian Moldovan ◽

Angel Caţaron ◽

Răzvan Andonie

Keyword(s):

Information Transfer ◽

Effective Connectivity ◽

Transfer Entropy ◽

Learning Mechanisms ◽

Feedback Parameter ◽

Smoothing Factor ◽

Computational Overhead ◽

Neural Information ◽

Input Sample ◽

Feedback Connections

Recently, there is a growing interest in applying Transfer Entropy (TE) in quantifying the effective connectivity between artificial neurons. In a feedforward network, the TE can be used to quantify the relationships between neuron output pairs located in different layers. Our focus is on how to include the TE in the learning mechanisms of a Convolutional Neural Network (CNN) architecture. We introduce a novel training mechanism for CNN architectures which integrates the TE feedback connections. Adding the TE feedback parameter accelerates the training process, as fewer epochs are needed. On the flip side, it adds computational overhead to each epoch. According to our experiments on CNN classifiers, to achieve a reasonable computational overhead–accuracy trade-off, it is efficient to consider only the inter-neural information transfer of the neuron pairs between the last two fully connected layers. The TE acts as a smoothing factor, generating stability and becoming active only periodically, not after processing each input sample. Therefore, we can consider the TE is in our model a slowly changing meta-parameter.

Download Full-text