Optimizing nonlinear activation function for convolutional neural networks

A relationship is provided between the learning rate η of the learning algorithm and the slope β of the nonlinear activation function for a class of recurrent neural networks (RNNs) trained by the real-time recurrent learning algorithm. It is shown that an arbitrary RNN can be obtained via the referent RNN, with some deterministic rules imposed on its weights and the learning rate. Such relationships reduce the number of degrees of freedom when solving the nonlinear optimization task of finding the optimal RNN parameters.
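The kind of equivalence described above can be checked numerically on a toy case. The sketch below is an illustration only: it uses a single static tanh neuron trained by plain gradient descent rather than the full RNN with real-time recurrent learning, and it assumes the scaling rule "weights multiplied by β, learning rate multiplied by β²", which the abstract does not state explicitly. The function name `train` and all numeric values are hypothetical.

```python
import math

def train(w, x_seq, d_seq, eta, beta):
    """Gradient descent on one neuron y = tanh(beta * w.x) with squared error."""
    w = list(w)
    for x, d in zip(x_seq, d_seq):
        net = beta * sum(wi * xi for wi, xi in zip(w, x))
        y = math.tanh(net)
        # error * derivative of tanh * slope factor from the chain rule
        grad_scale = (d - y) * (1.0 - y * y) * beta
        w = [wi + eta * grad_scale * xi for wi, xi in zip(w, x)]
    return w

# Hypothetical toy data and hyperparameters
beta, eta = 2.5, 0.05
w0 = [0.3, -0.2, 0.1]
xs = [[1.0, 0.5, -0.3], [0.2, -0.7, 0.9], [-0.4, 0.1, 0.6]]
ds = [0.5, -0.1, 0.3]

# Network with slope beta and learning rate eta
w_general = train(w0, xs, ds, eta, beta)

# Assumed referent network: slope 1, weights scaled by beta, learning rate scaled by beta**2
w_referent = train([beta * wi for wi in w0], xs, ds, beta ** 2 * eta, 1.0)

# If the assumed rule holds, beta * w_general tracks w_referent step for step
max_gap = max(abs(beta * a - b) for a, b in zip(w_general, w_referent))
print("largest weight mismatch:", max_gap)
```

Under this assumed rule the two runs produce the same outputs at every step, so β and η need not be tuned independently, which is the reduction in degrees of freedom the abstract refers to.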

Nonlinear Activation Function Generation Based on Silicon Microring Resonators for Integrated Photonic Neural Networks

2019 Conference on Lasers and Electro-Optics Europe & European Quantum Electronics Conference (CLEO/Europe-EQEC) ◽

10.1109/cleoe-eqec.2019.8872372 ◽

2019 ◽

Author(s):

Mircea Catuneanu ◽

Ryan Hamerly ◽

Nirav Annavarapu ◽

Shahryar Sabouri ◽

Kambiz Jamshidi

Keyword(s):

Neural Networks ◽

Activation Function ◽

Microring Resonators ◽

Function Generation ◽

Nonlinear Activation Function

SinP[N]: A Fast Convergence Activation Function for Convolutional Neural Networks

2018 IEEE/ACM International Conference on Utility and Cloud Computing Companion (UCC Companion) ◽

10.1109/ucc-companion.2018.00082 ◽

2018 ◽

Cited By ~ 2

Author(s):

Ka-Hou Chan ◽

Sio-Kei Im ◽

Wei Ke ◽

Ngan-Lin Lei

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Activation Function ◽

Fast Convergence

RSigELU: A nonlinear activation function for deep neural networks

Expert Systems with Applications ◽

10.1016/j.eswa.2021.114805 ◽

2021 ◽

pp. 114805

Author(s):

Serhat Kiliçarslan ◽

Mete Celik

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Activation Function ◽

Nonlinear Activation Function

Differential Equation Units: Learning Functional Forms of Activation Functions from Data

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.6065 ◽

2020 ◽

Vol 34 (04) ◽

pp. 6030-6037

Author(s):

MohamadAli Torkamani ◽

Shiv Shankar ◽

Amirmohammad Rooshenas ◽

Phillip Wallis

Keyword(s):

Differential Equation ◽

Neural Networks ◽

Deep Neural Networks ◽

Functional Form ◽

Activation Function ◽

Superior Performance ◽

Activation Functions ◽

Functional Forms ◽

Nonlinear Activation Function

Most deep neural networks use simple, fixed activation functions, such as sigmoids or rectified linear units, regardless of domain or network structure. We introduce differential equation units (DEUs), an improvement to modern neural networks, which enables each neuron to learn a particular nonlinear activation function from a family of solutions to an ordinary differential equation. Specifically, each neuron may change its functional form during training based on the behavior of the other parts of the network. We show that using neurons with DEU activation functions results in a more compact network capable of achieving comparable, if not superior, performance when compared to much larger networks.
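To illustrate how a single ordinary differential equation can yield a family of activation shapes, here is a minimal sketch. The ODE used, y'' + c·y = 0 with y(0) = 0 and y'(0) = 1, is a hypothetical stand-in chosen because it has a simple closed form; it is not claimed to be the family used in the DEU paper. The function name `ode_activation` and the coefficient `c` are assumptions made for this example.

```python
import math

def ode_activation(x, c):
    """Closed-form solution of y'' + c*y = 0 with y(0) = 0, y'(0) = 1, evaluated at x."""
    if c > 0:
        k = math.sqrt(c)
        return math.sin(k * x) / k      # oscillatory form
    if c < 0:
        k = math.sqrt(-c)
        return math.sinh(k * x) / k     # exponential-type form
    return x                            # c == 0 gives the identity

# One scalar coefficient switches the functional form of the activation
for c in (-1.0, 0.0, 4.0):
    values = [round(ode_activation(x, c), 4) for x in (-1.0, 0.0, 0.5, 1.0)]
    print("c =", c, "->", values)
```

In a trainable setting, a coefficient such as `c` would be a per-neuron parameter updated by gradient descent along with the weights, so each neuron could drift toward a different functional form.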

All-optical nonlinear activation function for photonic neural networks [Invited]

Optical Materials Express ◽

10.1364/ome.8.003851 ◽

2018 ◽

Vol 8 (12) ◽

pp. 3851 ◽

Cited By ~ 53

Author(s):

Mario Miscuglio ◽

Armin Mehrabian ◽

Zibo Hu ◽

Shaimaa I. Azzam ◽

Jonathan George ◽

...

Keyword(s):

Neural Networks ◽

Activation Function ◽

All Optical ◽

Nonlinear Activation Function

Deep Learning Based on Fourier Convolutional Neural Network Incorporating Random Kernels

Electronics ◽

10.3390/electronics10162004 ◽

2021 ◽

Vol 10 (16) ◽

pp. 2004

Author(s):

Yuna Han ◽

Byung-Woo Hong

Keyword(s):

Neural Network ◽

Neural Networks ◽

Convolutional Neural Network ◽

Convolutional Neural Networks ◽

Activation Function ◽

Spatial Domain ◽

Transformation Process ◽

Real Component ◽

Rectified Linear Unit ◽

Classification Tasks

In recent years, convolutional neural networks have been studied in the Fourier domain for a limited environment, where competitive results can be expected for conventional image classification tasks in the spatial domain. We present a novel efficient Fourier convolutional neural network, where a new activation function is used, the additional shift Fourier transformation process is eliminated, and the number of learnable parameters is reduced. First, the Phase Rectified Linear Unit (PhaseReLU) is proposed, which is equivalent to the Rectified Linear Unit (ReLU) in the spatial domain. Second, in the proposed Fourier network, the shift Fourier transform is removed since the process is inessential for training. Lastly, we introduce two ways of reducing the number of weight parameters in the Fourier network. The basic method is to use a three-by-three kernel instead of a five-by-five kernel in our proposed Fourier convolutional neural network. In our efficient Fourier convolutional neural network, we instead use a random kernel, for which the standard deviation of the Gaussian distribution serves as the weight parameter. In other words, since only two scalars for each imaginary and real component per channel are required, only a very small number of parameters is needed. In experiments on shallow networks, such as LeNet-3 and LeNet-5, our method achieves competitive accuracy with conventional convolutional neural networks while dramatically reducing the number of parameters. Furthermore, our proposed Fourier network, using a basic three-by-three kernel, mostly achieves higher accuracy than traditional convolutional neural networks in both shallow and deep networks. Our experiments indicate that the presented kernel methods have the potential to be applied to all architectures based on convolutional neural networks.
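Fourier-domain convolutional networks rest on the convolution theorem: circular convolution in the spatial domain equals element-wise multiplication of the spectra. The sketch below checks this on a short 1-D signal using a naive DFT written with the standard library. It is background only and does not implement PhaseReLU or the random-kernel scheme, whose definitions are not given in the abstract; all function names and values are hypothetical.

```python
import cmath

def dft(x):
    """Naive discrete Fourier transform, O(n^2)."""
    n = len(x)
    return [sum(x[k] * cmath.exp(-2j * cmath.pi * f * k / n) for k in range(n))
            for f in range(n)]

def idft(X):
    """Naive inverse discrete Fourier transform."""
    n = len(X)
    return [sum(X[f] * cmath.exp(2j * cmath.pi * f * k / n) for f in range(n)) / n
            for k in range(n)]

def circular_conv(x, h):
    """Direct circular convolution in the spatial domain."""
    n = len(x)
    return [sum(x[m] * h[(k - m) % n] for m in range(n)) for k in range(n)]

signal = [1.0, 2.0, 0.0, -1.0, 3.0, 0.5, -2.0, 1.5]
kernel = [0.25, 0.5, 0.25, 0.0, 0.0, 0.0, 0.0, 0.0]  # 3-tap kernel zero-padded to length 8

spatial = circular_conv(signal, kernel)
product = [a * b for a, b in zip(dft(signal), dft(kernel))]
spectral = [v.real for v in idft(product)]

max_err = max(abs(a - b) for a, b in zip(spatial, spectral))
print("largest difference between the two routes:", max_err)
```

Because the convolution becomes a pointwise product, a network that stays in the Fourier domain can replace sliding-window convolutions with multiplications, which is the setting in which the abstract's kernel-size and parameter-count reductions apply.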

Natural-Logarithm-Rectified Activation Function in Convolutional Neural Networks

2019 IEEE 5th International Conference on Computer and Communications (ICCC) ◽

10.1109/iccc47050.2019.9064398 ◽

2019 ◽

Author(s):

Yang Liu ◽

Jianpeng Zhang ◽

Chao Gao ◽

Jinghua Qu ◽

Lixin Ji

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Activation Function ◽

Natural Logarithm

Implications of Pooling Strategies in Convolutional Neural Networks: A Deep Insight

Foundations of Computing and Decision Sciences ◽

10.2478/fcds-2019-0016 ◽

2019 ◽

Vol 44 (3) ◽

pp. 303-330 ◽

Cited By ~ 3

Author(s):

Shallu Sharma ◽

Rajesh Mehra

Keyword(s):

Neural Networks ◽

Computer Vision ◽

Convolutional Neural Networks ◽

Network Architecture ◽

Computational Cost ◽

Activation Function ◽

Training Time ◽

Pooling Strategies ◽

Deep CNN

Convolutional neural networks (CNNs) are a contemporary technique for computer vision applications, in which pooling is an integral part of the deep CNN. Pooling provides the ability to learn invariant features and also acts as a regularizer that further reduces overfitting. Additionally, pooling techniques significantly reduce the computational cost and training time of networks, which are equally important to consider. Here, the performance of pooling strategies on different datasets is analyzed and discussed qualitatively. This study presents a detailed review of conventional and recent strategies, which should help apprise readers of the upsides and downsides of each strategy. We also identify four fundamental factors, namely network architecture, activation function, overlapping, and regularization approaches, that strongly affect the performance of pooling operations. It is believed that this work will help extend the understanding of the significance of CNNs and pooling regimes for solving computer vision problems.
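As a concrete reference for the pooling operations discussed above, here is a minimal pure-Python sketch of max and average pooling over a 2-D feature map, including an overlapping variant in which the stride is smaller than the window. The function name `pool2d` and the sample feature map are hypothetical and are not taken from the reviewed paper.

```python
def pool2d(feature_map, size, stride, reduce_fn):
    """Slide a size x size window over the map and reduce each window to one value."""
    rows, cols = len(feature_map), len(feature_map[0])
    pooled = []
    for r in range(0, rows - size + 1, stride):
        pooled_row = []
        for c in range(0, cols - size + 1, stride):
            window = [feature_map[r + i][c + j]
                      for i in range(size) for j in range(size)]
            pooled_row.append(reduce_fn(window))
        pooled.append(pooled_row)
    return pooled

def mean(values):
    return sum(values) / len(values)

feature_map = [
    [1, 3, 2, 0],
    [4, 6, 1, 1],
    [0, 2, 9, 5],
    [1, 1, 3, 7],
]

max_pooled = pool2d(feature_map, size=2, stride=2, reduce_fn=max)    # [[6, 2], [2, 9]]
avg_pooled = pool2d(feature_map, size=2, stride=2, reduce_fn=mean)   # [[3.5, 1.0], [1.0, 6.0]]
overlapping = pool2d(feature_map, size=2, stride=1, reduce_fn=max)   # 3 x 3 output

print(max_pooled)
print(avg_pooled)
print(overlapping)
```

With a 2 x 2 window and stride 2, the 4 x 4 map shrinks to 2 x 2, so later layers process a quarter of the values; this is the source of the reduced computational cost the abstract mentions.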

Algorithm Research on Improving Activation Function of Convolutional Neural Networks

Relating the Slope of the Activation Function and the Learning Rate Within a Recurrent Neural Network
