rectified linear unit Latest Research Papers

AbstractArtificial neural networks (ANNs) providing sophisticated, power-efficient classification are finding their way into thin-film electronics. Thin-film technologies require robust, layout-efficient devices with facile manufacturability. Here, we show how the multimodal transistor’s (MMT’s) transfer characteristic, with linear dependence in saturation, replicates the rectified linear unit (ReLU) activation function of convolutional ANNs (CNNs). Using MATLAB, we evaluate CNN performance using systematically distorted ReLU functions, then substitute measured and simulated MMT transfer characteristics as proxies for ReLU. High classification accuracy is maintained, despite large variations in geometrical and electrical parameters, as CNNs use the same activation functions for training and classification.

Download Full-text

Self-gated rectified linear unit for performance improvement of deep neural networks

ICT Express ◽

10.1016/j.icte.2021.12.012 ◽

2022 ◽

Author(s):

Israt Jahan ◽

Md. Faisal Ahmed ◽

Md. Osman Ali ◽

Yeong Min Jang

Keyword(s):

Neural Networks ◽

Performance Improvement ◽

Deep Neural Networks ◽

Rectified Linear Unit

Download Full-text

A note on the applications of one primary function in deep neural networks

International Journal of Wavelets Multiresolution and Information Processing ◽

10.1142/s0219691321500582 ◽

2021 ◽

Author(s):

Hengjie Chen ◽

Zhong Li

Keyword(s):

Neural Networks ◽

Mathematical Knowledge ◽

Deep Neural Networks ◽

Activation Function ◽

Continuous Functions ◽

Middle Point ◽

Deep Network ◽

Rectified Linear Unit ◽

The Mean ◽

The Difference

By applying fundamental mathematical knowledge, this paper proves that the function [Formula: see text] is an integer no less than [Formula: see text] has the property that the difference between the function value of middle point of arbitrarily two adjacent equidistant distribution nodes on [Formula: see text] and the mean of function values of these two nodes is a constant depending only on the number of nodes if and only if [Formula: see text] By them, we establish an important result about deep neural networks that the function [Formula: see text] can be interpolated by a deep Rectified Linear Unit (ReLU) network with depth [Formula: see text] on the equidistant distribution nodes in interval [Formula: see text] and the error of approximation is [Formula: see text] Then based on the main result that has just been proven and the Chebyshev orthogonal polynomials, we construct a deep network and give the error estimate of approximation to polynomials and continuous functions, respectively. In addition, this paper constructs one deep network with local sparse connections, shared weights and activation function [Formula: see text] and discusses its density and complexity.

Download Full-text

Early Prediction of DNN Activation Using Hierarchical Computations

Mathematics ◽

10.3390/math9233130 ◽

2021 ◽

Vol 9 (23) ◽

pp. 3130

Author(s):

Bharathwaj Suresh ◽

Kamlesh Pillai ◽

Gurpreet Singh Kalsi ◽

Avishaii Abuhatzera ◽

Sreenivas Subramoney

Keyword(s):

Deep Neural Networks ◽

State Of The Art ◽

Voice Recognition ◽

Activation Function ◽

Approximate Computing ◽

Inference Process ◽

Computation Cost ◽

Rectified Linear Unit ◽

Computational Resources ◽

Hierarchical Computation

Deep Neural Networks (DNNs) have set state-of-the-art performance numbers in diverse fields of electronics (computer vision, voice recognition), biology, bioinformatics, etc. However, the process of learning (training) from the data and application of the learnt information (inference) process requires huge computational resources. Approximate computing is a common method to reduce computation cost, but it introduces loss in task accuracy, which limits their application. Using an inherent property of Rectified Linear Unit (ReLU), a popular activation function, we propose a mathematical model to perform MAC operation using reduced precision for predicting negative values early. We also propose a method to perform hierarchical computation to achieve the same results as IEEE754 full precision compute. Applying this method on ResNet50 and VGG16 shows that up to 80% of ReLU zeros (which is 50% of all ReLU outputs) can be predicted and detected early by using just 3 out of 23 mantissa bits. This method is equally applicable to other floating-point representations.

Download Full-text

Role of Layers and Neurons in Deep Learning With the Rectified Linear Unit

Cureus ◽

10.7759/cureus.18866 ◽

2021 ◽

Author(s):

Akira Takekawa ◽

Masayuki Kajiura ◽

Hiroya Fukuda

Keyword(s):

Deep Learning ◽

Rectified Linear Unit

Download Full-text

Adaptive Levenberg–Marquardt Algorithm: A New Optimization Strategy for Levenberg–Marquardt Neural Networks

Mathematics ◽

10.3390/math9172176 ◽

2021 ◽

Vol 9 (17) ◽

pp. 2176

Author(s):

Zhiqi Yan ◽

Shisheng Zhong ◽

Lin Lin ◽

Zhiquan Cui

Keyword(s):

Neural Network ◽

Neural Networks ◽

Frequency Noise ◽

Optimization Strategy ◽

Engineering Data ◽

Marquardt Algorithm ◽

Highly Nonlinear ◽

Rectified Linear Unit ◽

Levenberg Marquardt ◽

Lm Algorithm

Engineering data are often highly nonlinear and contain high-frequency noise, so the Levenberg–Marquardt (LM) algorithm may not converge when a neural network optimized by the algorithm is trained with engineering data. In this work, we analyzed the reasons for the LM neural network’s poor convergence commonly associated with the LM algorithm. Specifically, the effects of different activation functions such as Sigmoid, Tanh, Rectified Linear Unit (RELU) and Parametric Rectified Linear Unit (PRLU) were evaluated on the general performance of LM neural networks, and special values of LM neural network parameters were found that could make the LM algorithm converge poorly. We proposed an adaptive LM (AdaLM) algorithm to solve the problem of the LM algorithm. The algorithm coordinates the descent direction and the descent step by the iteration number, which can prevent falling into the local minimum value and avoid the influence of the parameter state of LM neural networks. We compared the AdaLM algorithm with the traditional LM algorithm and its variants in terms of accuracy and speed in the context of testing common datasets and aero-engine data, and the results verified the effectiveness of the AdaLM algorithm.

Download Full-text

Using rectified linear unit and swish based artificial neural networks to describe noise transfer in a full vehicle context

The Journal of the Acoustical Society of America ◽

10.1121/10.0005535 ◽

2021 ◽

Vol 150 (3) ◽

pp. 2088-2105

Author(s):

Dimitrios Ernst Tsokaktsidis ◽

Clemens Nau ◽

Marcus Maeder ◽

Steffen Marburg

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

Full Vehicle ◽

Rectified Linear Unit ◽

Artificial Neural

Download Full-text

Deep Learning Based on Fourier Convolutional Neural Network Incorporating Random Kernels

Electronics ◽

10.3390/electronics10162004 ◽

2021 ◽

Vol 10 (16) ◽

pp. 2004

Author(s):

Yuna Han ◽

Byung-Woo Hong

Keyword(s):

Neural Network ◽

Neural Networks ◽

Convolutional Neural Network ◽

Convolutional Neural Networks ◽

Activation Function ◽

Spatial Domain ◽

Transformation Process ◽

Real Component ◽

Rectified Linear Unit ◽

Classification Tasks

In recent years, convolutional neural networks have been studied in the Fourier domain for a limited environment, where competitive results can be expected for conventional image classification tasks in the spatial domain. We present a novel efficient Fourier convolutional neural network, where a new activation function is used, the additional shift Fourier transformation process is eliminated, and the number of learnable parameters is reduced. First, the Phase Rectified Linear Unit (PhaseReLU) is proposed, which is equivalent to the Rectified Linear Unit (ReLU) in the spatial domain. Second, in the proposed Fourier network, the shift Fourier transform is removed since the process is inessential for training. Lastly, we introduce two ways of reducing the number of weight parameters in the Fourier network. The basic method is to use a three-by-three sized kernel instead of five-by-five in our proposed Fourier convolutional neural network. We use the random kernel in our efficient Fourier convolutional neural network, whose standard deviation of the Gaussian distribution is used as a weight parameter. In other words, since only two scalars for each imaginary and real component per channel are required, a very small number of parameters is applied compressively. Therefore, as a result of experimenting in shallow networks, such as LeNet-3 and LeNet-5, our method achieves competitive accuracy with conventional convolutional neural networks while dramatically reducing the number of parameters. Furthermore, our proposed Fourier network, using a basic three-by-three kernel, mostly performs with higher accuracy than traditional convolutional neural networks in shallow and deep neural networks. Our experiments represent that presented kernel methods have the potential to be applied in all architecture based on convolutional neural networks.

Download Full-text

Reconfiguration layers of convolutional neural network for fundus patches classification

Bulletin of Electrical Engineering and Informatics ◽

10.11591/eei.v10i1.1974 ◽

2021 ◽

Vol 10 (1) ◽

pp. 383-389

Author(s):

Wahyudi Setiawan ◽

Moh. Imam Utoyo ◽

Riries Rulaningtyas

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Original Data ◽

Max Pooling ◽

Output Layer ◽

Batch Normalization ◽

Rectified Linear Unit ◽

Input Layer ◽

Fully Connected

Convolutional neural network (CNN) is a method of supervised deep learning. The architectures including AlexNet, VGG16, VGG19, ResNet 50, ResNet101, GoogleNet, Inception-V3, Inception ResNet-V2, and Squeezenet that have 25 to 825 layers. This study aims to simplify layers of CNN architectures and increased accuracy for fundus patches classification. Fundus patches classify two categories: normal and neovascularization. Data used for classification is MESSIDOR and Retina Image Bank that have 2,080 patches. Results show the best accuracy of 93.17% for original data and 99,33% for augmentation data using CNN 31 layers. It consists input layer, 7 convolutional layers, 7 batch normalization, 7 rectified linear unit, 6 max-pooling, fully connected layer, softmax, and output layer.

Download Full-text

Weight initialization based‐rectified linear unit activation function to improve the performance of a convolutional neural network model

Concurrency and Computation Practice and Experience ◽

10.1002/cpe.6143 ◽

2020 ◽

Author(s):

Bekhzod Olimov ◽

Sanjar Karshiev ◽

Eungyeong Jang ◽

Sadia Din ◽

Anand Paul ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Network Model ◽

Neural Network Model ◽

Activation Function ◽

Rectified Linear Unit ◽

Weight Initialization

Download Full-text

rectified linear unit
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Multimodal transistors as ReLU activation functions in physical neural network classifiers

Self-gated rectified linear unit for performance improvement of deep neural networks

A note on the applications of one primary function in deep neural networks

Early Prediction of DNN Activation Using Hierarchical Computations

Role of Layers and Neurons in Deep Learning With the Rectified Linear Unit

Adaptive Levenberg–Marquardt Algorithm: A New Optimization Strategy for Levenberg–Marquardt Neural Networks

Using rectified linear unit and swish based artificial neural networks to describe noise transfer in a full vehicle context

Deep Learning Based on Fourier Convolutional Neural Network Incorporating Random Kernels

Reconfiguration layers of convolutional neural network for fundus patches classification

Weight initialization based‐rectified linear unit activation function to improve the performance of a convolutional neural network model

Export Citation Format

rectified linear unitRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Multimodal transistors as ReLU activation functions in physical neural network classifiers

Self-gated rectified linear unit for performance improvement of deep neural networks

A note on the applications of one primary function in deep neural networks

Early Prediction of DNN Activation Using Hierarchical Computations

Role of Layers and Neurons in Deep Learning With the Rectified Linear Unit

Adaptive Levenberg–Marquardt Algorithm: A New Optimization Strategy for Levenberg–Marquardt Neural Networks

Using rectified linear unit and swish based artificial neural networks to describe noise transfer in a full vehicle context

Deep Learning Based on Fourier Convolutional Neural Network Incorporating Random Kernels

Reconfiguration layers of convolutional neural network for fundus patches classification

Weight initialization based‐rectified linear unit activation function to improve the performance of a convolutional neural network model

rectified linear unit
Recently Published Documents