Activation function design for deep networks: linearity and effective initialisation

Pedestrian detection is the core of the driver assistance system, which collects the road conditions through the radars or cameras on the vehicle, judges whether there is a pedestrian in front of the vehicle, supports decisions such as raising the alarm, automatically slowing down, or emergency stopping to keep pedestrians safe, and improves the security when the vehicle is moving. Suffering from weather, lighting, clothing, large pose variations, and occlusion, the current pedestrian detection still has a certain distance from the practical applications. In recent years, deep networks have shown excellent performance for image detection, recognition, and classification. Some researchers employed deep network for pedestrian detection and achieve great progress, but deep networks need huge computational resources, which make it difficult to put into practical applications. In real scenarios of autonomous vehicles, the computation ability is limited. Thus, the shallow networks such as UDN (Unified Deep Networks) is a better choice, since it performs well while consuming less computation resources. Based on UDN, this paper proposes a new deep network model named two-stream UDN, which augments another branch for solving traditional UDN’s indistinction of the difference between trees/telegraph poles and pedestrians. The new branch accepts the upper third part of the pedestrian image as input, and the partial image has less deformation, stable features, and more distinguished characters from other objects. For the proposed two-stream UDN, multi-input features including the HOG (Histogram of Oriented Gradients) feature, Sobel feature, color feature, and foreground regions extracted by GrabCut segmentation algorithms are fed. Compared with the original input of UDN, the multi-input features are more conducive for pedestrian detection, since the fused HOG features and significant objects are more significant for pedestrian detection. Two-stream UDN is trained through two steps. First, the two sub-networks are trained until converge; then, we fuse results of the two subnets as the final result and feed it back to the two subnets to fine tune network parameters synchronously. To improve the performance, Swish is adopted as the activation function to obtain a faster training speed, and positive samples are mirrored and rotated with small angles to make the positive and negative samples more balanced.

Download Full-text

A comparison of deep networks with ReLU activation function and linear spline-type methods

Neural Networks ◽

10.1016/j.neunet.2018.11.005 ◽

2019 ◽

Vol 110 ◽

pp. 232-242 ◽

Cited By ~ 35

Author(s):

Konstantin Eckle ◽

Johannes Schmidt-Hieber

Keyword(s):

Activation Function ◽

Linear Spline ◽

Deep Networks

Download Full-text

Improving Convolutional Neural Networks with Competitive Activation Function

Security and Communication Networks ◽

10.1155/2021/1933490 ◽

2021 ◽

Vol 2021 ◽

pp. 1-9

Author(s):

Yao Ying ◽

Nengbo Zhang ◽

Ping He ◽

Silong Peng

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Activation Function ◽

Linear Mapping ◽

Nonlinear Transformation ◽

Nonlinear Mapping ◽

Activation Functions ◽

Competition Mechanism ◽

Function Design ◽

Competitive Activation

The activation function is the basic component of the convolutional neural network (CNN), which provides the nonlinear transformation capability required by the network. Many activation functions make the original input compete with different linear or nonlinear mapping terms to obtain different nonlinear transformation capabilities. Until recently, the original input of funnel activation (FReLU) competed with the spatial conditions, so FReLU not only has the ability of nonlinear transformation but also has the ability of pixelwise modeling. We summarize the competition mechanism in the activation function and then propose a novel activation function design template: competitive activation function (CAF), which promotes competition among different elements. CAF generalizes all activation functions that use competition mechanisms. According to CAF, we propose a parametric funnel rectified exponential unit (PFREU). PFREU promotes competition among linear mapping, nonlinear mapping, and spatial conditions. We conduct experiments on four datasets of different sizes, and the experimental results of three classical convolutional neural networks proved the superiority of our method.

Download Full-text

Pedestrian Detection Based on Two-Stream UDN

10.20944/preprints202001.0029.v1 ◽

2020 ◽

Author(s):

Wentong Wang ◽

Lichun Wang ◽

Xufei Ge ◽

Jinghua Li ◽

Baocai Yin

Keyword(s):

Autonomous Vehicle ◽

Pedestrian Detection ◽

Activation Function ◽

Practical Applications ◽

Deep Network ◽

The Road ◽

Slowing Down ◽

Deep Networks ◽

The Difference ◽

Computational Resources

Pedestrian detection is the core of driver assistance system, which collects the road conditions through the radars or cameras on the vehicle, judges whether there is a pedestrian in front of the vehicle, supports decisions such as raising the alarm, automatically slowing down or emergency stopping to keep pedestrians safe, and improves the security when the vehicle is moving. Suffered from weather, lighting, clothing, large pose variations and occlusion, the current pedestrian detection still has a certain distance from the practical applications. In recent years, deep networks have shown excellent performance for image detection, recognition and classification. Some researchers employed deep network for pedestrian detection and achieve great progress, but deep networks need huge computational resources which make it difficult to put into practical applications. In real scenarios of autonomous vehicle, the computation ability is limited. Thus, the shallow networks such as UDN (Unified Deep Networks) is a better choice since it performs well on consuming less computation resources. Base on UDN, this paper proposes a new deep network model named as two-stream UDN, which augments another branch for solving traditional UDN’s indistinction of the difference between trees / telegraph poles and pedestrians. The new branch accepts the upper third part of the pedestrian image as input, and the partial image has less deformation, stable features and more distinguished characters from other objects. For the proposed two-stream UDN, multi-input features including HOG feature, Sobel feature, color feature and foreground regions extracted by GrabCut segmentation algorithms are fed. Compared with the original input of UDN, the multi-input features are more conducive for pedestrian detection since the fused HOG features and significant objects are more significant for pedestrian detection. Two-stream UDN is trained through two steps: First, the two sub-networks are trained until converge; then we fuse results of the two subnets as the final result and feed it back to the two subnets to fine tune network parameters synchronously. To improve the performance, Softplus is adopted as activation function to obtain faster training speed, and positive samples are mirrored and rotated with small angle to make positive and negative samples more balanced.

Download Full-text

Notice of Retraction: Perceptron Linear Activation Function Design with CMOS-Memristive Circuits

2018 International Conference on Computing and Network Communications (CoCoNet) ◽

10.1109/coconet.2018.8476812 ◽

2018 ◽

Author(s):

Bexultan Nursultan ◽

Olga Krestinskaya

Keyword(s):

Activation Function ◽

Function Design ◽

Memristive Circuits

Download Full-text

Leveraging Product as an Activation Function in Deep Networks

2018 IEEE International Conference on Systems, Man, and Cybernetics (SMC) ◽

10.1109/smc.2018.00280 ◽

2018 ◽

Author(s):

Luke B. Godfrey ◽

Michael S. Gashler

Keyword(s):

Activation Function ◽

Deep Networks

Download Full-text

Extranuclear effects of estrogen on cortical bone in males is dependent on estrogen receptor A activation function-1

Bone Abstracts ◽

10.1530/boneabs.5.oc6.4 ◽

2016 ◽

Author(s):

Helen Farman ◽

Jianyao Wu ◽

Karin Gustafsson ◽

Sara Windahl ◽

Sung Kim ◽

...

Keyword(s):

Estrogen Receptor ◽

Cortical Bone ◽

Activation Function

Download Full-text

SCORING MODELING BASED ON NEURAL NETWORKS FOR DETERMINING A BANK BORROWER'S RATING

Economy of Ukraine ◽

10.15407/economyukr.2020.10.054 ◽

2020 ◽

Vol 2020 (10) ◽

pp. 54-62

Author(s):

Oleksii VASYLIEV ◽

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Architecture ◽

Statistical Data ◽

Activation Function ◽

Decision Making Process ◽

Neural Network Architecture ◽

Acceptable Accuracy ◽

The Neural Network ◽

Sigmoid Activation Function

The problem of applying neural networks to calculate ratings used in banking in the decision-making process on granting or not granting loans to borrowers is considered. The task is to determine the rating function of the borrower based on a set of statistical data on the effectiveness of loans provided by the bank. When constructing a regression model to calculate the rating function, it is necessary to know its general form. If so, the task is to calculate the parameters that are included in the expression for the rating function. In contrast to this approach, in the case of using neural networks, there is no need to specify the general form for the rating function. Instead, certain neural network architecture is chosen and parameters are calculated for it on the basis of statistical data. Importantly, the same neural network architecture can be used to process different sets of statistical data. The disadvantages of using neural networks include the need to calculate a large number of parameters. There is also no universal algorithm that would determine the optimal neural network architecture. As an example of the use of neural networks to determine the borrower's rating, a model system is considered, in which the borrower's rating is determined by a known non-analytical rating function. A neural network with two inner layers, which contain, respectively, three and two neurons and have a sigmoid activation function, is used for modeling. It is shown that the use of the neural network allows restoring the borrower's rating function with quite acceptable accuracy.

Download Full-text

The New Activation Function for Complex Valued Neural Networks: Complex Swish Function

4th International Symposium on Innovative Approaches in Engineering and Natural Sciences Proceedings ◽

10.36287/setsci.4.6.050 ◽

2019 ◽

Author(s):

Mehmet Çelebi ◽

Murat Ceylan

Keyword(s):

Neural Networks ◽

Activation Function ◽

Complex Valued

Download Full-text