Evolution of Activation Functions: An Empirical Investigation

2021
Vol 1 (2)
pp. 1-36
Author(s):
Andrew Nader
Danielle Azar

The hyper-parameters of a neural network are traditionally designed through a time-consuming process of trial and error that requires substantial expert knowledge. Neural Architecture Search algorithms aim to take the human out of the loop by automatically finding a good set of hyper-parameters for the problem at hand. These algorithms have mostly focused on hyper-parameters such as the architectural configuration of the hidden layers and the connectivity of the hidden neurons, but there has been relatively little work on automating the search for completely new activation functions, which are among the most crucial hyper-parameters to choose. Several widely used activation functions are simple and work well, but there is nonetheless interest in finding better ones. The literature has mostly focused on designing new activation functions by hand or choosing from a set of predefined functions, whereas this work presents an evolutionary algorithm to automate the search for completely new activation functions. We compare these newly evolved activation functions to other existing and commonly used activation functions. The results are favorable and are obtained by averaging the performance of the activation functions found over 30 runs, with experiments conducted on 10 different datasets and architectures to ensure the statistical robustness of the study.
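The evolutionary search described above can be illustrated with a minimal sketch: candidate activations are compositions of primitive functions, fitness is the loss of a small network using the candidate, and a simple elitist loop mutates the best candidates. All names, the primitive set, and the toy task are illustrative assumptions, not the authors' actual algorithm.

```python
import numpy as np

rng = np.random.default_rng(0)

# Unary primitives that candidate activations are composed from (assumed set).
UNARY = {
    "identity": lambda x: x,
    "tanh": np.tanh,
    "relu": lambda x: np.maximum(x, 0.0),
    "sin": np.sin,
    "square": lambda x: np.clip(x * x, -50.0, 50.0),
}

def make_candidate():
    """A candidate activation = composition of two random primitives."""
    outer, inner = rng.choice(list(UNARY), size=2)
    return (outer, inner)

def apply_candidate(cand, x):
    outer, inner = cand
    return UNARY[outer](UNARY[inner](x))

def fitness(cand):
    """Fit a tiny random-feature network (hidden layer frozen, output layer
    solved by least squares) on a toy regression task; lower loss is fitter."""
    X = np.linspace(-2, 2, 200).reshape(-1, 1)
    y = np.sin(3 * X) + 0.5 * X                  # toy target
    W = rng.normal(size=(1, 32))
    b = rng.normal(size=32)
    H = apply_candidate(cand, X @ W + b)         # hidden activations
    w_out, *_ = np.linalg.lstsq(H, y, rcond=None)
    return -float(np.mean((H @ w_out - y) ** 2))

# Simple elitist evolution: keep the best, mutate them, add random immigrants.
pop = [make_candidate() for _ in range(12)]
for gen in range(10):
    pop.sort(key=fitness, reverse=True)
    elite = pop[:4]
    mutants = [(e[0], rng.choice(list(UNARY))) for e in elite]
    pop = elite + mutants + [make_candidate() for _ in range(4)]

best = max(pop, key=fitness)
print("best composed activation:", best)
```

The real study evolves richer function expressions and averages over many runs and datasets; this sketch only shows the shape of the search loop.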

2019
Vol 1 (1)
pp. p8
Author(s):
Jamilu Auwalu Adamu

One of the objectives of this paper is to incorporate fat-tail effects into, for instance, the Sigmoid in order to introduce transparency and stability into the existing stochastic activation functions. Secondly, according to the literature reviewed, the existing set of activation functions was introduced into deep learning artificial neural networks through the “window” rather than through the “legitimate door”, since they rest on “trial and error” and “arbitrary assumptions”; thus, the author proposes “scientific facts”, “definite rules: Jameel’s Stochastic ANNAF Criterion”, and a “lemma” to substitute for (though not necessarily replace) the existing set of stochastic activation functions, for instance the Sigmoid among others. This research is expected to open the “black box” of deep learning artificial neural networks. The author proposes a new set of advanced optimized fat-tailed stochastic activation functions emanating from AI-ML-purified stocks data, namely: the Log-Logistic (3P) probability distribution (1st), Cauchy probability distribution (2nd), Pearson 5 (3P) probability distribution (3rd), Burr (4P) probability distribution (4th), Fatigue Life (3P) probability distribution (5th), Inv. Gaussian (3P) probability distribution (6th), Dagum (4P) probability distribution (7th), and Lognormal (3P) probability distribution (8th), for the successful conduct of both forward and backward propagation in deep learning artificial neural networks. However, this paper did not check the monotone differentiability of the proposed distributions. Appendices A, B, and C present and test the performance of the stressed Sigmoid and the optimized activation functions using stocks data (1991-2014) of Microsoft Corporation (MSFT), Exxon Mobil (XOM), Chevron Corporation (CVX), Honda Motor Corporation (HMC), General Electric (GE), and U.S. fundamental macroeconomic parameters; the results were found fascinating.
Thus, the first three distributions are deemed excellent activation functions for successfully conducting any stock deep learning artificial neural network, and distributions 4 to 8 are also good advanced optimized activation functions. Generally, this research revealed that whether the advanced optimized activation functions satisfy Jameel’s ANNAF Stochastic Criterion depends on the referenced purified AI data set, the time change, and the area of application, in contrast to the existing “trial and error” and “arbitrary assumptions” behind Sigmoid, Tanh, Softmax, ReLU, and Leaky ReLU.
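The core idea of a fat-tailed stochastic activation can be sketched by using a heavy-tailed distribution's CDF as a sigmoid-like squashing function. The Cauchy CDF (the paper's 2nd distribution) is shown below; the location and scale parameters here are assumed illustrative hyper-parameters, not values from the paper.

```python
import numpy as np

def cauchy_cdf_activation(x, x0=0.0, gamma=1.0):
    """Cauchy CDF: F(x) = 1/2 + arctan((x - x0)/gamma)/pi, bounded in (0, 1)."""
    return 0.5 + np.arctan((x - x0) / gamma) / np.pi

def cauchy_cdf_grad(x, x0=0.0, gamma=1.0):
    """Derivative of the activation (the Cauchy PDF), as needed for backprop."""
    return 1.0 / (np.pi * gamma * (1.0 + ((x - x0) / gamma) ** 2))

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

x = np.array([-10.0, -1.0, 0.0, 1.0, 10.0])
print("cauchy :", cauchy_cdf_activation(x))
print("sigmoid:", sigmoid(x))
# The Cauchy CDF approaches 0 and 1 polynomially rather than exponentially,
# so its gradient in the tails decays far more slowly than the sigmoid's.
```

This slow tail decay is the "fat-tail effect" motivating the proposal: gradients remain non-negligible for extreme pre-activations, where the standard sigmoid saturates.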


Author(s):
Eduardo Masato Iyoda
Kaoru Hirota
Fernando J. Von Zuben
A nonparametric neural architecture called the Sigma-Pi Cascade extended Hybrid Neural Network (σπ-CHNN) is proposed to extend the approximation capabilities of neural architectures such as Projection Pursuit Learning (PPL) and Hybrid Neural Networks (HNN). Like PPL and HNN, σπ-CHNN uses distinct activation functions in its neurons but, unlike these previous neural architectures, it may employ multiplicative operators in its hidden neurons, enabling it to extract higher-order information from the given data. σπ-CHNN uses arbitrary connectivity patterns among neurons. An evolutionary learning algorithm combined with a conjugate gradient algorithm is proposed to automatically design the topology and weights of σπ-CHNN. σπ-CHNN performance is evaluated on five benchmark regression problems. Results show that σπ-CHNN provides competitive performance compared to PPL and HNN in most problems, either in the computational requirements to implement the architecture or in approximation accuracy. In some problems, σπ-CHNN reduces the approximation error by an order of magnitude (10^-1) compared to PPL and HNN, whereas in other cases it achieves the same approximation error while using fewer hidden neurons (usually one fewer than PPL and HNN).
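The multiplicative (sigma-pi) hidden unit mentioned above can be sketched as follows: the unit multiplies selected inputs together before the weighted sum, so it can represent input interactions a purely additive neuron cannot. The index sets, weights, and activation below are illustrative assumptions, not parameters from the paper.

```python
import numpy as np

def sigma_pi_unit(x, terms, weights, activation=np.tanh):
    """y = activation( sum_k w_k * prod_{i in terms[k]} x_i ).

    x       : 1-D input vector
    terms   : list of index tuples; each tuple is one product term
    weights : one weight per product term
    """
    products = np.array([np.prod(x[list(t)]) for t in terms])
    return activation(weights @ products)

x = np.array([0.5, -1.0, 2.0])
# Two first-order terms plus one second-order (multiplicative) term x0*x2.
terms = [(0,), (1,), (0, 2)]
weights = np.array([1.0, 0.5, -2.0])
y = sigma_pi_unit(x, terms, weights)
print(y)
```

With only first-order terms this reduces to an ordinary weighted-sum neuron; the higher-order terms are what let σπ-style units capture interactions directly in a single hidden unit.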


2021
Vol 2 (1)
pp. 1-25
Author(s):
Yongsen Ma
Sheheryar Arshad
Swetha Muniraju
Eric Torkildson
Enrico Rantala
...

In recent years, Channel State Information (CSI) measured by WiFi has been widely used for human activity recognition. In this article, we propose a deep learning design for location- and person-independent activity recognition with WiFi. The proposed design consists of three Deep Neural Networks (DNNs): a 2D Convolutional Neural Network (CNN) as the recognition algorithm, a 1D CNN as the state machine, and a reinforcement learning agent for neural architecture search. The recognition algorithm learns location- and person-independent features from different perspectives of CSI data. The state machine learns temporal dependency information from history classification results. The reinforcement learning agent optimizes the neural architecture of the recognition algorithm using a Recurrent Neural Network (RNN) with Long Short-Term Memory (LSTM). The proposed design is evaluated in a lab environment with different WiFi device locations, antenna orientations, sitting/standing/walking locations/orientations, and multiple persons. The proposed design achieves 97% average accuracy when the testing devices and persons are not seen during training. It is also evaluated on two public datasets, with accuracies of 80% and 83%. The proposed design requires very little human effort for ground-truth labeling, feature engineering, signal processing, and tuning of learning parameters and hyperparameters.
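The role of the state machine above, learning temporal dependency from history classification results, can be illustrated with a toy stand-in. The paper's state machine is a trained 1D CNN; the sketch below substitutes a simple sliding-window majority vote, which captures the same intuition that temporal context corrects isolated per-frame misclassifications. The window size and labels are assumptions for illustration.

```python
import numpy as np

def smooth_predictions(frame_preds, window=5):
    """Replace each per-frame label with the majority label in a centered
    window over the classification history."""
    preds = np.asarray(frame_preds)
    half = window // 2
    out = []
    for i in range(len(preds)):
        lo, hi = max(0, i - half), min(len(preds), i + half + 1)
        vals, counts = np.unique(preds[lo:hi], return_counts=True)
        out.append(vals[np.argmax(counts)])
    return np.array(out)

# A walking bout with two spurious "sitting" frames (0 = sit, 1 = walk):
raw = [1, 1, 0, 1, 1, 1, 0, 1, 1]
print(smooth_predictions(raw))  # the isolated 0s are voted away
```

A learned 1D CNN over the same history can additionally weight recent frames and model activity transitions, which a fixed majority vote cannot.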


Author(s):
Jonathan George
Armin Mehrabian
Rubab Amin
Paul R. Prucnal
Tarek El-Ghazawi
...

2014
Vol 667
pp. 60-63
Author(s):
Wei Guo
Zhen Ji Zhang

A performance evaluation system for finance-invested transportation projects is researched, incorporating sub-modules for highway project evaluation, waterway project evaluation, passenger station project evaluation, and energy-saving project evaluation. In addition, expert knowledge is embedded in the system; a multi-layer neural network and fuzzy-set theory are used to implement the performance evaluation system for finance-invested transportation projects, and the feasibility and effectiveness of the evaluation system are finally verified in practice.
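The fuzzy-set side of such a system can be sketched as follows: each sub-module score is mapped to linguistic grades via triangular membership functions and the sub-module scores are combined with expert-assigned weights. All breakpoints, weights, and scores below are illustrative assumptions, not values from the paper.

```python
import numpy as np

def triangular(x, a, b, c):
    """Triangular membership function on [a, c], peaking at b."""
    if x <= a or x >= c:
        return 0.0
    return (x - a) / (b - a) if x <= b else (c - x) / (c - b)

def fuzzy_grade(score):
    """Membership degrees of a 0-100 score in three linguistic grades."""
    return {
        "poor": triangular(score, -1, 0, 50),
        "fair": triangular(score, 25, 50, 75),
        "good": triangular(score, 50, 100, 101),
    }

# Sub-module scores (highway, waterway, passenger stations, energy saving)
# and expert-assigned importance weights (illustrative only).
scores = np.array([80.0, 65.0, 70.0, 55.0])
weights = np.array([0.4, 0.2, 0.2, 0.2])
overall = float(weights @ scores)
print("overall:", overall, fuzzy_grade(overall))
```

In the full system a multi-layer neural network would learn the mapping from sub-module indicators to scores, with the fuzzy grades providing the expert-interpretable output layer.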

