An Analysis of State-of-the-art Activation Functions For Supervised Deep Neural Network

Mapping Intimacies ◽

10.31219/osf.io/2zk6a ◽

2021 ◽

Author(s):

Anh Nguyen ◽

Khoa Pham ◽

Dat Ngo ◽

Thanh Ngo ◽

Lam Pham

Keyword(s):

Neural Network ◽

Supervised Classification ◽

Deep Neural Network ◽

State Of The Art ◽

Network Architectures ◽

Activation Functions ◽

Scene Classification ◽

Learning Network ◽

Deep Learning Network

This paper provides an analysis of state-of-the-art activation functions with respect to supervised classification of deep neural network. These activation functions comprise of Rectified Linear Units (ReLU), Exponential Linear Unit (ELU), Scaled Exponential Linear Unit (SELU), Gaussian Error Linear Unit (GELU), and the Inverse Square Root Linear Unit (ISRLU). To evaluate, experiments over two deep learning network architectures integrating these activation functions are conducted. The first model, basing on Multilayer Perceptron (MLP), is evaluated with MNIST dataset to perform these activation functions.Meanwhile, the second model, likely VGGish-based architecture, is applied for Acoustic Scene Classification (ASC) Task 1A in DCASE 2018 challenge, thus evaluate whether these activation functions work well in different datasets as well as different network architectures.

Download Full-text

CTL-DNNet: Effective Circular Traffic Light Recognition with a Deep Neural Network

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001417500379 ◽

2017 ◽

Vol 31 (11) ◽

pp. 1750037 ◽

Cited By ~ 3

Author(s):

Di Wang ◽

Hong Bao ◽

Feifei Zhang

Keyword(s):

Neural Network ◽

Autonomous Vehicles ◽

Deep Neural Network ◽

Road Traffic ◽

High Accuracy ◽

Activation Functions ◽

Traffic Light ◽

Traffic Lights ◽

Learning Network ◽

Deep Learning Network

This paper proposed an algorithm for a deep learning network for identifying circular traffic lights (CTL-DNNet). The sample labeling process uses translation to increase the number of positive samples, and the similarity is calculated to reduce the number of negative samples, thereby reducing overfitting. We use a dataset of approximately 370[Formula: see text]000 samples, with approximately 20[Formula: see text]000 positive samples and approximately 350[Formula: see text]000 negative samples. The datasets are generated from images taken at the Beijing Garden Expo. To obtain a very robust method for the detection of traffic lights, we use different layers, different cost functions and different activation functions of the depth neural network for training and comparison. Our algorithm has evaluated autonomous vehicles in varying illumination and gets the result with high accuracy and robustness. The experimental results show that CTL-DNNet is effective at recognizing road traffic lights in the Beijing Garden Expo area.

Download Full-text

A Deep-Neural-Network-Based Hybrid Method for Semi-Supervised Classification of Polarimetric SAR Data

2019 6th Asia-Pacific Conference on Synthetic Aperture Radar (APSAR) ◽

10.1109/apsar46974.2019.9048529 ◽

2019 ◽

Author(s):

Chi Liu ◽

Wenzhi Liao ◽

Heng-Chao Li ◽

Shaoguang Huang ◽

Wilfried Philips

Keyword(s):

Neural Network ◽

Hybrid Method ◽

Supervised Classification ◽

Deep Neural Network ◽

Polarimetric Sar ◽

Sar Data

Download Full-text

FLEX: Faithful Linguistic Explanations for Neural Net Based Model Decisions

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33012539 ◽

2019 ◽

Vol 33 ◽

pp. 2539-2546

Author(s):

Sandareka Wickramanayake ◽

Wynne Hsu ◽

Mong Li Lee

Keyword(s):

Neural Network ◽

Deep Learning ◽

State Of The Art ◽

Neural Net ◽

End User ◽

Learning Network ◽

Benchmark Datasets ◽

Deep Learning Network ◽

Post Hoc ◽

User Trust

Explaining the decisions of a Deep Learning Network is imperative to safeguard end-user trust. Such explanations must be intuitive, descriptive, and faithfully explain why a model makes its decisions. In this work, we propose a framework called FLEX (Faithful Linguistic EXplanations) that generates post-hoc linguistic justifications to rationalize the decision of a Convolutional Neural Network. FLEX explains a model’s decision in terms of features that are responsible for the decision. We derive a novel way to associate such features to words, and introduce a new decision-relevance metric that measures the faithfulness of an explanation to a model’s reasoning. Experiment results on two benchmark datasets demonstrate that the proposed framework can generate discriminative and faithful explanations compared to state-of-the-art explanation generators. We also show how FLEX can generate explanations for images of unseen classes as well as automatically annotate objects in images.

Download Full-text

A Novel Architecture to Classify Histopathology Images Using Convolutional Neural Networks

Applied Sciences ◽

10.3390/app10082929 ◽

2020 ◽

Vol 10 (8) ◽

pp. 2929 ◽

Cited By ~ 2

Author(s):

Ibrahem Kandel ◽

Mauro Castelli

Keyword(s):

Neural Network ◽

Neural Networks ◽

State Of The Art ◽

Treatment Plan ◽

Tissue Structure ◽

Activation Functions ◽

Proposed Model ◽

Histopathology Images ◽

Fully Connected

Histopathology is the study of tissue structure under the microscope to determine if the cells are normal or abnormal. Histopathology is a very important exam that is used to determine the patients’ treatment plan. The classification of histopathology images is very difficult to even an experienced pathologist, and a second opinion is often needed. Convolutional neural network (CNN), a particular type of deep learning architecture, obtained outstanding results in computer vision tasks like image classification. In this paper, we propose a novel CNN architecture to classify histopathology images. The proposed model consists of 15 convolution layers and two fully connected layers. A comparison between different activation functions was performed to detect the most efficient one, taking into account two different optimizers. To train and evaluate the proposed model, the publicly available PatchCamelyon dataset was used. The dataset consists of 220,000 annotated images for training and 57,000 unannotated images for testing. The proposed model achieved higher performance compared to the state-of-the-art architectures with an AUC of 95.46%.

Download Full-text

Two-stream feature aggregation deep neural network for scene classification of remote sensing images

Information Sciences ◽

10.1016/j.ins.2020.06.011 ◽

2020 ◽

Vol 539 ◽

pp. 250-268

Author(s):

Kejie Xu ◽

Hong Huang ◽

Peifang Deng ◽

Guangyao Shi

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Deep Neural Network ◽

Scene Classification ◽

Remote Sensing Images ◽

Feature Aggregation

Download Full-text

A Convolutional Neural Networks Based Method for Anthracnose Infected Walnut Tree Leaves Identification

Applied Sciences ◽

10.3390/app10020469 ◽

2020 ◽

Vol 10 (2) ◽

pp. 469 ◽

Cited By ~ 5

Author(s):

Athanasios Anagnostis ◽

Gavriela Asiminari ◽

Elpiniki Papageorgiou ◽

Dionysis Bochtis

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

State Of The Art ◽

Initial Step ◽

Machine Learning Algorithms ◽

Site Classification ◽

Network Architectures ◽

Tree Leaves ◽

Mode A

Anthracnose is a fungal disease that infects a large number of trees worldwide, damages intensively the canopy, and spreads with ease to neighboring trees, resulting in the potential destruction of whole crops. Even though it can be treated relatively easily with good sanitation, proper pruning and copper spraying, the main issue is the early detection for the prevention of spreading. Machine learning algorithms can offer the tools for the on-site classification of healthy and affected leaves, as an initial step towards managing such diseases. The purpose of this study was to build a robust convolutional neural network (CNN) model that is able to classify images of leaves, depending on whether or not these are infected by anthracnose, and therefore determine whether a tree is infected. A set of images were used both in grayscale and RGB mode, a fast Fourier transform was implemented for feature extraction, and a CNN architecture was selected based on its performance. Finally, the best performing method was compared with state-of-the-art convolutional neural network architectures.

Download Full-text

Classification of Fermi-LAT sources with deep learning using energy and time spectra

Monthly Notices of the Royal Astronomical Society ◽

10.1093/mnras/stab2389 ◽

2021 ◽

Vol 507 (3) ◽

pp. 4061-4073

Author(s):

Thorben Finke ◽

Michael Krämer ◽

Silvia Manconi

Keyword(s):

Neural Network ◽

Neural Networks ◽

Active Galactic Nuclei ◽

Photon Energy ◽

Deep Neural Network ◽

Gamma Ray ◽

Galactic Nuclei ◽

Network Architectures ◽

Neural Network Architectures

ABSTRACT Despite the growing number of gamma-ray sources detected by the Fermi-Large Area Telescope (LAT), about one-third of the sources in each survey remains of uncertain type. We present a new deep neural network approach for the classification of unidentified or unassociated gamma-ray sources in the last release of the Fermi-LAT catalogue (4FGL-DR2) obtained with 10 yr of data. In contrast to previous work, our method directly uses the measurements of the photon energy spectrum and time series as input for the classification, instead of specific, human-crafted features. Dense neural networks, and for the first time in the context of gamma-ray source classification recurrent neural networks, are studied in depth. We focus on the separation between extragalactic sources, i.e. active galactic nuclei, and Galactic pulsars, and on the further classification of pulsars into young and millisecond pulsars. Our neural network architectures provide powerful classifiers, with a performance that is comparable to previous analyses based on human-crafted features. Our benchmark neural network predicts that of the sources of uncertain type in the 4FGL-DR2 catalogue, 1050 are active galactic nuclei and 78 are Galactic pulsars, with both classes following the expected sky distribution and the clustering in the variability–curvature plane. We investigate the problem of sample selection bias by testing our architectures against a cross-match test data set using an older catalogue, and propose a feature selection algorithm using autoencoders. Our list of high-confidence candidate sources labelled by the neural networks provides a set of targets for further multiwavelength observations addressed to identify their nature. The deep neural network architectures we develop can be easily extended to include specific features, as well as multiwavelength data on the source photon energy and time spectra coming from different instruments.

Download Full-text

Deep Learning and Handcrafted Features for Virus Image Classification

Journal of Imaging ◽

10.3390/jimaging6120143 ◽

2020 ◽

Vol 6 (12) ◽

pp. 143

Author(s):

Loris Nanni ◽

Eugenio De Luca ◽

Marco Ludovico Facin ◽

Gianluca Maguolo

Keyword(s):

Neural Network ◽

Electron Microscopy ◽

Deep Neural Network ◽

State Of The Art ◽

Support Vector ◽

Transmission Electron ◽

Vector Machines ◽

Multiple Support ◽

Multiple Support Vector Machines

In this work, we present an ensemble of descriptors for the classification of virus images acquired using transmission electron microscopy. We trained multiple support vector machines on different sets of features extracted from the data. We used both handcrafted algorithms and a pretrained deep neural network as feature extractors. The proposed fusion strongly boosts the performance obtained by each stand-alone approach, obtaining state of the art performance.

Download Full-text