Accelerating Convolutional Neural Network Using Discrete Orthogonal Transforms

10.36227/techrxiv.14593686 ◽

2021 ◽

Author(s):

Eduardo Reis ◽

Rachid Benlamri

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Activation Function ◽

Learning Rate ◽

Step Size ◽

Linear Operation ◽

Input Size ◽

Non Linear ◽

Discrete Orthogonal ◽

Complex Dataset

<div> <div> <div> <div> <p>All experiments are implemented in Python, using the PyTorch and the Torch-DCT libraries under the Google Colab environment. The Intel(R) Xeon(R) CPU @ 2.00GHz and a Tesla V100-SXM2-16GB GPU were assignment to the Google Colab runtime when profiling the DOT models. It should be noted that the current stable version of the PyTorch library, version 1.8.1, offers only the implementation of the FFT algorithm. Therefore, the implementations of the Hartley and Cosine transforms, listed in Table 1, are not implemented using the same optimizations (algorithm and code wise) adopted in the FFT. We benchmark the DOT methods using the LENET-5 network shown in Figure 10. The ReLU activation function is adopted a non-linear operation across the entire architecture. In this network, the convolutional operations have a kernel of size K = 5. The convolution is of type “valid”, i.e., padding is not applied to the input. Hence the output size M of each layer is smaller than its input size N, that is M=N−K+1. The optimizers used in our experiments are Adam, SGD, SGD with Momentum of 0.9, and RMSProp with α = 0.99. The StepLR scheduler is used with a step size of 20 epochs and a γ = 0.5. We train our model for 40 epochs using a mini-batch of size 128 and a learning rate of 0.001. Five datasets are used in order to benchmark the proposed DOT methods. Among them, we have the MNIST dataset and some variants of the MNIST dataset such as EMNIST, KMNIST and Fashion-MNIST. Additionally, a more complex dataset, CIFAR-10 is also used in our benchmark.</p> </div> </div> </div> </div>

Download Full-text

Use of Convolutional Neural Network for Fish Species Classification

Journal of Maritime & Transportation Science ◽

10.18048/2020.59.08. ◽

2020 ◽

Vol 59 (1) ◽

pp. 131-142

Author(s):

Daniel Štifanić ◽

Zlatan Car

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Fish Species ◽

Video Recording ◽

Activation Function ◽

Classification Performance ◽

Learning Rate ◽

Machine Learning Algorithms ◽

Species Classification ◽

The Impact

Fish population monitoring systems based on underwater video recording are becoming more popular nowadays, however, manual processing and analysis of such data can be time-consuming. Therefore, by utilizing machine learning algorithms, the data can be processed more efficiently. In this research, authors investigate the possibility of convolutional neural network (CNN) implementation for fish species classification. The dataset used in this research consists of four fish species (Plectroglyphidodon dickii, Chromis chrysura, Amphiprion clarkii, and Chaetodon lunulatus), which gives a total of 12859 fish images. For the aforementioned classification algorithm, different combinations of hyperparameters were examined as well as the impact of different activation functions on the classification performance. As a result, the best CNN classification performance was achieved when Identity activation function is applied to hidden layers, RMSprop is used as a solver with a learning rate of 0.001, and a learning rate decay of 1e-5. Accordingly, the proposed CNN model is capable of performing high-quality fish species classifications.

Download Full-text

Improving Convolutional Neural Network (CNN) Architecture (miniVGGNet) with Batch Normalization and Learning Rate Decay Factor for Image Classification

International Journal of Integrated Engineering ◽

10.30880/ijie.2019.11.04.006 ◽

2019 ◽

Vol 11 (4) ◽

Author(s):

Asmida Ismail ◽

◽

Siti Anom Ahmad ◽

Azura Che Soh ◽

Khair Hassan ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Image Classification ◽

Learning Rate ◽

Decay Factor ◽

Batch Normalization ◽

Rate Decay

Download Full-text

Research on Activation Function in Deep Convolutional Neural Network

Proceedings of the 2020 Conference on Artificial Intelligence and Healthcare ◽

10.1145/3433996.3434001 ◽

2020 ◽

Author(s):

Hong Hua Xiu

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Activation Function ◽

Deep Convolutional Neural Network

Download Full-text

Object recognition algorithm based on optimized nonlinear activation function-global convolutional neural network

The Visual Computer ◽

10.1007/s00371-020-02033-x ◽

2021 ◽

Author(s):

Feng-Ping An ◽

Jun-e Liu ◽

Lei Bai

Keyword(s):

Neural Network ◽

Object Recognition ◽

Convolutional Neural Network ◽

Activation Function ◽

Recognition Algorithm ◽

Nonlinear Activation Function

Download Full-text

A Convolutional Neural Network based Model with Improved Activation Function and Optimizer for Effective Intrusion Detection and Classification

2021 International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE) ◽

10.1109/icacite51222.2021.9404584 ◽

2021 ◽

Author(s):

Solaiman Kabir ◽

Sadman Sakib ◽

Md. Akib Hossain ◽

Safi Islam ◽

Muhammad Iqbal Hossain

Keyword(s):

Neural Network ◽

Intrusion Detection ◽

Convolutional Neural Network ◽

Activation Function

Download Full-text

Yamatani Activation: Edge Homogeneous Response Super Resolution Neural Network

10.36227/techrxiv.11861187.v1 ◽

2020 ◽

Author(s):

Takuma Yoshimura

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Dynamic Range ◽

Super Resolution ◽

Activation Function

In this research, I propose a two-variable activation function "Yamatani" that satisfies the first-degree homogeneity, and realize a super-resolution convolutional neural network that is independent of the dynamic range and symmetrical about the luminance inversion.

Download Full-text

AlexNet convolutional neural network to classify the types of Indonesian coffee beans

IOP Conference Series Earth and Environmental Science ◽

10.1088/1755-1315/905/1/012059 ◽

2021 ◽

Vol 905 (1) ◽

pp. 012059

Author(s):

Y Hendrawan ◽

B Rohmatulloh ◽

F I Ilmi ◽

M R Fauzy ◽

R Damayanti ◽

...

Keyword(s):

Neural Network ◽

Computer Vision ◽

Sensitivity Analysis ◽

Convolutional Neural Network ◽

Confusion Matrix ◽

Learning Rate ◽

Agricultural Products ◽

Coffee Bean ◽

Coffee Beans ◽

Non Destructive

Abstract Various types of Indonesian coffee are already popular internationally. Recently, there are still not many methods to classify the types of typical Indonesian coffee. Computer vision is a non-destructive method for classifying agricultural products. This study aimed to classify three types of Indonesian Arabica coffee beans, i.e., Gayo Aceh, Kintamani Bali, and Toraja Tongkonan, using computer vision. The classification method used was the AlexNet convolutional neural network with sensitivity analysis using several variations of the optimizer such as SGDm, Adam, and RMSProp and the learning rate of 0.00005 and 0.0001. Each type of coffee used 500 data for training and validation with the distribution of 70% training and 30% validation. The results showed that all AlexNet models achieved a perfect validation accuracy value of 100% in 1,040 iterations. This study also used 100 testing-set data on each type of coffee bean. In the testing confusion matrix, the accuracy reached 99.6%.

Download Full-text

IMPLEMENTASI DEEP LEARNING PADA PENGENALAN AKSARA SUNDA MENGGUNAKAN METODE CONVOLUTIONAL NEURAL NETWORK

INSERT : Information System and Emerging Technology Journal ◽

10.23887/insert.v2i1.37405 ◽

2021 ◽

Vol 2 (1) ◽

pp. 46

Author(s):

Shelvi Nur Rahmawati ◽

Eka Wahyu Hidayat ◽

Husni Mubarok

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Learning Rate

Aksara Sunda merupakan salah satu aksara daerah Indonesia khususnya masyarakat Sunda. Seiring dengan perkembangan teknologi seperti sekarang ini, bahasa daerah pun semakin tergerus dari waktu kewaktu. Aksara Sunda pun mulai terlupakan, bahkan jarang digunakan oleh masyarakat Sunda dalam kehidupan sehari-hari serta kurangnya memahami Bahasa daerahnya sendiri. Oleh karena itu, perlu adanya pelestarian Bahasa daerah yang dikembangkan menyesuaikan perkembangan jaman agar bisa terus dikenal dan dilestarikan, salahsatunya dengan identifikasi aksara Sunda menggunakan metode Convolutional Neural Network (CNN). Convolutional Neural Network (CNN) adalah bagian dari deep learning yang biasanya digunakan dalam pengolahan data gambar. Hasil dari penelitian ini menggunakan optimasi ADAM dengan penggunaan epoch 20, 50, 100 dan 500. Penggunaan epoch 500, learning rate 0.1 merupakan nilai tertinggi dengan akurasi 98.03%. Berdasarkan hasil data training dengan nilai epoch 100, learning rate 0.001 hasil akurasi sebesar 96.71% data training dan 92.02% data testing.

Download Full-text

IMPLEMENTASI DEEP LEARNING BERBASIS TENSORFLOW UNTUK PENGENALAN SIDIK JARI

Emitor: Jurnal Teknik Elektro ◽

10.23917/emitor.v18i01.6236 ◽

2018 ◽

Vol 18 (01) ◽

pp. 22-27 ◽

Cited By ~ 1

Author(s):

Royani Darma Nurfita ◽

Gunawan Ariyanto

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Input Data ◽

Data Preprocessing ◽

Learning Rate

Sistem pengenalan sidik jari banyak digunakan dala bidang biometrik untuk berbagai keperluan pada beberapa tahun terakhir ini. Pengenalan sidik jari digunakan karena memiliki pola yang rumit yang dapat mengenali seseorang dan merupakan identitas setiap manusia. Sidik jari juga banyak digunakan sebagai verifikasi maupun identifikasi. Permasalahan yang dihadapi dalam penelitian ini adalah komputer sulit melakukan klasifikasi objek salah satunya pada sidikjari. Dalam penelitian ini penulismenggunakan deep learning yang menggunakan metode Convolutional Neural Network (CNN) untuk mengatasi masalah tersebut. CNN digunakan untuk melakukan proses pembelajaran mesin pada komputer. Tahapan pada CNN adalah input data, preprocessing, proses training. Implementasi CNN yang digunakan library tensorflow dengan menggunakan bahasa pemrograman python. Dataset yang digunakan bersumber dari sebuah website kompetisi verifikasi sidik jari pada tahun 2004 yang menggunakan sensor bertipe opticalsensor “V300” by crossMatch dan didalamnya terdapat 80 gambar sidik jari. Proses pelatihan menggunakan data yang berukuran 24x24 pixel dan melakukan pengujian dengan membandingkan jumlah epoch dan learning rate sehingga diketahui bahwa jika semakin besar jumlah epoch dan semakin kecil learning rate maka semakin baik tingkat akurasi pelatihan yang didapatkan. Pada penelitian ini tingkat akurasi pelatihan yang dicapai sebesar 100%

Download Full-text