Design of High Performance Convolutional Neural Network for Lightweight Platform

As it is the seventh most-spoken language and fifth most-spoken native language in the world, the domain of Bengali handwritten character recognition has fascinated researchers for decades. Although other popular languages i.e., English, Chinese, Hindi, Spanish, etc. have received many contributions in the area of handwritten character recognition, Bengali has not received many noteworthy contributions in this domain because of the complex curvatures and similar writing fashions of Bengali characters. Previously, studies were conducted by using different approaches based on traditional learning, and deep learning. In this research, we proposed a low-cost novel convolutional neural network architecture for the recognition of Bengali characters with only 2.24 to 2.43 million parameters based on the number of output classes. We considered 8 different formations of CMATERdb datasets based on previous studies for the training phase. With experimental analysis, we showed that our proposed system outperformed previous works by a noteworthy margin for all 8 datasets. Moreover, we tested our trained models on other available Bengali characters datasets such as Ekush, BanglaLekha, and NumtaDB datasets. Our proposed architecture achieved 96–99% overall accuracies for these datasets as well. We believe our contributions will be beneficial for developing an automated high-performance recognition tool for Bengali handwritten characters.

Download Full-text

High Performance Kernel Architecture for Convolutional Neural Network Acceleration

Journal of Circuits System and Computers ◽

10.1142/s0218126621502662 ◽

2021 ◽

Author(s):

Anakhi Hazarika ◽

Soumyajit Poddarr ◽

Hafizur Rahaman

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

High Performance

Download Full-text

High Performance Convolutional Neural Network Accelerator Based on Design Space Exploration

10.1109/icisce50968.2020.00143 ◽

2020 ◽

Author(s):

Wei Pang ◽

Shengli Lu

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

High Performance ◽

Design Space Exploration ◽

Design Space ◽

Space Exploration

Download Full-text

High Performance Ensembled Convolutional Neural Network for Plant Species Recognition

Communications in Computer and Information Science - Computer Vision and Image Processing ◽

10.1007/978-981-16-1103-2_44 ◽

2021 ◽

pp. 526-538

Author(s):

S. Anubha Pearline ◽

V. Sathiesh Kumar

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Plant Species ◽

High Performance ◽

Species Recognition

Download Full-text

Research on improved wavelet convolutional wavelet neural networks

Applied Intelligence ◽

10.1007/s10489-020-02015-5 ◽

2020 ◽

Author(s):

Jing-Wei Liu ◽

Fang-Ling Zuo ◽

Ying-Xiao Guo ◽

Tian-Yue Li ◽

Jia-Ming Chen

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Image Classification ◽

High Performance ◽

Learning Algorithm ◽

Wavelet Neural Network ◽

Deep Learning Algorithm ◽

Main Work ◽

Good Ability ◽

Fully Connected

AbstractConvolutional neural network (CNN) is recognized as state of the art of deep learning algorithm, which has a good ability on the image classification and recognition. The problems of CNN are as follows: the precision, accuracy and efficiency of CNN are expected to be improved to satisfy the requirements of high performance. The main work is as follows: Firstly, wavelet convolutional neural network (wCNN) is proposed, where wavelet transform function is added to the convolutional layers of CNN. Secondly, wavelet convolutional wavelet neural network (wCwNN) is proposed, where fully connected neural network (FCNN) of wCNN and CNN are replaced by wavelet neural network (wNN). Thirdly, image classification experiments using CNN, wCNN and wCwNN algorithms, and comparison analysis are implemented with MNIST dataset. The effect of the improved methods are as follows: (1) Both precision and accuracy are improved. (2) The mean square error and the rate of error are reduced. (3) The complexitie of the improved algorithms is increased.

Download Full-text

Comparative evaluation of performance and scalability of convolutional neural network implementations on a multisystem HPC architecture

Journal of Physics Conference Series ◽

10.1088/1742-6596/2062/1/012008 ◽

2021 ◽

Vol 2062 (1) ◽

pp. 012008

Author(s):

Sunil Pandey ◽

Naresh Kumar Nagwani ◽

Shrish Verma

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

High Performance ◽

Processing Unit ◽

Neural Network Training ◽

Central Processing ◽

Network Training ◽

Hot Rolled ◽

Hot Rolled Steel ◽

Performance Computing

Abstract The convolutional neural network training algorithm has been implemented for a central processing unit based high performance multisystem architecture machine. The multisystem or the multicomputer is a parallel machine model which is essentially an abstraction of distributed memory parallel machines. In actual practice, this model corresponds to high performance computing clusters. The proposed implementation of the convolutional neural network training algorithm is based on modeling the convolutional neural network as a computational pipeline. The various functions or tasks of the convolutional neural network pipeline have been mapped onto the multiple nodes of a central processing unit based high performance computing cluster for task parallelism. The pipeline implementation provides a first level performance gain through pipeline parallelism. Further performance gains are obtained by distributing the convolutional neural network training onto the different nodes of the compute cluster. The two gains are multiplicative. In this work, the authors have carried out a comparative evaluation of the computational performance and scalability of this pipeline implementation of the convolutional neural network training with a distributed neural network software program which is based on conventional multi-model training and makes use of a centralized server. The dataset considered for this work is the North Eastern University’s hot rolled steel strip surface defects imaging dataset. In both the cases, the convolutional neural networks have been trained to classify the different defects on hot rolled steel strips on the basis of the input image. One hundred images corresponding to each class of defects have been used for the training in order to keep the training times manageable. The hyperparameters of both the convolutional neural networks were kept identical and the programs were run on the same computational cluster to enable fair comparison. Both the convolutional neural network implementations have been observed to train to nearly 80% training accuracy in 200 epochs. In effect, therefore, the comparison is on the time taken to complete the training epochs.

Download Full-text

Interpretable Diagnosis for Whole-Slide Melanoma Histology Images Using Convolutional Neural Network

Journal of Healthcare Engineering ◽

10.1155/2021/8396438 ◽

2021 ◽

Vol 2021 ◽

pp. 1-7

Author(s):

Peizhen Xie ◽

Ke Zuo ◽

Jie Liu ◽

Mingliang Chen ◽

Shuang Zhao ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

High Performance ◽

Operating Characteristic ◽

Distinctive Features ◽

Proposed Model ◽

Medical Image Diagnosis ◽

Melanoma Classification ◽

Activation Mapping ◽

Whole Slide Images

At present, deep learning-based medical image diagnosis had achieved high performance in several diseases. However, the black-box nature of the convolutional neural network (CNN) limits their role in diagnosis. In this study, a novel interpretable diagnosis pipeline using the CNN model was proposed. Furthermore, a sizeable melanoma database that contains 841 digital whole-slide images (WSIs) was built to train and evaluate the model. The model achieved strong melanoma classification ability (0.962 areas under the receiver operating characteristic, 0.887 sensitivity, and 0.925 specificity). Moreover, the proposed model outperformed the existing schemes in terms of accuracy that is 20 pathologists (0.933 vs 0.732 accuracy). Finally, the gradient-weighted class activation mapping (Grad-CAM) method was used to show the inner logic of the proposed model and its feasibility to improve diagnosis process in healthcare. The mechanism of feature heat maps which is visualized through a saliency mapping has demonstrated that features learned or extracted by the proposed model are compatible with the accepted pathological features. Conclusively, the proposed model provides a rapid and accurate diagnosis by locating the distinctive features of melanoma to build doctors’ trust in the CNNs’ diagnosis results.

Download Full-text

Classification of EgyptSat-1 Images Using Deep Learning Methods

International Journal of Sensors Wireless Communications and Control ◽

10.2174/2210327909666190207153858 ◽

2020 ◽

Vol 10 (1) ◽

pp. 37-46 ◽

Cited By ~ 3

Author(s):

Hatem Keshk ◽

Xu-Cheng Yin

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Deep Learning ◽

Convolutional Neural Network ◽

High Performance ◽

Satellite Images ◽

Satellite Image ◽

Aerial Photographs ◽

Average Accuracy

Background: Deep Learning (DL) neural network methods have become a hotspot subject of research in the remote sensing field. Classification of aerial satellite images depends on spectral content, which is a challenging topic in remote sensing. Objective: With the aim to accomplish a high performance and accuracy of Egyptsat-1 satellite image classification, the use of the Convolutional Neural Network (CNN) is raised in this paper because CNN is considered a leading deep learning method. CNN is developed to classify aerial photographs into land cover classes such as urban, vegetation, desert, water bodies, soil, roads, etc. In our work, a comparison between MAXIMUM Likelihood (ML) which represents the traditional supervised classification methods and CNN method is conducted. Conclusion: This research finds that CNN outperforms ML by 9%. The convolutional neural network has better classification result, which reached 92.25% as its average accuracy. Also, the experiments showed that the convolutional neural network is the most satisfactory and effective classification method applied to classify Egyptsat-1 satellite images.

Download Full-text

Multi-Task Convolutional Neural Network Leading to High Performance and Interpretability via Attribute Estimation

IEICE Transactions on Fundamentals of Electronics Communications and Computer Sciences ◽

10.1587/transfun.2020sml0006 ◽

2020 ◽

Vol E103.A (12) ◽

pp. 1609-1612

Author(s):

Keisuke MAEDA ◽

Kazaha HORII ◽

Takahiro OGAWA ◽

Miki HASEYAMA

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

High Performance

Download Full-text

NU-LiteNet: Mobile Landmark Recognition using Convolutional Neural Networks

ECTI Transactions on Computer and Information Technology (ECTI-CIT) ◽

10.37936/ecti-cit.2019131.165074 ◽

2019 ◽

Vol 13 (1) ◽

pp. 21-28 ◽

Cited By ~ 1

Author(s):

Chakkrit Termritthikun ◽

Paisarn Muneesawang

Keyword(s):

Neural Network ◽

Neural Networks ◽

Convolutional Neural Network ◽

Mobile Devices ◽

Mobile Phones ◽

High Performance ◽

Deep Neural Networks ◽

Landmark Recognition ◽

Research Problems ◽

Model Size

The growth of high-performance mobile devices has resulted in more research into on-device image recognition. The research problems have been the latency and accuracy of automatic recognition, which remain as obstacles to its real-world usage. Although the recently developed deep neural networks can achieve accuracy comparable to that of a human user, some of them are still too slow. This paper describes the development of the architecture of a new convolutional neural network model, NU-LiteNet. For this, SqueezeNet was developed to reduce the model size to a degree suitable for smartphones. The model size of NU-LiteNet was therefore 2.6 times smaller than that of SqueezeNet. The model outperformed other Convolutional Neural Network (CNN) models for mobile devices (eg. SqueezeNet and MobileNet) with an accuracy of 81.15% and 69.58% on Singapore and Paris landmark datasets respectively. The shortest execution time of 0.7 seconds per image was recorded with NU-LiteNet on mobile phones.

Download Full-text