Analysing the Correlation between Convolutional Filter Sizes and Classification Accuracy Using MNIST

2021 ◽  
Author(s):  
Gentian Gashi

Handwriting recognition is the process of automatically converting handwritten text into electronic text (letter codes) usable by a computer. The increased reliance on technology during the COVID-19 pandemic has highlighted the importance of storing and digitising information accurately and efficiently. Interpreting handwriting remains difficult for both humans and computers because of varied writing styles and skewed characters. In this study, we conducted a correlational analysis of the association between filter size and the classification accuracy of convolutional neural networks (CNNs). Testing was conducted on the publicly available MNIST database of handwritten digits (LeCun and Cortes, 2010), which consists of a training set (N=60,000) and a testing set (N=10,000). Using ANOVA, our results indicate a significant association (p < .001, significant at the 0.05 level) between filter size and classification accuracy. However, this significance is only present when increasing the filter size from 1x1 to 2x2; larger filter sizes yielded no significant improvement, so a filter size above 2x2 cannot be recommended.
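To make the experimental setup concrete, the sketch below shows one way such a filter-size comparison could be run in PyTorch. The single-convolution architecture, training settings, and filter sizes tried are illustrative assumptions, not the author's code.

```python
# Illustrative sketch (not the author's code): train the same single-convolution
# CNN on MNIST while varying only the filter size, then compare test accuracies.
import torch
import torch.nn as nn
from torchvision import datasets, transforms

def make_cnn(filter_size: int) -> nn.Sequential:
    # "same" padding keeps the feature map at 28x28 regardless of filter size,
    # so only the receptive field of the convolution changes between runs.
    return nn.Sequential(
        nn.Conv2d(1, 16, kernel_size=filter_size, padding="same"),
        nn.ReLU(),
        nn.Flatten(),
        nn.Linear(16 * 28 * 28, 10),
    )

train = datasets.MNIST(".", train=True, download=True, transform=transforms.ToTensor())
test = datasets.MNIST(".", train=False, download=True, transform=transforms.ToTensor())
train_dl = torch.utils.data.DataLoader(train, batch_size=128, shuffle=True)
test_dl = torch.utils.data.DataLoader(test, batch_size=256)

for k in (1, 2, 3, 5):                     # filter sizes under comparison
    model = make_cnn(k)
    opt = torch.optim.Adam(model.parameters())
    loss_fn = nn.CrossEntropyLoss()
    for x, y in train_dl:                  # one epoch: enough for a rough comparison
        opt.zero_grad()
        loss_fn(model(x), y).backward()
        opt.step()
    with torch.no_grad():
        correct = sum((model(x).argmax(1) == y).sum().item() for x, y in test_dl)
    print(f"{k}x{k} filters: test accuracy {correct / len(test):.4f}")
```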

Author(s):  
Sebastian Nowak ◽  
Narine Mesropyan ◽  
Anton Faron ◽  
Wolfgang Block ◽  
Martin Reuter ◽  
...  

Abstract
Objectives: To investigate the diagnostic performance of deep transfer learning (DTL) to detect liver cirrhosis from clinical MRI.
Methods: The dataset for this retrospective analysis consisted of 713 (343 female) patients who underwent liver MRI between 2017 and 2019. In total, 553 of these subjects had a confirmed diagnosis of liver cirrhosis, while the remainder had no history of liver disease. T2-weighted MRI slices at the level of the caudate lobe were manually exported for DTL analysis. Data were randomly split into training, validation, and test sets (70%/15%/15%). A ResNet50 convolutional neural network (CNN) pre-trained on the ImageNet archive was used for cirrhosis detection with and without upstream liver segmentation. Classification performance for detection of liver cirrhosis was compared to two radiologists with different levels of experience (4th-year resident, board-certified radiologist). Segmentation was performed using a U-Net architecture built on a pre-trained ResNet34 encoder. Differences in classification accuracy were assessed by the χ²-test.
Results: Dice coefficients for automatic segmentation were above 0.98 for both validation and test data. The classification accuracy of liver cirrhosis on validation (vACC) and test (tACC) data for the DTL pipeline with upstream liver segmentation (vACC = 0.99, tACC = 0.96) was significantly higher compared to the resident (vACC = 0.88, p < 0.01; tACC = 0.91, p = 0.01) and to the board-certified radiologist (vACC = 0.96, p < 0.01; tACC = 0.90, p < 0.01).
Conclusion: This proof-of-principle study demonstrates the potential of DTL for detecting cirrhosis based on standard T2-weighted MRI. The presented method for image-based diagnosis of liver cirrhosis demonstrated expert-level classification accuracy.
Key Points
• A pipeline consisting of two convolutional neural networks (CNNs) pre-trained on an extensive natural image database (ImageNet archive) enables detection of liver cirrhosis on standard T2-weighted MRI.
• High classification accuracy can be achieved even without altering the pre-trained parameters of the convolutional neural networks.
• Other abdominal structures apart from the liver were relevant for detection when the network was trained on unsegmented images.
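The transfer-learning setup described above, a ResNet50 with frozen ImageNet weights and only a new classification head trained, could be set up along the lines below in PyTorch/torchvision. This is a minimal sketch under those assumptions, not the authors' pipeline, and it omits the upstream U-Net segmentation step.

```python
# Minimal sketch of deep transfer learning with frozen pre-trained weights
# (illustrative; not the authors' pipeline).
import torch
import torch.nn as nn
from torchvision import models

model = models.resnet50(weights="IMAGENET1K_V1")   # pre-trained on ImageNet
for p in model.parameters():
    p.requires_grad = False        # keep the pre-trained parameters unchanged

# Replace the 1000-class ImageNet head with a binary cirrhosis/no-cirrhosis head.
model.fc = nn.Linear(model.fc.in_features, 2)

# Only the new head's parameters are optimized.
optimizer = torch.optim.Adam(model.fc.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

def train_step(images: torch.Tensor, labels: torch.Tensor) -> float:
    """images: (N, 3, H, W) MRI slices replicated to 3 channels; labels: (N,)."""
    optimizer.zero_grad()
    loss = loss_fn(model(images), labels)
    loss.backward()
    optimizer.step()
    return loss.item()
```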


2021 ◽  
Vol 65 (1) ◽  
pp. 11-22
Author(s):  
Mengyao Lu ◽  
Shuwen Jiang ◽  
Cong Wang ◽  
Dong Chen ◽  
Tian’en Chen

Highlights
• A classification model for the front and back sides of tobacco leaves was developed for application in industry.
• A tobacco leaf grading method that combines a CNN with double-branch integration was proposed.
• The A-ResNet network was proposed and compared with other classic CNN networks.
• The grading accuracy across eight grades was 91.30% and the testing time was 82.180 ms, showing relatively high classification accuracy and efficiency.

Abstract. Flue-cured tobacco leaf grading is a key step in the production and processing of Chinese-style cigarette raw materials, directly affecting cigarette blend and quality stability. At present, manual grading of tobacco leaves is dominant in China, resulting in unsatisfactory grading quality and consuming considerable material and financial resources. In this study, for fast, accurate, and non-destructive tobacco leaf grading, 2,791 flue-cured tobacco leaves of eight different grades from south Anhui Province, China, were chosen as the study sample, and a tobacco leaf grading method combining convolutional neural networks with double-branch integration was proposed. First, a classification model for the front and back sides of tobacco leaves was trained by transfer learning. Second, two processing methods (equal-scaled resizing and cropping) were used to obtain global images and local patches from the front sides of tobacco leaves. A global image-based tobacco leaf grading model was then developed using the proposed A-ResNet-65 network, and a local patch-based tobacco leaf grading model was developed using the ResNet-34 network. These two networks were compared with classic deep learning networks such as VGGNet, GoogLeNet-V3, and ResNet. Finally, the grading results of the two grading models were integrated to realize tobacco leaf grading. The tobacco leaf classification accuracy of the final model, for eight different grades, was 91.30%, and grading of a single tobacco leaf required 82.180 ms. The proposed method achieved a relatively high grading accuracy and efficiency. It provides a method for industrial implementation of tobacco leaf grading and offers a new approach for the quality grading of other agricultural products.

Keywords: Convolutional neural network, Deep learning, Image classification, Transfer learning, Tobacco leaf grading
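As a rough illustration of the double-branch integration step, the sketch below fuses a global-image branch and a local-patch branch by averaging their softmax outputs. The averaging rule, the function names, and the interfaces are assumptions; the abstract does not specify the exact fusion scheme.

```python
# Hypothetical sketch of double-branch integration: average the softmax
# outputs of a global-image branch and a local-patch branch over one leaf.
# The averaging rule is an assumption, not the paper's stated fusion step.
import torch

@torch.no_grad()
def grade_leaf(global_model, patch_model, global_img, patches):
    """global_img: (1, 3, H, W); patches: (P, 3, h, w) crops of the same leaf."""
    p_global = torch.softmax(global_model(global_img), dim=1)              # (1, 8)
    p_patches = torch.softmax(patch_model(patches), dim=1).mean(0, keepdim=True)
    fused = (p_global + p_patches) / 2                                     # (1, 8)
    return fused.argmax(1).item()    # index of one of the eight grades
```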


2020 ◽  
Vol 12 (7) ◽  
pp. 1070 ◽  
Author(s):  
Somayeh Nezami ◽  
Ehsan Khoramshahi ◽  
Olli Nevalainen ◽  
Ilkka Pölönen ◽  
Eija Honkavaara

Interest in drone solutions for forestry applications is growing. Using drones, datasets can be captured flexibly and at high spatial and temporal resolutions when needed. Fundamental tasks in forestry applications include the detection of individual trees, tree species classification, and biomass estimation. Deep neural networks (DNNs) have shown superior results compared with conventional machine learning methods such as the multi-layer perceptron (MLP) when large amounts of input data are available. The objective of this research is to investigate 3D convolutional neural networks (3D-CNNs) for classifying three major tree species in a boreal forest: pine, spruce, and birch. The proposed 3D-CNN models were employed to classify tree species at a test site in Finland. The classifiers were trained with a dataset of 3,039 manually labelled trees, and their accuracies were then assessed on an independent dataset of 803 records. To find the most efficient feature combination, we compared the performance of 3D-CNN models trained with hyperspectral (HS) channels, Red-Green-Blue (RGB) channels, and the canopy height model (CHM), separately and combined. The proposed 3D-CNN model with RGB and HS layers produced the highest classification accuracy: the producer accuracies of the best 3D-CNN classifier on the test dataset were 99.6%, 94.8%, and 97.4% for pine, spruce, and birch, respectively. The best 3D-CNN classifier produced ~5% higher classification accuracy than the MLP with all layers. Our results suggest that the proposed method provides excellent classification results with acceptable performance metrics for HS datasets. The pine class was detectable in most layers; spruce was most detectable in the RGB data, while birch was most detectable in the HS layers. Furthermore, the RGB datasets provide acceptable results for many low-accuracy applications.
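A 3D-CNN of the kind investigated here treats the stacked spectral channels as a third input dimension over which the convolution slides. The toy architecture below shows the basic Conv3d pattern in PyTorch; the layer counts, channel widths, and input shape are invented for illustration and are not the paper's architecture.

```python
# Toy 3D-CNN for per-tree species classification (pine/spruce/birch).
# The layer sizes and input shape are illustrative assumptions only.
import torch
import torch.nn as nn

class TreeSpecies3DCNN(nn.Module):
    def __init__(self, n_classes: int = 3):
        super().__init__()
        self.features = nn.Sequential(
            # Input: (N, 1, bands, height, width), e.g. HS + RGB + CHM stacked
            # along the "bands" axis and treated as a spatial depth dimension.
            nn.Conv3d(1, 8, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool3d(2),
            nn.Conv3d(8, 16, kernel_size=3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool3d(1),     # global pooling to a 16-dim descriptor
        )
        self.classifier = nn.Linear(16, n_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.classifier(self.features(x).flatten(1))

model = TreeSpecies3DCNN()
dummy = torch.randn(2, 1, 36, 32, 32)    # 2 trees, 36 bands, 32x32-pixel patches
print(model(dummy).shape)                # torch.Size([2, 3])
```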


Sensors ◽  
2020 ◽  
Vol 20 (23) ◽  
pp. 6763
Author(s):  
Mads Jochumsen ◽  
Imran Khan Niazi ◽  
Muhammad Zia ur Rehman ◽  
Imran Amjad ◽  
Muhammad Shafique ◽  
...  

Brain- and muscle-triggered exoskeletons have been proposed as a means for motor training after a stroke. With the possibility of performing different movement types with an exoskeleton, it is possible to introduce task variability in training. It is difficult to decode different movement types simultaneously from brain activity, but it may be possible from residual muscle activity that many patients have or quickly regain. This study investigates whether nine different motion classes of the hand and forearm could be decoded from forearm EMG in 15 stroke patients. This study also evaluates the test-retest reliability of a classical, but simple, classifier (linear discriminant analysis) and advanced, but more computationally intensive, classifiers (autoencoders and convolutional neural networks). Moreover, the association between the level of motor impairment and classification accuracy was tested. Three channels of surface EMG were recorded during the following motion classes: Hand Close, Hand Open, Wrist Extension, Wrist Flexion, Supination, Pronation, Lateral Grasp, Pinch Grasp, and Rest. Six repetitions of each motion class were performed on two different days. Hudgins time-domain features were extracted and classified using linear discriminant analysis and autoencoders, and raw EMG was classified with convolutional neural networks. On average, 79 ± 12% and 80 ± 12% (autoencoders) of the movements were correctly classified for days 1 and 2, respectively, with an intraclass correlation coefficient of 0.88. No association was found between the level of motor impairment and classification accuracy (Spearman correlation: 0.24). It was shown that nine motion classes could be decoded from residual EMG, with autoencoders being the best classification approach, and that the results were reliable across days; this may have implications for the development of EMG-controlled exoskeletons for training in the patient’s home.
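The classical pipeline in this study, Hudgins time-domain features fed to linear discriminant analysis, is compact enough to sketch. The four standard Hudgins features are mean absolute value, waveform length, zero crossings, and slope sign changes; the windowing, the zero-crossing threshold, and the data layout below are assumptions for illustration.

```python
# Sketch of the classical pipeline: Hudgins time-domain features + LDA.
# The zero-crossing threshold and data layout are illustrative assumptions.
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

def hudgins_features(window: np.ndarray, eps: float = 1e-4) -> np.ndarray:
    """window: (samples, channels) EMG segment; returns 4 features per channel."""
    d = np.diff(window, axis=0)
    mav = np.mean(np.abs(window), axis=0)                    # mean absolute value
    wl = np.sum(np.abs(d), axis=0)                           # waveform length
    zc = np.sum((window[:-1] * window[1:] < 0)
                & (np.abs(d) > eps), axis=0)                 # zero crossings
    ssc = np.sum(d[:-1] * d[1:] < 0, axis=0)                 # slope sign changes
    return np.concatenate([mav, wl, zc, ssc]).astype(float)

# Assumed usage: windows is a list of (samples, 3) EMG segments, y holds the
# nine motion-class labels for the corresponding windows.
# X = np.stack([hudgins_features(w) for w in windows])
# clf = LinearDiscriminantAnalysis().fit(X, y)
# print(clf.score(X_test, y_test))
```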


Complexity ◽  
2019 ◽  
Vol 2019 ◽  
pp. 1-16 ◽  
Author(s):  
Alejandro Baldominos ◽  
Yago Saez ◽  
Pedro Isasi

Neuroevolution is the field of study that uses evolutionary computation to optimize certain aspects of the design of neural networks, most often their topology and hyperparameters. The field emerged in the late 1980s, but only in recent years has it become mature enough to enable the optimization of deep learning models such as convolutional neural networks. In this paper, we build on previous work, applying neuroevolution to optimize the topology of deep neural networks for handwritten character recognition. Moreover, we take advantage of the fact that evolutionary algorithms optimize a population of candidate solutions by combining a set of the best evolved models into a committee of convolutional neural networks. This process is enhanced by using specific mechanisms to preserve the diversity of the population. Additionally, we address one of the disadvantages of neuroevolution: the process is very expensive in terms of computational time. To lessen this issue, we explore the performance of topology transfer learning: whether the best topology obtained using neuroevolution for a certain domain can be successfully applied to a different domain. By doing so, the expensive process of neuroevolution can be reused to tackle different problems, turning it into a more appealing approach for optimizing the design of neural network topologies. After evaluating our proposal, results show that both the use of neuroevolved committees and the application of topology transfer learning are successful: committees of convolutional neural networks improve classification results compared to single models, and topologies learned for one problem can be reused on a different problem and dataset with good performance. Additionally, both approaches can be combined by building committees of transferred topologies, and this combination attains the best of both.
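At prediction time, the committee idea reduces to an ensemble vote over the best evolved models. A minimal soft-voting sketch follows; the probability-averaging fusion rule is our assumption, and the paper's committee may weight its members differently.

```python
# Minimal soft-voting committee over the best evolved models (illustrative;
# the paper's exact fusion rule may differ).
import torch

@torch.no_grad()
def committee_predict(models, x: torch.Tensor) -> torch.Tensor:
    """models: best CNNs kept from the evolved population; x: (N, C, H, W)."""
    probs = torch.stack([torch.softmax(m(x), dim=1) for m in models])
    return probs.mean(0).argmax(1)    # average class probabilities, then decide
```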


Author(s):  
Ankita Singh ◽  
◽  
Pawan Singh

The classification of images is a central topic in artificial vision systems and has drawn a notable amount of interest over the past years. The field aims to classify an input image based on its visual content. Traditionally, most approaches relied on hand-crafted features to describe an image in a particular way; learnable classifiers, such as random forests and decision trees, were then applied to the extracted features to reach a final decision. The problem arises when large numbers of images are involved: it becomes too difficult to engineer discriminative features for them. This is one of the reasons the deep neural network model was introduced. Owing to deep learning, it becomes feasible to represent the hierarchical nature of features using multiple layers and their associated weights. Existing image classification methods have gradually been applied to real-world problems, but various issues arise in the process, such as unsatisfactory results, extremely low classification accuracy, and weak adaptive ability. Models built on deep learning have robust learning ability: they combine feature extraction and classification into a single whole that completes the image classification task, which can improve image classification accuracy effectively. Convolutional neural networks are a powerful deep neural network technique. These networks preserve the spatial structure of a problem and were built for object recognition tasks such as classifying an image into its respective class. Neural networks are widely known because they achieve state-of-the-art results on complex computer vision and natural language processing tasks, and convolutional neural networks in particular have been used extensively.
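The contrast drawn above, learned hierarchical features versus hand-crafted ones, is easiest to see in code: in a CNN the convolutional layers act as the feature extractor and the final linear layer as the classifier, trained jointly. The architecture below is an illustrative example, not one taken from the text.

```python
# Illustrative small CNN: convolutional layers learn the features, a linear
# head does the classification, and both are trained jointly end to end.
import torch.nn as nn

small_cnn = nn.Sequential(
    nn.Conv2d(3, 32, kernel_size=3, padding=1), nn.ReLU(),   # learned features
    nn.MaxPool2d(2),                                         # spatial downsampling
    nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.ReLU(),
    nn.MaxPool2d(2),
    nn.Flatten(),
    nn.Linear(64 * 8 * 8, 10),    # classifier head for 32x32 inputs, 10 classes
)
```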


CivilEng ◽  
2021 ◽  
Vol 2 (4) ◽  
pp. 1052-1064
Author(s):  
Ammar Alzarrad ◽  
Chance Emanuels ◽  
Mohammad Imtiaz ◽  
Haseeb Akbar

Solar panel location assessment is usually a time-consuming manual process, and many criteria should be taken into consideration before a decision is made. One of the most significant criteria is the building's location and surrounding environment. This research project proposes a model to automatically identify potential roof spaces for solar panels using drones and convolutional neural networks (CNNs). CNNs are used to identify building roofs from drone imagery, and transfer learning on the CNN is used to classify roofs into two categories, shaded and unshaded. The CNN is trained and tested on separate imagery databases to improve classification accuracy. Results of the current project demonstrate successful segmentation of buildings and identification of shaded roofs. The model presented in this paper can be used to prioritize buildings based on how likely they are to benefit from switching to solar energy. To illustrate an implementation of the presented model, it was applied to a selected neighborhood in the city of Hurricane, West Virginia. The results show that the proposed model can assist investors in the energy and building sectors in making better and more informed decisions.
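The two-stage design described here (segment roofs, then classify each roof as shaded or unshaded) could be wired together roughly as below. All interfaces are assumptions: `mask_to_boxes` is a hypothetical helper that extracts bounding boxes from a binary mask, and the 0.5 mask threshold and 224x224 crop size are illustrative choices.

```python
# Hypothetical two-stage pipeline: segment roofs in a drone frame, then
# classify each roof crop as shaded/unshaded. Interfaces are assumptions.
import torch

@torch.no_grad()
def assess_image(seg_model, cls_model, image: torch.Tensor):
    """image: (1, 3, H, W) drone frame; returns a list of (box, label) pairs."""
    mask = torch.sigmoid(seg_model(image))[0, 0] > 0.5    # binary roof mask
    results = []
    for box in mask_to_boxes(mask):                       # hypothetical helper
        x0, y0, x1, y1 = box
        crop = image[:, :, y0:y1, x0:x1]
        crop = torch.nn.functional.interpolate(crop, size=(224, 224))
        label = cls_model(crop).argmax(1).item()          # 0 = unshaded, 1 = shaded
        results.append((box, "shaded" if label else "unshaded"))
    return results
```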


2020 ◽  
Vol 10 (3) ◽  
pp. 955
Author(s):  
Taejun Kim ◽  
Han-joon Kim

Researchers frequently use visualizations such as scatter plots when trying to understand how random variables are related to each other, because a single image represents numerous pieces of information. Dependency measures have been widely used to automatically detect dependencies, but these measures capture only a few properties of a dependency, such as its strength and direction. Based on advances in the application of deep learning to vision, we believe that convolutional neural networks (CNNs) can come to understand dependencies by analyzing visualizations, as humans do. In this paper, we propose a method that uses CNNs to extract dependency representations from 2D histograms. We carried out three kinds of experiments and found that CNNs can learn from visual representations. In the first experiment, we used a synthetic dataset to show that CNNs can perfectly classify eight types of dependency. Then, we showed that CNNs can predict correlations from 2D histograms of real datasets, and we visualized the learned dependency representation space. Finally, we applied our method and demonstrated that it performs better than the AutoLearn feature generation algorithm in terms of average classification accuracy, while generating half as many features.
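The input representation is simple to reproduce: render each variable pair as a fixed-size 2D histogram and feed it to a small CNN as a one-channel image. The sketch below shows the histogram step with NumPy; the 32-bin grid and the max-normalization are our assumptions, not the paper's stated settings.

```python
# Sketch of the input representation: a fixed-size 2D histogram per variable
# pair, normalized and shaped as a 1-channel "image" for a CNN. The 32-bin
# grid and max-normalization are illustrative assumptions.
import numpy as np
import torch

def pair_to_histogram(x: np.ndarray, y: np.ndarray, bins: int = 32) -> torch.Tensor:
    hist, _, _ = np.histogram2d(x, y, bins=bins)
    hist = hist / max(hist.max(), 1.0)                  # scale counts into [0, 1]
    return torch.from_numpy(hist).float()[None, None]   # (1, 1, bins, bins)

# Example: a noisy linear dependency rendered as a histogram "image".
rng = np.random.default_rng(0)
x = rng.normal(size=5000)
y = 0.8 * x + rng.normal(scale=0.5, size=5000)
img = pair_to_histogram(x, y)
print(img.shape)    # torch.Size([1, 1, 32, 32]) -- ready for a small CNN
```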

