Image Classification for the Automatic Feature Extraction in Human Worn Fashion Data

Mathematics ◽  
2021 ◽  
Vol 9 (6) ◽  
pp. 624
Author(s):  
Stefan Rohrmanstorfer ◽  
Mikhail Komarov ◽  
Felix Mödritscher

With the ever-increasing amount of image data, it has become necessary to automatically search for and process the information in these images. As fashion is captured in images, the fashion sector provides an ideal foundation for a service or application built on an image classification model. In this article, the state of the art in image classification is analyzed and discussed. Based on this knowledge, four different approaches are implemented to extract features from fashion data. For this purpose, a human-worn fashion dataset of 2567 images was created and then significantly enlarged through image operations. The results show that convolutional neural networks are the undisputed standard for classifying images, and that TensorFlow is the best library to build them. Moreover, through the introduction of dropout layers, data augmentation, and transfer learning, model overfitting was successfully prevented, and the validation accuracy on the created dataset improved incrementally from an initial 69% to a final 84%. More distinctive apparel such as trousers, shoes, and hats was classified better than other upper-body clothes.
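The dataset-enlarging image operations the abstract mentions can be illustrated with a minimal, dependency-free sketch. This is a toy flip-based augmenter, not the authors' pipeline; images are represented as plain lists of pixel rows:

```python
def hflip(img):
    """Mirror a row-major grayscale image (list of rows) left-right."""
    return [row[::-1] for row in img]

def vflip(img):
    """Mirror an image top-bottom."""
    return img[::-1]

def augment(dataset):
    """Return each original image plus its horizontal and vertical flips,
    tripling the effective dataset size."""
    out = []
    for img in dataset:
        out.extend([img, hflip(img), vflip(img)])
    return out

# a toy 2x3 "image"
data = [[[1, 2, 3],
         [4, 5, 6]]]
aug = augment(data)
print(len(aug))  # 3 variants per source image
```

Real pipelines would add rotations, crops, and color jitter on tensor data, but the principle — deriving label-preserving variants to enlarge a small dataset — is the same.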

2021 ◽  
Vol 11 (15) ◽  
pp. 6721
Author(s):  
Jinyeong Wang ◽  
Sanghwan Lee

As automated surface inspection raises manufacturing productivity in smart factories, the demand for machine vision is rising. Recently, convolutional neural networks (CNNs) have demonstrated outstanding performance and solved many problems in the field of computer vision, and many machine vision systems therefore apply CNNs to surface defect inspection. In this study, we developed an effective data augmentation method for grayscale images in CNN-based machine vision with mono cameras. Our method applies to grayscale industrial images, and we demonstrated outstanding performance in both image classification and object detection tasks. The main contributions of this study are as follows: (1) we propose a data augmentation method that can be applied when training CNNs on industrial images taken with mono cameras; (2) we demonstrate that image classification and object detection performance improves when training on industrial image data augmented by the proposed method. With the proposed method, many machine-vision problems involving mono cameras can be solved effectively using CNNs.
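The abstract does not specify the augmentation itself, but a generic stand-in for grayscale (single-channel) augmentation is photometric jitter. The sketch below randomly perturbs contrast and brightness of pixel values in [0, 1]; the function name and parameters are illustrative assumptions, not the paper's method:

```python
import random

def jitter_gray(img, brightness=0.2, contrast=0.2, rng=None):
    """Randomly scale (contrast) and shift (brightness) the pixel values
    of a grayscale image held as a list of rows, clamping to [0, 1].
    A generic photometric augmentation for mono-camera images."""
    rng = rng or random.Random(0)                # deterministic default seed
    a = 1.0 + rng.uniform(-contrast, contrast)   # contrast factor
    b = rng.uniform(-brightness, brightness)     # brightness offset
    return [[min(1.0, max(0.0, a * p + b)) for p in row] for row in img]

sample = [[0.1, 0.5, 0.9]]
print(jitter_gray(sample))  # same shape, values still in [0, 1]
```

Because only one channel exists, color-space augmentations used on RGB data do not apply; intensity-level jitter of this kind is one of the few photometric options for mono cameras.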


2020 ◽  
Author(s):  
Ying Bi ◽  
Bing Xue ◽  
Mengjie Zhang

© Springer International Publishing AG, part of Springer Nature 2018. Feature extraction is an essential process for image data dimensionality reduction and classification. However, feature extraction is difficult and often requires human intervention. Genetic Programming (GP) can achieve automatic feature extraction and image classification, but the majority of existing methods extract low-level features from raw images without any image-related operations, and work on combining image-related operators/descriptors in GP for feature extraction and image classification is limited. This paper proposes a multi-layer GP approach (MLGP) to perform automatic high-level feature extraction and classification. A new program structure, a new function set including a number of image operators/descriptors and two region detectors, and a new terminal set are designed in this approach. The performance of the proposed method is examined on six data sets of varying difficulty and compared with five GP-based methods and 42 traditional image classification methods. Experimental results show that the proposed method achieves performance better than or comparable to these baseline methods. Further analysis of the example programs evolved by MLGP reveals their good interpretability and gives insight into how the method effectively extracts high-level features for image classification.
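The layered structure described — region detectors feeding image operators whose outputs become a feature vector — can be sketched as a tiny evaluator for hand-written GP-style program trees. This toy (operators, tree encoding, and node names are all illustrative assumptions, not the MLGP system) shows how such a program turns an image into features:

```python
def region(img, r0, r1, c0, c1):
    """Crop a rectangular region from a row-major grayscale image."""
    return [row[c0:c1] for row in img[r0:r1]]

def mean_op(img):
    """Image operator: mean intensity of a region."""
    vals = [p for row in img for p in row]
    return sum(vals) / len(vals)

def std_op(img):
    """Image operator: intensity standard deviation of a region."""
    vals = [p for row in img for p in row]
    m = sum(vals) / len(vals)
    return (sum((v - m) ** 2 for v in vals) / len(vals)) ** 0.5

def evaluate(program, img):
    """Evaluate a program tree: ('feat', op, region-bounds) leaves apply an
    operator to a detected region; a 'concat' root gathers the results
    into a feature vector."""
    op, *args = program
    if op == 'concat':
        return [evaluate(child, img) for child in args]
    func, r = args
    return func(region(img, *r))

img = [[0, 0, 1, 1],
       [0, 0, 1, 1]]
prog = ('concat',
        ('feat', mean_op, (0, 2, 0, 2)),   # left half: all zeros
        ('feat', mean_op, (0, 2, 2, 4)))   # right half: all ones
print(evaluate(prog, img))  # [0.0, 1.0]
```

In actual GP, such trees would be evolved by crossover and mutation against classification fitness rather than written by hand; the tree form is what gives the evolved programs their interpretability.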


2021 ◽  
Author(s):  
Sandi Baressi Šegota ◽  
Simon Lysdahlgaard ◽  
Søren Hess ◽  
Ronald Antulov

It has been shown many times that Artificial Intelligence (AI) based algorithms achieve high performance on image classification tasks. Still, certain issues exist with the application of machine learning (ML) artificial neural network (ANN) algorithms. The best known is the need for a large amount of statistically varied data, which can be addressed with expanded collection or data augmentation, but other issues are also present. Convolutional neural networks (CNNs) show extremely high performance on image-shaped data, yet they exhibit a significant issue: sensitivity to image orientation. Previous research shows that varying the orientation of images may greatly lower the performance of a trained CNN. This is especially problematic in certain applications, such as X-ray radiography, an example of which is presented here. Previous research also shows that CNN performance is higher on images in a single orientation (left or right) than on a combination of both, which means the data needs to be differentiated before it enters the classification model. In this paper, a CNN-based model for differentiation between left- and right-oriented images is presented. Multiple CNNs are trained and tested, the highest performing being the VGG16 architecture, which achieved an accuracy of 0.99 (+/- 0.01) and an AUC of 0.98 (+/- 0.01). These results show that CNNs can address the issue of orientation sensitivity by splitting the data before it is used in classification models.
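The routing step the paper describes — an orientation classifier splitting the data before the per-orientation classifiers see it — can be sketched independently of any particular network. Here the trained model is abstracted as a callable, and a trivial pixel-sum heuristic stands in for the fine-tuned VGG16 (both names and the heuristic are illustrative assumptions):

```python
def route_by_orientation(images, orientation_model):
    """Split a batch into left- and right-oriented subsets using a binary
    orientation classifier, so each downstream model sees one orientation."""
    left, right = [], []
    for img in images:
        (left if orientation_model(img) == 'left' else right).append(img)
    return left, right

def toy_model(img):
    """Stand-in classifier: calls the side with the larger pixel sum
    the orientation. A real system would use a trained CNN here."""
    w = len(img[0]) // 2
    lsum = sum(p for row in img for p in row[:w])
    rsum = sum(p for row in img for p in row[w:])
    return 'left' if lsum >= rsum else 'right'

batch = [[[9, 0]], [[0, 9]]]
left, right = route_by_orientation(batch, toy_model)
print(len(left), len(right))  # 1 1
```

The design point is that the routing logic is decoupled from the classifier, so the heuristic can be swapped for a fine-tuned CNN without touching the pipeline.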


2021 ◽  
Vol 5 (6) ◽  
pp. 1153-1160
Author(s):  
Mayanda Mega Santoni ◽  
Nurul Chamidah ◽  
Desta Sandya Prasvita ◽  
Helena Nurramdhani Irmanda ◽  
Ria Astriratma ◽  
...  

One of the efforts of the Indonesian people to defend the country is to preserve and maintain the regional languages. The current era of modernity makes regional languages seem old-fashioned, so that most of them are no longer spoken. If this is ignored, a cultural identity crisis will leave regional languages vulnerable to extinction. Technological developments can be used as a way to preserve regional languages. Digital image-based artificial intelligence technology using machine learning methods, such as machine translation, can be used to address this problem. This research uses a Deep Learning method, namely Convolutional Neural Networks (CNN). The research data were 1300 alphabetic images, 5000 text images, and 200 vocabulary items of the Minangkabau regional language. The alphabetic image data is used to build the CNN classification model, which is then used for text image recognition, the results of which are translated into the regional language. The accuracy of the CNN model is 98.97%, while the accuracy of text image recognition (OCR) is 50.72%. This low accuracy is due to segmentation failures on the letters i and j. However, translation accuracy increases to 75.78% after applying the Levenshtein Distance algorithm, which corrects text classification errors. Therefore, this research succeeded in implementing the Convolutional Neural Networks (CNN) method for identifying text in text images and the Levenshtein Distance method for translating Indonesian text into regional language text.
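The correction step is the standard Levenshtein edit distance: an OCR token is replaced by the nearest entry in the vocabulary. A minimal sketch (the sample vocabulary words are illustrative, not the paper's 200-item lexicon):

```python
def levenshtein(a, b):
    """Classic dynamic-programming edit distance: the minimum number of
    insertions, deletions, and substitutions turning a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,              # deletion
                           cur[-1] + 1,              # insertion
                           prev[j - 1] + (ca != cb)))  # substitution
        prev = cur
    return prev[-1]

def correct(word, vocabulary):
    """Replace an OCR token with its nearest vocabulary entry."""
    return min(vocabulary, key=lambda v: levenshtein(word, v))

vocab = ['makan', 'minum', 'rumah']  # illustrative sample words
print(correct('nakan', vocab))  # 'makan' (distance 1)
```

This is why the reported translation accuracy (75.78%) exceeds the raw OCR accuracy (50.72%): single-character misreads such as the i/j segmentation failures usually land within distance 1 of the intended vocabulary word.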




2016 ◽  
Vol 10 (03) ◽  
pp. 379-397 ◽  
Author(s):  
Hilal Ergun ◽  
Yusuf Caglar Akyuz ◽  
Mustafa Sert ◽  
Jianquan Liu

Visual concept recognition has been an active research field over the last decade. Reflecting this attention, deep learning architectures show great promise in various computer vision domains, including image classification, object detection, event detection, and action recognition in videos. In this study, we investigate various aspects of convolutional neural networks for visual concept recognition. We analyze recent studies and different network architectures in terms of both running time and accuracy. In our proposed visual concept recognition system, we first discuss important properties of the popular convolutional network architectures under consideration, then describe our method for feature extraction at different levels of abstraction. We present extensive empirical information along with best practices for big data practitioners, and use these best practices to propose efficient fusion mechanisms for both single and multiple network models. We present state-of-the-art results on benchmark datasets while keeping computational costs low. Our results show that these state-of-the-art results can be reached without extensive data augmentation techniques.
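Fusing descriptors taken from different abstraction levels (or different networks) is often done by normalising each descriptor and concatenating. The sketch below is a generic late-fusion pattern, an assumption standing in for the paper's (unspecified) fusion mechanisms:

```python
def l2norm(v):
    """Scale a vector to unit L2 length so no single descriptor
    dominates the fused representation by magnitude alone."""
    n = sum(x * x for x in v) ** 0.5
    return [x / n for x in v] if n else v

def late_fusion(feature_sets):
    """Concatenate L2-normalised descriptors (e.g. activations pulled
    from different CNN layers or models) into one fused vector."""
    fused = []
    for f in feature_sets:
        fused.extend(l2norm(f))
    return fused

# two toy descriptors of different scales
fused = late_fusion([[3.0, 4.0], [0.0, 5.0]])
print(fused)
```

Per-descriptor normalisation is the key design choice: without it, a high-magnitude layer (typically an early convolutional one) would swamp the contribution of later, more abstract layers.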


Energies ◽  
2020 ◽  
Vol 13 (21) ◽  
pp. 5758
Author(s):  
Xiaofeng Feng ◽  
Hengyu Hui ◽  
Ziyang Liang ◽  
Wenchong Guo ◽  
Huakun Que ◽  
...  

Electricity theft decreases electricity revenues and poses risks to the safety of power use, and detecting it has become increasingly challenging. The state-of-the-art data-driven approaches that dominate the literature mainly detect electricity theft from the correlations between different daily or weekly loads, which is relatively inadequate for extracting features from hourly or finer-grained temporal data. In view of these deficiencies, we propose a novel electricity theft detection scheme based on text convolutional neural networks (TextCNN). Specifically, we convert electricity consumption measurements over a horizon of interest into a two-dimensional time series containing intraday electricity features. With this data structure, the proposed method can accurately capture various periodic features of electricity consumption. Moreover, a data augmentation method is proposed to cope with the imbalance of electricity theft data. Extensive experimental results on realistic Chinese and Irish datasets indicate that the proposed model achieves better performance than other existing methods.
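The conversion step — turning a flat stream of meter readings into a 2-D structure with one row per day — can be sketched in a few lines. This is a minimal illustration of the data layout, assuming hourly readings; the exact horizon and resolution in the paper may differ:

```python
def to_daily_matrix(readings, per_day=24):
    """Fold a flat sequence of hourly meter readings into a 2-D
    day x hour matrix, so a 2-D convolution can see both intraday
    shape (along a row) and day-to-day periodicity (down a column).
    Trailing readings that do not fill a full day are dropped."""
    days = len(readings) // per_day
    return [readings[d * per_day:(d + 1) * per_day] for d in range(days)]

week = list(range(7 * 24))          # toy stand-in for a week of readings
m = to_daily_matrix(week)
print(len(m), len(m[0]))  # 7 24
```

Stacking days this way is what lets a convolutional kernel spanning several rows pick up weekly and daily periodicity that a 1-D view of the same sequence would only capture with very wide kernels.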


Author(s):  
Tuan Hoang ◽  
Thanh-Toan Do ◽  
Tam V. Nguyen ◽  
Ngai-Man Cheung

This paper proposes two novel techniques to train deep convolutional neural networks with low bit-width weights and activations. First, to obtain low bit-width weights, most existing methods perform quantization on the full-precision network weights. However, this results in a mismatch: gradient descent updates the full-precision weights, but not the quantized weights. To address this issue, we propose a novel method that directly updates quantized weights, with learnable quantization levels, to minimize the cost function using gradient descent. Second, to obtain low bit-width activations, existing works consider all channels equally, yet the activation quantizers can be biased toward a few channels with high variance. To address this issue, we propose a method that takes the quantization errors of individual channels into account, learning activation quantizers that minimize the quantization errors in the majority of channels. Experimental results demonstrate that our proposed method achieves state-of-the-art performance on the image classification task using the AlexNet, ResNet, and MobileNetV2 architectures on the CIFAR-100 and ImageNet datasets.
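The forward pass of such a quantizer, and the per-channel error the second technique targets, can be sketched concretely. Here the quantization levels are fixed for illustration; in the paper they are learnable parameters updated by gradient descent:

```python
def quantize(values, levels):
    """Map each value to the nearest of a small set of quantization
    levels -- the forward pass of a low-bit-width quantizer
    (3 levels here, i.e. roughly ternary weights)."""
    return [min(levels, key=lambda q: abs(q - v)) for v in values]

def channel_quant_error(channel, levels):
    """Mean squared quantization error of one channel's activations:
    the per-channel quantity the paper's activation quantizer
    accounts for, instead of treating all channels equally."""
    q = quantize(channel, levels)
    return sum((a - b) ** 2 for a, b in zip(channel, q)) / len(channel)

levels = [-1.0, 0.0, 1.0]
print(quantize([0.2, -0.9, 0.6], levels))  # [0.0, -1.0, 1.0]
```

Comparing `channel_quant_error` across channels makes the cited bias visible: a high-variance channel produces a much larger error under shared levels, which is why fitting quantizers with individual channels in mind helps the majority of channels.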

