scholarly journals Batik pattern recognition using convolutional neural network

2020 ◽  
Vol 9 (4) ◽  
pp. 1430-1437
Author(s):  
Mohammad Arif Rasyidi ◽  
Taufiqotul Bariyah

Batik is one of Indonesia's cultures that is well-known worldwide. Batik is a fabric that is painted using canting and liquid wax so that it forms patterns of high artistic value. In this study, we applied the convolutional neural network (CNN) to identify six batik patterns, namely Banji, Ceplok, Kawung, Mega Mendung, Parang, and Sekar Jagad. 994 images from the 6 categories were collected and then divided into training and test data with a ratio of 8:2. Image augmentation was also done to provide variations in training data as well as to prevent overfitting. Experimental results on the test data showed that CNN produced an excellent performance as indicated by accuracy of 94% and top-2 accuracy of 99% which was obtained using the DenseNet network architecture.

2020 ◽  
Vol 10 (7) ◽  
pp. 1494-1505
Author(s):  
Hyo-Hun Kim ◽  
Byung-Woo Hong

In this work, we present an image segmentation algorithm based on the convolutional neural network framework where the scale space theory is incorporated in the course of training procedure. The construction of data augmentation is designed to apply the scale space to the training data in order to effectively deal with the variability of regions of interest in geometry and appearance such as shape and contrast. The proposed data augmentation algorithm via scale space is aimed to improve invariant features with respect to both geometry and appearance by taking into consideration of their diffusion process. We develop a segmentation algorithm based on the convolutional neural network framework where the network architecture consists of encoding and decoding substructures in combination with the data augmentation scheme via the scale space induced by the heat equation. The quantitative analysis using the cardiac MRI dataset indicates that the proposed algorithm achieves better accuracy in the delineation of the left ventricles, which demonstrates the potential of the algorithm in the application of the whole heart segmentation as a compute-aided diagnosis system for the cardiac diseases.


2019 ◽  
Vol 8 (3) ◽  
pp. 3429-3434

The theory of control systems deals with the analysis and design of interacting components of a system in a configuration that provides the desired behavior. This paper deals with the problem of the identification of non-linear systems through Convolutional Neural Network (CNN). We propose a structure of a CNN and perform simulations with test data using unsupervised learning for the identification of nonlinear systems. Also, MLP is used to compare the results when there is noise in the training data, which allows us to see that the proposed CNN has better performance and can be used for cases where the noise is present. The proposed CNN is validated with test data. Tests are carried out with Gas oven data, comparing the proposed structure of CNN with a MLP. When there is noise in the data, CNN has better performance than MLP.


2021 ◽  
Vol 905 (1) ◽  
pp. 012018
Author(s):  
I Y Prayogi ◽  
Sandra ◽  
Y Hendrawan

Abstract The objective of this study is to classify the quality of dried clove flowers using deep learning method with Convolutional Neural Network (CNN) algorithm, and also to perform the sensitivity analysis of CNN hyperparameters to obtain best model for clove quality classification process. The quality of clove as raw material in this study was determined according to SNI 3392-1994 by PT. Perkebunan Nusantara XII Pancusari Plantation, Malang, East Java, Indonesia. In total 1,600 images of dried clove flower were divided into 4 qualities. Each clove quality has 225 training data, 75 validation data, and 100 test data. The first step of this study is to build CNN model architecture as first model. The result of that model gives 65.25% reading accuracy. The second step is to analyze CNN sensitivity or CNN hyperparameter on the first model. The best value of CNN hyperparameter in each step then to be used in the next stage. Finally, after CNN hyperparameter carried out the reading accuracy of the test data is improved to 87.75%.


2021 ◽  
Vol 2021 ◽  
pp. 1-11
Author(s):  
Guiyong Xu ◽  
Yang Xu ◽  
Sicong Zhang ◽  
Xiaoyao Xie

In the era of big data, convolutional neural network (CNN) has been widely used in the field of image classification and has achieved excellent performance. More and more researchers are beginning to combine deep neural networks with steganalysis to improve performance in recent years. However, most of the steganalysis algorithm based on the convolutional neural network has only run test against the WOW and S-UNIWARD algorithms; meanwhile, their versatility is insufficient due to long training time and the limit of image size. This paper proposes a new network architecture, called SFRNet, to solve these problems. The feature extraction and fusion layer can extract more features from the digital image. The RepVgg block is used to accelerate the inference and increase memory utilization. The SE block improves the detection accuracy rate because it can learn feature weights to make effective feature maps with significant weights and invalid or ineffective feature maps with small weights. Experimental results show that the SFRNet has achieved excellent performance in the detection accuracy rate against four state-of-the-art steganography algorithms in the spatial domain, e.g., HUGO, WOW, S-UNIWARD, and MiPOD, under different payloads. The SFRNet detection accuracy rate achieves 89.6% against S-UNIWARD algorithm with the payload of 0.4bpp and 72.5% at 0.2bpp. As the same time, the training time of our network is greatly reduced by 35% compared with Yedroudj-Net.


2021 ◽  
Vol 6 (2) ◽  
pp. 62-71
Author(s):  
Chaerul Umam ◽  
Andi Danang Krismawan ◽  
Rabei Raad Ali

Hiragana is one of the letters in Japanese. In this study, CNN (Convolutional Neural Network) method used as identication method, while he preprocessing used thresholding. Then carry out the normalization stage and the filtering stage to remove noise in the image. At the training stage use maxpooling and danse methods as a liaison in the training process, wherea in testing stage using the Adam Optimizer method. Here, we use 1000 images from 50 hiragana characters with a ratio of 950: 50, 950 as training data and 50 data as testing data. Our experiment yield accuracy in 95%.


2020 ◽  
Vol 57 (1-2) ◽  
pp. 71-77
Author(s):  
R. Ķēniņš

AbstractThe paper describes the process of training a convolutional neural network to segment land into its labelled land cover types such as grass, water, forest and buildings. This segmentation can promote automated updating of topographical maps since doing this manually is a time-consuming process, which is prone to human error. The aim of the study is to evaluate the application of U-net convolutional neural network for land cover classification using countrywide aerial data. U-net neural network architecture has initially been developed for use in biomedical image segmentation and it is one of the most widely used CNN architectures for image segmentation. Training data have been prepared using colour infrared images of Ventspils town and its digital surface model (DSM). Forest, buildings, water, roads and other land plots have been selected as classes, into which the image has been segmented. As a result, images have been segmented with an overall accuracy of 82.9 % with especially high average accuracy for the forest and water classes.


2020 ◽  
Vol 17 (4) ◽  
pp. 172988142094434
Author(s):  
Jingbo Chen ◽  
Shengyong Chen ◽  
Linjie Bian

Many pieces of information are included in the front region of a vehicle, especially in windshield and bumper regions. Thus, windshield or bumper region detection is making sense to extract useful information. But the existing windshield and bumper detection methods based on traditional artificial features are not robust enough. Those features may become invalid in many real situations (e.g. occlude, illumination change, viewpoint change.). In this article, we propose a multi-attribute-guided vehicle discriminately region detection method based on convolutional neural network and not rely on bounding box regression. We separate the net into two branches, respectively, for identification (ID) and Model attributes training. Therefore, the feature spaces of different attributes become more independent. Additionally, we embed a self-attention block into our framework to improve the performance of local region detection. We train our model on PKU_VD data set which has a huge number of images inside. Furthermore, we labeled the handcrafted bounding boxes on 5000 randomly picked testing images, and 1020 of them are used for evaluation and 3980 as the training data for YOLOv3. We use Intersection over Union for quantitative evaluation. Experiments were conducted in three different latest convolutional neural network trunks to illustrate the detection performance of the proposed method. Simultaneously, in terms of quantitative evaluation, the performance of our method is close to YOLOv3 even without handcrafted bounding boxes.


Complexity ◽  
2017 ◽  
Vol 2017 ◽  
pp. 1-12 ◽  
Author(s):  
Sujuan Hou ◽  
Jianwei Lin ◽  
Shangbo Zhou ◽  
Maoling Qin ◽  
Weikuan Jia ◽  
...  

We introduce a logo classification mechanism which combines a series of deep representations obtained by fine-tuning convolutional neural network (CNN) architectures and traditional pattern recognition algorithms. In order to evaluate the proposed mechanism, we build a middle-scale logo dataset (named Logo-405) and treat it as a benchmark for logo related research. Our experiments are carried out on both the Logo-405 dataset and the publicly available FlickrLogos-32 dataset. The experimental results demonstrate that the proposed mechanism outperforms two popular ways used for logo classification, including the strategies that integrate hand-crafted features and traditional pattern recognition algorithms and the models which employ deep CNNs.


2019 ◽  
Vol 8 (7) ◽  
pp. 300 ◽  
Author(s):  
Recep Can ◽  
Sultan Kocaman ◽  
Candan Gokceoglu

Several scientific processes benefit from Citizen Science (CitSci) and VGI (Volunteered Geographical Information) with the help of mobile and geospatial technologies. Studies on landslides can also take advantage of these approaches to a great extent. However, the quality of the collected data by both approaches is often questionable, and automated procedures to check the quality are needed for this purpose. In the present study, a convolutional neural network (CNN) architecture is proposed to validate landslide photos collected by citizens or nonexperts and integrated into a mobile- and web-based GIS environment designed specifically for a landslide CitSci project. The VGG16 has been used as the base model since it allows finetuning, and high performance could be achieved by selecting the best hyper-parameters. Although the training dataset was small, the proposed CNN architecture was found to be effective as it could identify the landslide photos with 94% precision. The accuracy of the results is sufficient for purpose and could even be improved further using a larger amount of training data, which is expected to be obtained with the help of volunteers.


2020 ◽  
Vol 2020 ◽  
pp. 1-12 ◽  
Author(s):  
Jing An ◽  
Ping Ai

In many real-world fault diagnosis applications, due to the frequent changes in working conditions, the distribution of labeled training data (source domain) is different from the distribution of the unlabeled test data (target domain), which leads to performance degradation. In order to solve this problem, an end-to-end unsupervised domain adaptation bear fault diagnosis model that combines Riemann metric correlation alignment and one-dimensional convolutional neural network (RMCA-1DCNN) is proposed in this study. Second-order statistic alignment of the specific activation layer in source and target domains is considered to be a regularization item and embedded in the deep convolutional neural network architecture to compensate for domain shift. Experimental results on the Case Western Reserve University motor bearing database demonstrate that the proposed method has strong fault-discriminative and domain-invariant capacity. Therefore, the proposed method can achieve higher diagnosis accuracy than that of other existing experimental methods.


Sign in / Sign up

Export Citation Format

Share Document