Segmentation Neural Network Incorporating Scale-Space in the Application of Cardiac MRI

In this work, we present an image segmentation algorithm based on the convolutional neural network framework where the scale space theory is incorporated in the course of training procedure. The construction of data augmentation is designed to apply the scale space to the training data in order to effectively deal with the variability of regions of interest in geometry and appearance such as shape and contrast. The proposed data augmentation algorithm via scale space is aimed to improve invariant features with respect to both geometry and appearance by taking into consideration of their diffusion process. We develop a segmentation algorithm based on the convolutional neural network framework where the network architecture consists of encoding and decoding substructures in combination with the data augmentation scheme via the scale space induced by the heat equation. The quantitative analysis using the cardiac MRI dataset indicates that the proposed algorithm achieves better accuracy in the delineation of the left ventricles, which demonstrates the potential of the algorithm in the application of the whole heart segmentation as a compute-aided diagnosis system for the cardiac diseases.

Download Full-text

Robust Approach to Supervised Deep Neural Network Training for Real-Time Object Classification in Cluttered Indoor Environment

Applied Sciences ◽

10.3390/app11157148 ◽

2021 ◽

Vol 11 (15) ◽

pp. 7148

Author(s):

Bedada Endale ◽

Abera Tullu ◽

Hayoung Shi ◽

Beom-Soo Kang

Keyword(s):

Neural Network ◽

Deep Learning ◽

Real Time ◽

Network Architecture ◽

Input Data ◽

Deep Neural Network ◽

Data Augmentation ◽

Object Classification ◽

Training Data ◽

Gradient Descent Algorithm

Unmanned aerial vehicles (UAVs) are being widely utilized for various missions: in both civilian and military sectors. Many of these missions demand UAVs to acquire artificial intelligence about the environments they are navigating in. This perception can be realized by training a computing machine to classify objects in the environment. One of the well known machine training approaches is supervised deep learning, which enables a machine to classify objects. However, supervised deep learning comes with huge sacrifice in terms of time and computational resources. Collecting big input data, pre-training processes, such as labeling training data, and the need for a high performance computer for training are some of the challenges that supervised deep learning poses. To address these setbacks, this study proposes mission specific input data augmentation techniques and the design of light-weight deep neural network architecture that is capable of real-time object classification. Semi-direct visual odometry (SVO) data of augmented images are used to train the network for object classification. Ten classes of 10,000 different images in each class were used as input data where 80% were for training the network and the remaining 20% were used for network validation. For the optimization of the designed deep neural network, a sequential gradient descent algorithm was implemented. This algorithm has the advantage of handling redundancy in the data more efficiently than other algorithms.

Download Full-text

Oversampling Based on Data Augmentation in Convolutional Neural Network for Silicon Wafer Defect Classification

Knowledge Innovation Through Intelligent Software Methodologies, Tools and Techniques - Frontiers in Artificial Intelligence and Applications ◽

10.3233/faia200547 ◽

2020 ◽

Author(s):

Uzma Batool ◽

Mohd Ibrahim Shapiai ◽

Nordinah Ismail ◽

Hilman Fauzi ◽

Syahrizal Salleh

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Silicon Wafer ◽

Data Augmentation ◽

Imbalanced Data ◽

Training Data ◽

Defect Classification ◽

Learning Method ◽

Test Set

Silicon wafer defect data collected from fabrication facilities is intrinsically imbalanced because of the variable frequencies of defect types. Frequently occurring types will have more influence on the classification predictions if a model gets trained on such skewed data. A fair classifier for such imbalanced data requires a mechanism to deal with type imbalance in order to avoid biased results. This study has proposed a convolutional neural network for wafer map defect classification, employing oversampling as an imbalance addressing technique. To have an equal participation of all classes in the classifier’s training, data augmentation has been employed, generating more samples in minor classes. The proposed deep learning method has been evaluated on a real wafer map defect dataset and its classification results on the test set returned a 97.91% accuracy. The results were compared with another deep learning based auto-encoder model demonstrating the proposed method, a potential approach for silicon wafer defect classification that needs to be investigated further for its robustness.

Download Full-text

Batik pattern recognition using convolutional neural network

Bulletin of Electrical Engineering and Informatics ◽

10.11591/eei.v9i4.2385 ◽

2020 ◽

Vol 9 (4) ◽

pp. 1430-1437

Author(s):

Mohammad Arif Rasyidi ◽

Taufiqotul Bariyah

Keyword(s):

Neural Network ◽

Pattern Recognition ◽

Convolutional Neural Network ◽

Test Data ◽

Network Architecture ◽

Experimental Results ◽

Training Data ◽

Excellent Performance ◽

Artistic Value

Batik is one of Indonesia's cultures that is well-known worldwide. Batik is a fabric that is painted using canting and liquid wax so that it forms patterns of high artistic value. In this study, we applied the convolutional neural network (CNN) to identify six batik patterns, namely Banji, Ceplok, Kawung, Mega Mendung, Parang, and Sekar Jagad. 994 images from the 6 categories were collected and then divided into training and test data with a ratio of 8:2. Image augmentation was also done to provide variations in training data as well as to prevent overfitting. Experimental results on the test data showed that CNN produced an excellent performance as indicated by accuracy of 94% and top-2 accuracy of 99% which was obtained using the DenseNet network architecture.

Download Full-text

Convolutional Neural Network-Based Discriminator for Outlier Detection

Computational Intelligence and Neuroscience ◽

10.1155/2021/8811147 ◽

2021 ◽

Vol 2021 ◽

pp. 1-13

Author(s):

Fahad Alharbi ◽

Khalil El Hindi ◽

Saad Al Ahmadi ◽

Hussien Alsalamn

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Systematic Approach ◽

Data Augmentation ◽

Random Noise ◽

Training Data ◽

Human Errors ◽

Machine Learning Methods ◽

Benchmark Datasets ◽

Trusted Data

Noise in training data increases the tendency of many machine learning methods to overfit the training data, which undermines the performance. Outliers occur in big data as a result of various factors, including human errors. In this work, we present a novel discriminator model for the identification of outliers in the training data. We propose a systematic approach for creating training datasets to train the discriminator based on a small number of genuine instances (trusted data). The noise discriminator is a convolutional neural network (CNN). We evaluate the discriminator’s performance using several benchmark datasets and with different noise ratios. We inserted random noise in each dataset and trained discriminators to clean them. Different discriminators were trained using different numbers of genuine instances with and without data augmentation. We compare the performance of the proposed noise-discriminator method with seven other methods proposed in the literature using several benchmark datasets. Our empirical results indicate that the proposed method is very competitive to the other methods. It actually outperforms them for pair noise.

Download Full-text

Robust Convolutional Neural Network Cascade for Facial Landmark Localization Exploiting Training Data Augmentation

Electronic Imaging ◽

10.2352/issn.2470-1173.2018.10.imawm-374 ◽

2018 ◽

Vol 2018 (10) ◽

pp. 374-1-374-5 ◽

Cited By ~ 2

Author(s):

Ruiyi Mao ◽

Qian Lin ◽

Jan P. Allebach

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Data Augmentation ◽

Training Data ◽

Facial Landmark Localization ◽

Facial Landmark ◽

Landmark Localization

Download Full-text

Chaotic Harris Hawks Optimization with Quasi-Reflection-Based Learning: An Application to Enhance CNN Design

Sensors ◽

10.3390/s21196654 ◽

2021 ◽

Vol 21 (19) ◽

pp. 6654

Author(s):

Jameer Basha ◽

Nebojsa Bacanin ◽

Nikola Vukobrat ◽

Miodrag Zivkovic ◽

K. Venkatachalam ◽

...

Keyword(s):

Neural Network ◽

Brain Tumor ◽

Convolutional Neural Network ◽

Swarm Intelligence ◽

Network Architecture ◽

Data Augmentation ◽

Superior Performance ◽

Basic Algorithm ◽

Experimental Conditions ◽

Neural Network Approach

The research presented in this manuscript proposes a novel Harris Hawks optimization algorithm with practical application for evolving convolutional neural network architecture to classify various grades of brain tumor using magnetic resonance imaging. The proposed improved Harris Hawks optimization method, which belongs to the group of swarm intelligence metaheuristics, further improves the exploration and exploitation abilities of the basic algorithm by incorporating a chaotic population initialization and local search, along with a replacement strategy based on the quasi-reflection-based learning procedure. The proposed method was first evaluated on 10 recent CEC2019 benchmarks and the achieved results are compared with the ones generated by the basic algorithm, as well as with results of other state-of-the-art approaches that were tested under the same experimental conditions. In subsequent empirical research, the proposed method was adapted and applied for a practical challenge of convolutional neural network design. The evolved network structures were validated against two datasets that contain images of a healthy brain and brain with tumors. The first dataset comprises well-known IXI and cancer imagining archive images, while the second dataset consists of axial T1-weighted brain tumor images, as proposed in one recently published study in the Q1 journal. After performing data augmentation, the first dataset encompasses 8.000 healthy and 8.000 brain tumor images with grades I, II, III, and IV and the second dataset includes 4.908 images with Glioma, Meningioma, and Pituitary, with 1.636 images belonging to each tumor class. The swarm intelligence-driven convolutional neural network approach was evaluated and compared to other, similar methods and achieved a superior performance. The obtained accuracy was over 95% in all conducted experiments. Based on the established results, it is reasonable to conclude that the proposed approach could be used to develop networks that can assist doctors in diagnostics and help in the early detection of brain tumors.

Download Full-text

Plastic Gasket Defect Detection Based on Transfer Learning

Scientific Programming ◽

10.1155/2021/5990020 ◽

2021 ◽

Vol 2021 ◽

pp. 1-11

Author(s):

Xieyi Chen ◽

Dongyun Wang ◽

Jinjun Shao ◽

Jun Fan

Keyword(s):

Neural Network ◽

Neural Networks ◽

Deep Learning ◽

Convolutional Neural Network ◽

Transfer Learning ◽

Defect Detection ◽

Data Augmentation ◽

Surface Defects ◽

Visual Detection ◽

Training Data

To automatically detect plastic gasket defects, a set of plastic gasket defect visual detection devices based on GoogLeNet Inception-V2 transfer learning was designed and established in this study. The GoogLeNet Inception-V2 deep convolutional neural network (DCNN) was adopted to extract and classify the defect features of plastic gaskets to solve the problem of their numerous surface defects and difficulty in extracting and classifying the features. Deep learning applications require a large amount of training data to avoid model overfitting, but there are few datasets of plastic gasket defects. To address this issue, data augmentation was applied to our dataset. Finally, the performance of the three convolutional neural networks was comprehensively compared. The results showed that the GoogLeNet Inception-V2 transfer learning model had a better performance in less time. It means it had higher accuracy, reliability, and efficiency on the dataset used in this paper.

Download Full-text

Data augmentation techniques for transfer learning improvement in drill wear classification using convolutional neural network

Machine Graphics and Vision ◽

10.22630/mgv.2019.28.1.1 ◽

2019 ◽

Vol 28 (1) ◽

pp. 3-12

Author(s):

Jarosław Kurek ◽

Joanna Aleksiejuk-Gawron ◽

Izabella Antoniuk ◽

Jarosław Górski ◽

Albina Jegorowa ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Transfer Learning ◽

Data Augmentation ◽

Good Condition ◽

Training Data ◽

Time Data ◽

Learning Improvement ◽

Original Dataset ◽

Augmentation Techniques

This paper presents an improved method for recognizing the drill state on the basis of hole images drilled in a laminated chipboard, using convolutional neural network (CNN) and data augmentation techniques. Three classes were used to describe the drill state: red -- for drill that is worn out and should be replaced, yellow -- for state in which the system should send a warning to the operator, indicating that this element should be checked manually, and green -- denoting the drill that is still in good condition, which allows for further use in the production process. The presented method combines the advantages of transfer learning and data augmentation methods to improve the accuracy of the received evaluations. In contrast to the classical deep learning methods, transfer learning requires much smaller training data sets to achieve acceptable results. At the same time, data augmentation customized for drill wear recognition makes it possible to expand the original dataset and to improve the overall accuracy. The experiments performed have confirmed the suitability of the presented approach to accurate class recognition in the given problem, even while using a small original dataset.

Download Full-text

Comparative analysis of augmented datasets performances of age invariant face recognition models

Bulletin of Electrical Engineering and Informatics ◽

10.11591/eei.v10i3.3020 ◽

2021 ◽

Vol 10 (3) ◽

pp. 1356-1367

Author(s):

Kennedy Okokpujie ◽

Etinosa Noma-Osaghae ◽

Samuel Ndueso John ◽

Charles Ndujiuba ◽

Imhade Princess Okokpujie

Keyword(s):

Neural Network ◽

Face Recognition ◽

Convolutional Neural Network ◽

Network Architecture ◽

Data Augmentation ◽

Absolute Error ◽

Extraction Process ◽

Deep Convolutional Neural Network ◽

Non Invasive ◽

Invasive Method

The popularity of face recognition systems has increased due to their non-invasive method of image acquisition, thus boasting the widespread applications. Face ageing is one major factor that influences the performance of face recognition algorithms. In this study, the authors present a comparative study of the two most accepted and experimented face ageing datasets (FG-Net and morph II). These datasets were used to simulate age invariant face recognition (AIFR) models. Four types of noises were added to the two face ageing datasets at the preprocessing stage. The addition of noise at the preprocessing stage served as a data augmentation technique that increased the number of sample images available for deep convolutional neural network (DCNN) experimentation, improved the proposed AIFR model and the trait aging features extraction process. The proposed AIFR models are developed with the pre-trained Inception-ResNet-v2 deep convolutional neural network architecture. On testing and comparing the models, the results revealed that FG-Net is more efficient over Morph with an accuracy of 0.15%, loss function of 71%, mean square error (MSE) of 39% and mean absolute error (MAE) of -0.63%.

Download Full-text

Neural Network based Whole Heart Segmentation from 3D CT images

International journal of electrical and computer engineering systems ◽

10.32985/ijeces.11.1.3 ◽

2020 ◽

Vol 11 (1) ◽

pp. 25-31

Author(s):

Marija Habijan ◽

Hrvoje Leventić ◽

Irena Galić ◽

Danilo Babin

Keyword(s):

Neural Network ◽

Data Augmentation ◽

Principal Component ◽

Ground Truth ◽

Ct Images ◽

Training Data ◽

Optimal Learning ◽

Learning Rates ◽

Augmentation Techniques ◽

Whole Heart

The most recent research is showing the importance and suitability of neural networks for medical image processing tasks. Nonetheless, their efficiency in segmentation tasks is greatly dependent on the amount of available training data. To overcome issues of using small datasets, various data augmentation techniques have been developed. In this paper, an approach for the whole heart segmentation based on the convolutional neural network, specifically on the 3D U-Net architecture, is presented. Also, we propose the incorporation of the principal component analysis as an additional data augmentation technique. The network is trained end-to-end, i.e., no pre-trained network is required. Evaluation of the proposed approach is performed on CT images from MICCAI 2017 Multi-Modality Whole Heart Segmentation Challenge dataset, delivering in a three-fold cross-validation an average dice coefficient overlap of 88.2% for the whole heart, i.e. all heart substructures. Final segmentation results show a high accuracy with the ground truth, indicating that the proposed approach is competitive to the state-of-the-art. Additionally, experiments on the influence of different learning rates are provided as well, showing the optimal learning rate of 0.005 to give the best segmentation results.

Download Full-text