Small facial image dataset augmentation using conditional GANs based on incomplete edge feature input

2021, Vol 7, pp. e760
Author(s): Shih-Kai Hung, John Q. Gan

Image data collection and labelling are costly or difficult in many real applications. Generating diverse and controllable images with conditional generative adversarial networks (GANs) for data augmentation from a small dataset is promising but challenging, as deep convolutional neural networks generally need a large training dataset to achieve reasonable performance. However, unlabelled and incomplete features (e.g., broken edges, simplified lines, hand-drawn sketches, discontinuous geometric shapes) can be conveniently obtained by pre-processing the training images and can be used for image data augmentation. This paper proposes a conditional GAN framework for facial image augmentation that uses a very small training dataset and incomplete or modified edge features as conditional input for diversity. The proposed method defines a new domain or space for refining interim images, which prevents the overfitting caused by a very small training dataset and enhances tolerance of the distortions caused by incomplete edge features, effectively improving the quality and diversity of facial image augmentation. Experimental results show that the proposed method can generate high-quality images of good diversity when the GANs are trained using very sparse edges and a small number of training samples. Compared to state-of-the-art edge-to-image translation methods that directly convert sparse edges to images, the proposed conditional GAN framework, when trained on a small dataset, generates facial images with desirable diversity and acceptable distortions for dataset augmentation. It significantly outperforms the existing methods in the quality of synthesised images, as measured by Fréchet Inception Distance (FID) and Kernel Inception Distance (KID) scores.
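The conditional input here is an edge map deliberately left incomplete. As a rough sketch of the idea (not the authors' actual pre-processing, which is not detailed in the abstract), one can derive a binary edge map from intensity gradients and then randomly drop edge pixels to simulate sparse, discontinuous input:

```python
import random

def edge_map(img, thresh=0.2):
    """Binary edge map from horizontal/vertical intensity differences."""
    h, w = len(img), len(img[0])
    edges = [[0] * w for _ in range(h)]
    for y in range(h - 1):
        for x in range(w - 1):
            gx = abs(img[y][x + 1] - img[y][x])  # horizontal gradient
            gy = abs(img[y + 1][x] - img[y][x])  # vertical gradient
            if gx + gy > thresh:
                edges[y][x] = 1
    return edges

def degrade_edges(edges, keep_prob=0.5, rng=None):
    """Randomly drop edge pixels to simulate incomplete/sparse edge input."""
    rng = rng or random.Random(0)
    return [[p if p and rng.random() < keep_prob else 0 for p in row]
            for row in edges]
```

Each degraded map is a distinct conditional input, so a single face can seed many augmentation targets.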

2021, Vol 11 (9), pp. 842
Author(s): Shruti Atul Mali, Abdalla Ibrahim, Henry C. Woodruff, Vincent Andrearczyk, Henning Müller, et al.

Radiomics converts medical images into mineable data via high-throughput extraction of quantitative features used for clinical decision support. However, radiomic features are susceptible to variation across scanners, acquisition protocols, and reconstruction settings, and various investigations have assessed their reproducibility and validation across these discrepancies. In this narrative review, we combine systematic keyword searches with prior domain knowledge to discuss harmonization solutions that make radiomic features more reproducible across scanners and protocol settings. The harmonization solutions are divided into two main categories: image domain and feature domain. The image-domain category comprises methods such as standardization of image acquisition, post-processing of raw sensor-level image data, data augmentation techniques, and style transfer. The feature-domain category consists of methods such as identification of reproducible features and normalization techniques, including statistical normalization, intensity harmonization, ComBat and its derivatives, and normalization using deep learning. We also reflect on the importance of deep learning solutions for addressing variability across multi-centric radiomic studies, especially those using generative adversarial networks (GANs), neural style transfer (NST) techniques, or a combination of both. Compared with previous reviews, we cover a broader range of methods and describe GAN and NST methods in more detail.
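ComBat-style feature-domain harmonization is, at its core, a location-scale adjustment per scanner. A minimal sketch of that idea (omitting ComBat's empirical Bayes shrinkage and biological covariate terms, so this is an illustration rather than the full method):

```python
from statistics import mean, stdev

def harmonize(features_by_scanner):
    """Simplified location-scale harmonization of one radiomic feature:
    remove each scanner's mean/std, then restore the pooled mean/std.
    Real ComBat additionally shrinks the per-scanner estimates via
    empirical Bayes and preserves covariates of interest."""
    pooled = [v for vals in features_by_scanner.values() for v in vals]
    gm, gs = mean(pooled), stdev(pooled)
    out = {}
    for scanner, vals in features_by_scanner.items():
        m, s = mean(vals), stdev(vals)
        out[scanner] = [gm + gs * (v - m) / s for v in vals]
    return out
```

After harmonization, each scanner's feature distribution shares the pooled location, which is the effect the feature-domain methods above aim for.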


2021, Vol 15
Author(s): Xinglong Wu, Yuhang Tao, Guangzhi He, Dun Liu, Meiling Fan, et al.

Deep convolutional neural networks (DCNNs) are widely utilized for the semantic segmentation of dense nerve tissues in light and electron microscopy (EM) image data, with the goal of efficient and accurate three-dimensional reconstruction of the vasculature and neural networks in the brain. The success of these tasks depends heavily on the amount, and especially the quality, of the human-annotated labels fed into DCNNs. However, gold-standard human-annotated labels are often difficult to acquire for dense nerve tissues; human annotations inevitably contain discrepancies or even errors, which substantially impact the performance of DCNNs. Thus, a novel boosting framework was proposed to systematically improve the quality of the annotated labels and thereby the segmentation performance. It consists of a DCNN for multilabel semantic segmentation with a customized Dice-logarithmic loss function, a fusion module combining the annotated labels with the corresponding DCNN predictions, and a boosting algorithm that sequentially updates the sample weights during network training iterations. The micro-optical sectioning tomography (MOST) dataset was then employed to assess the effectiveness of the proposed framework. The results indicated that the framework, even when trained with a dataset including some poor-quality human-annotated labels, achieved state-of-the-art performance in the segmentation of somata and vessels in the mouse brain. This artificial intelligence technique could thus advance neuroscience research.
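The abstract does not spell out the customized Dice-logarithmic loss; a common logarithmic variant of the soft Dice loss, shown here purely for illustration, takes the negative log of the Dice coefficient so that low-overlap predictions are penalized more steeply than under the linear 1 − Dice form:

```python
import math

def dice(pred, target, eps=1e-7):
    """Soft Dice coefficient for a flattened binary/soft mask."""
    inter = sum(p * t for p, t in zip(pred, target))
    return (2.0 * inter + eps) / (sum(pred) + sum(target) + eps)

def log_dice_loss(pred, target):
    """Logarithmic Dice loss: -log(Dice). Near-perfect overlap gives
    loss ~ 0; disjoint masks give a very large loss."""
    return -math.log(dice(pred, target))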


2021, Vol 7 (2), pp. 755-758
Author(s): Daniel Wulff, Mohamad Mehdi, Floris Ernst, Jannis Hagenah

Data augmentation is a common method to make deep learning accessible on limited datasets. However, classical image augmentation methods produce highly unrealistic images on ultrasound data. Another approach is to use learning-based augmentation methods, e.g. based on variational autoencoders or generative adversarial networks. However, a large amount of data is necessary to train these models, which is typically not available in exactly the scenarios where data augmentation is needed. One solution to this problem could be the transfer of augmentation models between different medical imaging datasets. In this work, we present a qualitative study of the cross-dataset generalization performance of different learning-based augmentation methods for ultrasound image data. We show that knowledge transfer is possible in ultrasound image augmentation and that the augmentation partially results in semantically meaningful transfer of structures, e.g. vessels, across domains.


2020, Vol 13 (6), pp. 349-363
Author(s): I Darma, Nanik Suciati, Daniel Siahaan, et al.

Preserving Balinese carving data is a challenge for Balinese carving recognition. Balinese carvings are a cultural heritage found in traditional buildings in Bali, and collecting Balinese carving images from public photographs can help preserve this heritage. However, poor photograph quality, e.g. skewed shots, can degrade recognition results. Previous research on Balinese carving recognition exists but only recognizes predetermined images. We propose Neural Style Geometric Transformation (NSGT), a data augmentation technique for Balinese carving recognition that combines neural style transfer with geometric transformations as a small-dataset solution. The method provides variation in color, lighting, rotation, rescaling, zoom, and training dataset size to improve recognition performance. We use MobileNet as the feature extractor because its small number of parameters makes it suitable for mobile devices. Eight scenarios based on image styles and geometric transformations were tested to find the best configuration. The results show that the proposed method can improve accuracy by up to 16.2%.
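The geometric-transformation half of NSGT can be illustrated with elementary operations. The sketch below (a horizontal flip and a translation on a nested-list image, an assumption rather than the paper's implementation) shows how a single carving photo yields several training variants; the neural-style half would additionally vary color and lighting:

```python
def hflip(img):
    """Mirror the image left-right."""
    return [list(reversed(row)) for row in img]

def translate(img, dx, fill=0):
    """Shift all columns right by dx pixels, padding the left with `fill`."""
    w = len(img[0])
    return [([fill] * dx + row)[:w] for row in img]

def augment(img):
    """Tiny geometric-transformation pool: the original plus a flip and
    two shifts. NSGT would also add rotation, rescaling, and zoom."""
    return [img, hflip(img), translate(img, 1), translate(img, 2)]
```

One input image thus becomes four training samples before any style transfer is applied.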


Life, 2021, Vol 11 (10), pp. 1013
Author(s): Xue Zhou, Xin Zhu, Keijiro Nakamura, Mahito Noro

The electrocardiogram (ECG) is widely used for cardiovascular disease diagnosis and daily health monitoring. Before ECG analysis, ECG quality screening is an essential but time-consuming and experience-dependent task for technicians, and an automatic ECG quality assessment method can reduce unnecessary time loss and help cardiologists perform diagnosis. This study aims to develop an automatic quality assessment system to screen for ECGs qualified for interpretation. The proposed system consists of a data augmentation part and a quality assessment part. For data augmentation, we train a conditional generative adversarial network (GAN) to obtain an ECG segment generator and thus increase the amount of training data. We then pre-train a deep quality assessment model on a training dataset composed of real and generated ECGs. Finally, we fine-tune the model using real ECGs and validate it on two different datasets of real ECGs. The proposed system generalizes well across the two validation datasets, with accuracies of 97.1% and 96.4%, respectively. It outperforms a shallow neural network model as well as deep neural network models that were not pre-trained on generated ECGs. The proposed system demonstrates improved performance in ECG quality assessment and has the potential to serve as an initial ECG quality screening tool in clinical practice.
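The pre-train/fine-tune workflow can be sketched schematically. The toy single-weight model and `lms_step` below are stand-ins for the paper's deep network, included only to make the two-stage data flow concrete:

```python
def lms_step(model, x, y, lr=0.1):
    """One least-mean-squares update on a scalar weight
    (a stand-in for one gradient step of a real network)."""
    model["w"] += lr * (y - model["w"] * x) * x

def two_stage_training(real, generated, step=lms_step):
    """Stage 1: pre-train on real + cGAN-generated segments.
    Stage 2: fine-tune on real segments only."""
    model = {"w": 0.0}
    for x, y in real + generated:   # pre-training on the mixed set
        step(model, x, y)
    for x, y in real:               # fine-tuning on real ECGs only
        step(model, x, y)
    return model
```

The generated data enlarges the pre-training set; fine-tuning then anchors the model to the real distribution.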


2021
Author(s): Vignesh Sampath, Iñaki Maurtua, Juan José Aguilar Martín, Aitor Gutierrez

Any computer vision application development starts with acquiring images and data, followed by preprocessing and pattern recognition steps to perform a task. When the acquired images are highly imbalanced and not adequate, the desired task may not be achievable. Unfortunately, imbalance in acquired image datasets is inevitable in certain complex real-world problems such as anomaly detection, emotion recognition, medical image analysis, fraud detection, metallic surface defect detection, and disaster prediction. The performance of computer vision algorithms can deteriorate significantly when the training dataset is imbalanced. In recent years, Generative Adversarial Networks (GANs) have gained immense attention from researchers across a variety of application domains due to their capability to model complex real-world image data. Notably, GANs can not only generate synthetic images; their adversarial learning idea has also shown good potential for restoring balance in imbalanced datasets. In this paper, we examine the most recent developments in GAN-based techniques for addressing imbalance problems in image data. The real-world challenges and implementations of GAN-based synthetic image generation are covered extensively in this survey. We first introduce the various imbalance problems in computer vision tasks and their existing solutions, and then examine key concepts such as deep generative image models and GANs. We then propose a taxonomy that summarizes GAN-based techniques for addressing imbalance problems in computer vision into three major categories: (1) image-level imbalances in classification, (2) object-level imbalances in object detection, and (3) pixel-level imbalances in segmentation tasks. We elaborate on the imbalance problems of each group and provide GAN-based solutions for each. Readers will understand how GAN-based techniques can handle imbalance problems and boost the performance of computer vision algorithms.
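A basic use of GANs against image-level imbalance is to generate enough minority-class samples to match the majority class. A sketch of the per-class quota computation (illustrative, not taken from the survey itself):

```python
def synthetic_quota(class_counts):
    """How many GAN-generated samples each class needs so that every
    class reaches the majority-class count."""
    target = max(class_counts.values())
    return {cls: target - n for cls, n in class_counts.items()}
```

For example, with 100 "cat" and 30 "dog" images, the generator would be asked for 70 synthetic "dog" samples and none for "cat".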


Author(s): Bo Wang, Chengeng Huang, Yuhua Guo, Jiahui Tao

Radiation information is essential to land cover classification, but general deep convolutional neural networks (DCNNs) rarely exploit it. Additionally, the limited amount of available remote sensing data restricts the efficiency of DCNN models, though this can be mitigated by data augmentation. However, conventional data augmentation methods, which only involve operations such as rotation and translation, have little effect on radiation information and ignore the rich information contained in the image data. In this article, the authors propose a feasible feature-based data augmentation method that, prior to augmentation, extracts spectral features reflecting radiation information as well as geometric and texture features reflecting image structure. Through this feature extraction, the method indirectly enhances radiation information and increases the utilization of image information. Classification accuracy improves from 80.20% to 89.20%, which verifies the effectiveness of the method.
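As one example of a spectral feature that reflects radiation information, the NDVI (an assumption here; the article does not name its specific features) can be computed per pixel from the near-infrared and red bands before augmentation:

```python
def ndvi(nir, red, eps=1e-9):
    """Normalized Difference Vegetation Index, (NIR - Red) / (NIR + Red),
    computed per pixel over nested-list band images. A spectral feature
    of this kind carries radiation information that rotation/translation
    augmentation alone leaves untouched."""
    return [[(n - r) / (n + r + eps) for n, r in zip(nrow, rrow)]
            for nrow, rrow in zip(nir, red)]
```

The resulting feature map can then be augmented alongside the raw bands, so the geometric transforms also diversify the radiation-derived channel.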


2021, Vol 17 (4), pp. 155014772110074
Author(s): Jingyao Zhang, Yuan Rao, Chao Man, Zhaohui Jiang, Shaowen Li

Due to the complex environments of real fields, it is challenging to identify and diagnose plant leaf diseases directly from in-situ images captured by agricultural Internet of Things systems. To overcome this shortcoming, an approach based on a small sample size and a deep convolutional neural network was proposed for recognizing cucumber leaf diseases under field conditions. A two-stage segmentation method was presented to acquire lesion images by extracting the disease spots from cucumber leaves. After rotation and translation, the lesion images were fed into activation reconstruction generative adversarial networks for data augmentation, generating new training samples. Finally, to improve identification accuracy, we proposed a dilated and inception convolutional neural network trained on the generated samples. Experimental results showed that the proposed approach achieved average identification accuracies of 96.11% and 90.67% on datasets of lesion images and raw field diseased-leaf images, respectively, covering three diseases (anthracnose, downy mildew, and powdery mildew), significantly outperforming existing counterparts. This indicates good potential for field applications of the agricultural Internet of Things.
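The spot-extraction stage can be caricatured as thresholding a per-pixel disease score and cropping the bounding box of the pixels that exceed it. This sketch is illustrative only and not the paper's actual segmentation method (stage 1, segmenting the leaf from the background, is omitted):

```python
def extract_spots(leaf, thresh=0.5):
    """Crop the bounding box of pixels whose 'disease score' exceeds
    `thresh`; returns [] when no spot pixels are found."""
    coords = [(y, x) for y, row in enumerate(leaf)
              for x, v in enumerate(row) if v > thresh]
    if not coords:
        return []
    ys, xs = zip(*coords)
    return [row[min(xs):max(xs) + 1] for row in leaf[min(ys):max(ys) + 1]]
```

The cropped lesion patches would then go through rotation/translation and GAN-based augmentation before training.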

