Intraoral image generation by progressive growing of generative adversarial network and evaluation of generated image quality by dentists

AbstractDentists need experience with clinical cases to practice specialized skills. However, the need to protect patient's private information limits their ability to utilize intraoral images obtained from clinical cases. In this study, since generating realistic images could make it possible to utilize intraoral images, progressive growing of generative adversarial networks are used to generate intraoral images. A total of 35,254 intraoral images were used as training data with resolutions of 128 × 128, 256 × 256, 512 × 512, and 1024 × 1024. The results of the training datasets with and without data augmentation were compared. The Sliced Wasserstein Distance was calculated to evaluate the generated images. Next, 50 real images and 50 generated images for each resolution were randomly selected and shuffled. 12 pediatric dentists were asked to observe these images and assess whether they were real or generated. The d prime of the 1024 × 1024 images was significantly higher than that of the other resolutions. In conclusion, generated intraoral images with resolutions of 512 × 512 or lower were so realistic that the dentists could not distinguish whether they were real or generated. This implies that the generated images can be used in dental education or data augmentation for deep learning, without privacy restrictions.

Download Full-text

Intraoral Image Generation by Progressive Growing of Generative Adversarial Network and Evaluation of Generated Image Quality by Dentists

10.21203/rs.3.rs-117942/v1 ◽

2020 ◽

Author(s):

Kazuma Kokomoto ◽

Rena Okawa ◽

Kazuhiko Nakano ◽

Kazunori Nozaki

Keyword(s):

Private Information ◽

Data Augmentation ◽

Wasserstein Distance ◽

Training Data ◽

Image Generation ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Pediatric Dentists ◽

Realistic Images ◽

Clinical Cases

Abstract Dentists need experience with plenty of clinical cases to practice specialized skills. However, the need to protect patients’ private information limits the ability to utilize lots of intraoral images obtained from clinical cases. In this study, since generating realistic images could making utilizing lots of intraoral images possible, intraoral images are generated by using a progressive growing of generative adversarial network. 35,254 intraoral images were used as training data with resolutions of 128×128, 256×256, 512×512, and 1,024×1,024. The results of training datasets with and without data augmentation were compared. The sliced Wasserstein distance (SWD) was calculated to evaluate the generated images. Next, 50 real images and 50 generated images for each resolution were randomly selected and shuffled. Twelve pediatric dentists were asked to observe these images and assess whether each was real or generated. The accuracy of the assessment of the 1,024×1,024 images was significantly higher than that of the other resolutions. In conclusion, generated intraoral images with resolutions of 512×512 or lower were so realistic that the dentists could not distinguish whether they were real or generated. This implies that generated images can be used for dental education or data augmentation for deep learning free from privacy restrictions.

Download Full-text

Linear electromagnetic inverse scattering via generative adversarial networks

International Journal of Microwave and Wireless Technologies ◽

10.1017/s1759078721001331 ◽

2021 ◽

pp. 1-9

Author(s):

Huilin Zhou ◽

Huimin Zheng ◽

Qiegen Liu ◽

Jian Liu ◽

Yuhao Wang

Keyword(s):

Inverse Scattering ◽

Optimization Methods ◽

Training Data ◽

Generative Adversarial Networks ◽

Scattering Problems ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Adversarial Networks ◽

Highly Nonlinear ◽

Electromagnetic Inverse Scattering

Abstract Electromagnetic inverse-scattering problems (ISPs) are concerned with determining the properties of an unknown object using measured scattered fields. ISPs are often highly nonlinear, causing the problem to be very difficult to address. In addition, the reconstruction images of different optimization methods are distorted which leads to inaccurate reconstruction results. To alleviate these issues, we propose a new linear model solution of generative adversarial network-based (LM-GAN) inspired by generative adversarial networks (GAN). Two sub-networks are trained alternately in the adversarial framework. A linear deep iterative network as a generative network captures the spatial distribution of the data, and a discriminative network estimates the probability of a sample from the training data. Numerical results validate that LM-GAN has admirable fidelity and accuracy when reconstructing complex scatterers.

Download Full-text

Speech Emotion Recognition using Data Augmentation Method by Cycle-Generative Adversarial Networks

10.20944/preprints202104.0651.v1 ◽

2021 ◽

Author(s):

Arash Shilandari ◽

Hossein Marvi ◽

Hossein Khosravi

Keyword(s):

Neural Network ◽

Emotion Recognition ◽

Speech Processing ◽

Data Augmentation ◽

Generative Adversarial Networks ◽

Speech Emotion Recognition ◽

Support Vector ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Adversarial Networks

Nowadays, and with the mechanization of life, speech processing has become so crucial for the interaction between humans and machines. Deep neural networks require a database with enough data for training. The more features are extracted from the speech signal, the more samples are needed to train these networks. Adequate training of these networks can be ensured when there is access to sufficient and varied data in each class. If there is not enough data; it is possible to use data augmentation methods to obtain a database with enough samples. One of the obstacles to developing speech emotion recognition systems is the Data sparsity problem in each class for neural network training. The current study has focused on making a cycle generative adversarial network for data augmentation in a system for speech emotion recognition. For each of the five emotions employed, an adversarial generating network is designed to generate data that is very similar to the main data in that class, as well as differentiate the emotions of the other classes. These networks are taught in an adversarial way to produce feature vectors like each class in the space of the main feature, and then they add to the training sets existing in the database to train the classifier network. Instead of using the common cross-entropy error to train generative adversarial networks and to remove the vanishing gradient problem, Wasserstein Divergence has been used to produce high-quality artificial samples. The suggested network has been tested to be applied for speech emotion recognition using EMODB as training, testing, and evaluating sets, and the quality of artificial data evaluated using two Support Vector Machine (SVM) and Deep Neural Network (DNN) classifiers. Moreover, it has been revealed that extracting and reproducing high-level features from acoustic features, speech emotion recognition with separating five primary emotions has been done with acceptable accuracy.

Download Full-text

Improved Classification of White Blood Cells with the Generative Adversarial Network and Deep Convolutional Neural Network

Computational Intelligence and Neuroscience ◽

10.1155/2020/6490479 ◽

2020 ◽

Vol 2020 ◽

pp. 1-12

Author(s):

Khaled Almezhghwi ◽

Sertan Serte

Keyword(s):

Blood Cells ◽

Data Augmentation ◽

White Blood Cells ◽

Training Data ◽

Generative Adversarial Networks ◽

Generative Adversarial Network ◽

Human Errors ◽

Morphological Variations ◽

Adversarial Network

White blood cells (leukocytes) are a very important component of the blood that forms the immune system, which is responsible for fighting foreign elements. The five types of white blood cells include neutrophils, eosinophils, lymphocytes, monocytes, and basophils, where each type constitutes a different proportion and performs specific functions. Being able to classify and, therefore, count these different constituents is critical for assessing the health of patients and infection risks. Generally, laboratory experiments are used for determining the type of a white blood cell. The staining process and manual evaluation of acquired images under the microscope are tedious and subject to human errors. Moreover, a major challenge is the unavailability of training data that cover the morphological variations of white blood cells so that trained classifiers can generalize well. As such, this paper investigates image transformation operations and generative adversarial networks (GAN) for data augmentation and state-of-the-art deep neural networks (i.e., VGG-16, ResNet, and DenseNet) for the classification of white blood cells into the five types. Furthermore, we explore initializing the DNNs’ weights randomly or using weights pretrained on the CIFAR-100 dataset. In contrast to other works that require advanced image preprocessing and manual feature extraction before classification, our method works directly with the acquired images. The results of extensive experiments show that the proposed method can successfully classify white blood cells. The best DNN model, DenseNet-169, yields a validation accuracy of 98.8%. Particularly, we find that the proposed approach outperforms other methods that rely on sophisticated image processing and manual feature engineering.

Download Full-text

RF-GANs: A Method to Synthesize Retinal Fundus Images Based on Generative Adversarial Network

Computational Intelligence and Neuroscience ◽

10.1155/2021/3812865 ◽

2021 ◽

Vol 2021 ◽

pp. 1-17

Author(s):

Yu Chen ◽

Jun Long ◽

Jifeng Guo

Keyword(s):

Data Augmentation ◽

Semantic Segmentation ◽

Training Data ◽

Generative Adversarial Networks ◽

Fundus Images ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Retinal Fundus Images ◽

Retinal Fundus ◽

Segmentation Models

Diabetic retinopathy (DR) is a diabetic complication affecting the eyes, which is the main cause of blindness in young and middle-aged people. In order to speed up the diagnosis of DR, a mass of deep learning methods have been used for the detection of this disease, but they failed to attain excellent results due to unbalanced training data, i.e., the lack of DR fundus images. To address the problem of data imbalance, this paper proposes a method dubbed retinal fundus images generative adversarial networks (RF-GANs), which is based on generative adversarial network, to synthesize retinal fundus images. RF-GANs is composed of two generation models, RF-GAN1 and RF-GAN2. Firstly, RF-GAN1 is employed to translate retinal fundus images from source domain (the domain of semantic segmentation datasets) to target domain (the domain of EyePACS dataset connected to Kaggle (EyePACS)). Then, we train the semantic segmentation models with the translated images, and employ the trained models to extract the structural and lesion masks (hereafter, we refer to it as Masks) of EyePACS. Finally, we employ RF-GAN2 to synthesize retinal fundus images using the Masks and DR grading labels. This paper verifies the effectiveness of the method: RF-GAN1 can narrow down the domain gap between different datasets to improve the performance of the segmentation models. RF-GAN2 can synthesize realistic retinal fundus images. Adopting the synthesized images for data augmentation, the accuracy and quadratic weighted kappa of the state-of-the-art DR grading model on the testing set of EyePACS increase by 1.53% and 1.70%, respectively.

Download Full-text

Geometric Morphometric Data Augmentation Using Generative Computational Learning Algorithms

Applied Sciences ◽

10.3390/app10249133 ◽

2020 ◽

Vol 10 (24) ◽

pp. 9133

Author(s):

Lloyd A. Courtenay ◽

Diego González-Aguilera

Keyword(s):

Sample Size ◽

Data Augmentation ◽

Synthetic Data ◽

Model Performance ◽

Training Data ◽

Generative Adversarial Networks ◽

Generative Adversarial Network ◽

Geometric Morphometric ◽

Adversarial Networks ◽

The Impact

The fossil record is notorious for being incomplete and distorted, frequently conditioning the type of knowledge that can be extracted from it. In many cases, this often leads to issues when performing complex statistical analyses, such as classification tasks, predictive modelling, and variance analyses, such as those used in Geometric Morphometrics. Here different Generative Adversarial Network architectures are experimented with, testing the effects of sample size and domain dimensionality on model performance. For model evaluation, robust statistical methods were used. Each of the algorithms were observed to produce realistic data. Generative Adversarial Networks using different loss functions produced multidimensional synthetic data significantly equivalent to the original training data. Conditional Generative Adversarial Networks were not as successful. The methods proposed are likely to reduce the impact of sample size and bias on a number of statistical learning applications. While Generative Adversarial Networks are not the solution to all sample-size related issues, combined with other pre-processing steps these limitations may be overcome. This presents a valuable means of augmenting geometric morphometric datasets for greater predictive visualization.

Download Full-text

Data Augmentation for EEG-Based Emotion Recognition Using Generative Adversarial Networks

Frontiers in Computational Neuroscience ◽

10.3389/fncom.2021.723843 ◽

2021 ◽

Vol 15 ◽

Author(s):

Guangcheng Bao ◽

Bin Yan ◽

Li Tong ◽

Jun Shu ◽

Linyuan Wang ◽

...

Keyword(s):

Emotion Recognition ◽

Data Augmentation ◽

Generative Models ◽

Generative Adversarial Networks ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Adversarial Networks ◽

Training Samples ◽

Using Data ◽

Effective Models

One of the greatest limitations in the field of EEG-based emotion recognition is the lack of training samples, which makes it difficult to establish effective models for emotion recognition. Inspired by the excellent achievements of generative models in image processing, we propose a data augmentation model named VAE-D2GAN for EEG-based emotion recognition using a generative adversarial network. EEG features representing different emotions are extracted as topological maps of differential entropy (DE) under five classical frequency bands. The proposed model is designed to learn the distributions of these features for real EEG signals and generate artificial samples for training. The variational auto-encoder (VAE) architecture can learn the spatial distribution of the actual data through a latent vector, and is introduced into the dual discriminator GAN to improve the diversity of the generated artificial samples. To evaluate the performance of this model, we conduct a systematic test on two public emotion EEG datasets, the SEED and the SEED-IV. The obtained recognition accuracy of the method using data augmentation shows as 92.5 and 82.3%, respectively, on the SEED and SEED-IV datasets, which is 1.5 and 3.5% higher than that of methods without using data augmentation. The experimental results show that the artificial samples generated by our model can effectively enhance the performance of the EEG-based emotion recognition.

Download Full-text

Geometric Morphometric Data Augmentation using Generative Computational Learning Algorithms

10.20944/preprints202011.0696.v1 ◽

2020 ◽

Author(s):

Lloyd A. Courtenay ◽

Diego González-Aguilera

Keyword(s):

Sample Size ◽

Data Augmentation ◽

Synthetic Data ◽

Model Performance ◽

Training Data ◽

Generative Adversarial Networks ◽

Generative Adversarial Network ◽

Geometric Morphometric ◽

Adversarial Networks ◽

The Impact

Download Full-text

Generating Scenery Images with Larger Variety According to User Descriptions

Applied Sciences ◽

10.3390/app112110224 ◽

2021 ◽

Vol 11 (21) ◽

pp. 10224

Author(s):

Hsu-Yung Cheng ◽

Chih-Chang Yu

Keyword(s):

Neural Network ◽

Data Augmentation ◽

Memory Cell ◽

Generative Adversarial Networks ◽

Image Generation ◽

Generative Adversarial Network ◽

Data Set ◽

Adversarial Network ◽

Adversarial Networks ◽

Hidden Layer

In this paper, a framework based on generative adversarial networks is proposed to perform nature-scenery generation according to descriptions from the users. The desired place, time and seasons of the generated scenes can be specified with the help of text-to-image generation techniques. The framework improves and modifies the architecture of a generative adversarial network with attention models by adding the imagination models. The proposed attentional and imaginative generative network uses the hidden layer information to initialize the memory cell of the recurrent neural network to produce the desired photos. A data set containing different categories of scenery images is established to train the proposed system. The experiments validate that the proposed method is able to increase the quality and diversity of the generated images compared to the existing method. A possible application of road image generation for data augmentation is also demonstrated in the experimental results.

Download Full-text

GANDaLF: GAN for Data-Limited Fingerprinting

Proceedings on Privacy Enhancing Technologies ◽

10.2478/popets-2021-0029 ◽

2021 ◽

Vol 2021 (2) ◽

pp. 305-322

Author(s):

Se Eun Oh ◽

Nate Mathews ◽

Mohammad Saidur Rahman ◽

Matthew Wright ◽

Nicholas Hopper

Keyword(s):

Deep Learning ◽

Training Data ◽

Generative Adversarial Networks ◽

Large Set ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Adversarial Networks ◽

Closed World ◽

Training Samples ◽

A Site

Abstract We introduce Generative Adversarial Networks for Data-Limited Fingerprinting (GANDaLF), a new deep-learning-based technique to perform Website Fingerprinting (WF) on Tor traffic. In contrast to most earlier work on deep-learning for WF, GANDaLF is intended to work with few training samples, and achieves this goal through the use of a Generative Adversarial Network to generate a large set of “fake” data that helps to train a deep neural network in distinguishing between classes of actual training data. We evaluate GANDaLF in low-data scenarios including as few as 10 training instances per site, and in multiple settings, including fingerprinting of website index pages and fingerprinting of non-index pages within a site. GANDaLF achieves closed-world accuracy of 87% with just 20 instances per site (and 100 sites) in standard WF settings. In particular, GANDaLF can outperform Var-CNN and Triplet Fingerprinting (TF) across all settings in subpage fingerprinting. For example, GANDaLF outperforms TF by a 29% margin and Var-CNN by 38% for training sets using 20 instances per site.

Download Full-text