3dinfogan: 3d Models’ Reconstruction In Infogans

Du Chunqi; Shinobu Hasegawa

doi:10.17576/apjitm-2021-1002-07

3dinfogan: 3d Models’ Reconstruction In Infogans

Asia-Pacific Journal of Information Technology and Multimedia ◽

10.17576/apjitm-2021-1002-07 ◽

2021 ◽

Vol 10 (02) ◽

pp. 95-109

Author(s):

Du Chunqi ◽

Shinobu Hasegawa

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Latent Variables ◽

Data Augmentation ◽

Synthetic Data ◽

Real Data ◽

3D Models ◽

High Quality ◽

Shape Constraint ◽

Real Objects

In computer vision and computer graphics, 3D reconstruction is the process of capturing real objects’ shapes and appearances. 3D models always can be constructed by active methods which use high-quality scanner equipment, or passive methods that learn from the dataset. However, both of these two methods only aimed to construct the 3D models, without showing what element affects the generation of 3D models. Therefore, the goal of this research is to apply deep learning to automatically generating 3D models, and finding the latent variables which affect the reconstructing process. The existing research GANs can be trained in little data with two networks called Generator and Discriminator, respectively. Generator can produce synthetic data, and Discriminator can discriminate between the generator’s output and real data. The existing research shows that InFoGAN can maximize the mutual information between latent variables and observation. In our approach, we will generate the 3D models based on InFoGAN and design two constraints, shape-constraint and parameters-constraint, respectively. Shape-constraint utilizes the data augmentation method to limit the synthetic data generated in the models’ profiles. At the same time, we also try to employ parameters-constraint to find the 3D models’ relationship corresponding to the latent variables. Furthermore, our approach will be a challenge in the architecture of generating 3D models built on InFoGAN. Finally, in the process of generation, we might discover the contribution of the latent variables influencing the 3D models to the whole network.

Seismic Data Augmentation Based on Conditional Generative Adversarial Networks

Sensors ◽

10.3390/s20236850 ◽

2020 ◽

Vol 20 (23) ◽

pp. 6850

Author(s):

Yuanming Li ◽

Bonhwa Ku ◽

Shou Zhang ◽

Jae-Kwang Ahn ◽

Hanseok Ko

Keyword(s):

Deep Learning ◽

Data Augmentation ◽

Synthetic Data ◽

Generative Adversarial Networks ◽

High Quality ◽

Seismic Waveforms ◽

Adversarial Networks ◽

Seismic Waveform ◽

Proposed Model

Realistic synthetic data can be useful for data augmentation when training deep learning models to improve seismological detection and classification performance. In recent years, various deep learning techniques have been successfully applied in modern seismology. Due to the performance of deep learning depends on a sufficient volume of data, the data augmentation technique as a data-space solution is widely utilized. In this paper, we propose a Generative Adversarial Networks (GANs) based model that uses conditional knowledge to generate high-quality seismic waveforms. Unlike the existing method of generating samples directly from noise, the proposed method generates synthetic samples based on the statistical characteristics of real seismic waveforms in embedding space. Moreover, a content loss is added to relate high-level features extracted by a pre-trained model to the objective function to enhance the quality of the synthetic data. The classification accuracy is increased from 96.84% to 97.92% after mixing a certain amount of synthetic seismic waveforms, and results of the quality of seismic characteristics derived from the representative experiment show that the proposed model provides an effective structure for generating high-quality synthetic seismic waveforms. Thus, the proposed model is experimentally validated as a promising approach to realistic high-quality seismic waveform data augmentation.

Real UAV-Bird Image Classification Using CNN with a Synthetic Dataset

Applied Sciences ◽

10.3390/app11093863 ◽

2021 ◽

Vol 11 (9) ◽

pp. 3863

Author(s):

Ali Emre Öztürk ◽

Ergun Erçelebi

Keyword(s):

Deep Learning ◽

Image Classification ◽

Synthetic Data ◽

Real Data ◽

Corner Detection ◽

Batch Size ◽

Test Accuracy ◽

Classification Problems ◽

Auc Value ◽

Classification Test

A large amount of training image data is required for solving image classification problems using deep learning (DL) networks. In this study, we aimed to train DL networks with synthetic images generated by using a game engine and determine the effects of the networks on performance when solving real-image classification problems. The study presents the results of using corner detection and nearest three-point selection (CDNTS) layers to classify bird and rotary-wing unmanned aerial vehicle (RW-UAV) images, provides a comprehensive comparison of two different experimental setups, and emphasizes the significant improvements in the performance in deep learning-based networks due to the inclusion of a CDNTS layer. Experiment 1 corresponds to training the commonly used deep learning-based networks with synthetic data and an image classification test on real data. Experiment 2 corresponds to training the CDNTS layer and commonly used deep learning-based networks with synthetic data and an image classification test on real data. In experiment 1, the best area under the curve (AUC) value for the image classification test accuracy was measured as 72%. In experiment 2, using the CDNTS layer, the AUC value for the image classification test accuracy was measured as 88.9%. A total of 432 different combinations of trainings were investigated in the experimental setups. The experiments were trained with various DL networks using four different optimizers by considering all combinations of batch size, learning rate, and dropout hyperparameters. The test accuracy AUC values for networks in experiment 1 ranged from 55% to 74%, whereas the test accuracy AUC values in experiment 2 networks with a CDNTS layer ranged from 76% to 89.9%. It was observed that the CDNTS layer has considerable effects on the image classification accuracy performance of deep learning-based networks. AUC, F-score, and test accuracy measures were used to validate the success of the networks.

Generative adversarial networks for generating synthetic features for Wi-Fi signal quality

PLoS ONE ◽

10.1371/journal.pone.0260308 ◽

2021 ◽

Vol 16 (11) ◽

pp. e0260308

Author(s):

Mauro Castelli ◽

Luca Manzoni ◽

Tatiane Espindola ◽

Aleš Popovič ◽

Andrea De Lorenzo

Keyword(s):

Synthetic Data ◽

Real Data ◽

Generative Models ◽

Generative Adversarial Networks ◽

Signal Quality ◽

Quality Service ◽

High Quality ◽

The Real ◽

Adversarial Networks ◽

High Quality Service

Wireless networks are among the fundamental technologies used to connect people. Considering the constant advancements in the field, telecommunication operators must guarantee a high-quality service to keep their customer portfolio. To ensure this high-quality service, it is common to establish partnerships with specialized technology companies that deliver software services in order to monitor the networks and identify faults and respective solutions. A common barrier faced by these specialized companies is the lack of data to develop and test their products. This paper investigates the use of generative adversarial networks (GANs), which are state-of-the-art generative models, for generating synthetic telecommunication data related to Wi-Fi signal quality. We developed, trained, and compared two of the most used GAN architectures: the Vanilla GAN and the Wasserstein GAN (WGAN). Both models presented satisfactory results and were able to generate synthetic data similar to the real ones. In particular, the distribution of the synthetic data overlaps the distribution of the real data for all of the considered features. Moreover, the considered generative models can reproduce the same associations observed for the synthetic features. We chose the WGAN as the final model, but both models are suitable for addressing the problem at hand.

Albumentations: Fast and Flexible Image Augmentations

Information ◽

10.3390/info11020125 ◽

2020 ◽

Vol 11 (2) ◽

pp. 125 ◽

Cited By ~ 30

Author(s):

Alexander Buslaev ◽

Vladimir I. Iglovikov ◽

Eugene Khvedchenya ◽

Alex Parinov ◽

Mikhail Druzhinin ◽

...

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Processing Speed ◽

Data Augmentation ◽

Improve Performance ◽

Regularization Technique ◽

Image Transform ◽

Basic Image ◽

Learning Frameworks ◽

Image Transformations

Data augmentation is a commonly used technique for increasing both the size and the diversity of labeled training sets by leveraging input transformations that preserve corresponding output labels. In computer vision, image augmentations have become a common implicit regularization technique to combat overfitting in deep learning models and are ubiquitously used to improve performance. While most deep learning frameworks implement basic image transformations, the list is typically limited to some variations of flipping, rotating, scaling, and cropping. Moreover, image processing speed varies in existing image augmentation libraries. We present Albumentations, a fast and flexible open source library for image augmentation with many various image transform operations available that is also an easy-to-use wrapper around other augmentation libraries. We discuss the design principles that drove the implementation of Albumentations and give an overview of the key features and distinct capabilities. Finally, we provide examples of image augmentations for different computer vision tasks and demonstrate that Albumentations is faster than other commonly used image augmentation tools on most image transform operations.

Brain tumor segmentation using K‐means clustering and deep learning with synthetic data augmentation for classification

Microscopy Research and Technique ◽

10.1002/jemt.23694 ◽

2021 ◽

Author(s):

Amjad Rehman Khan ◽

Siraj Khan ◽

Majid Harouni ◽

Rashid Abbasi ◽

Sajid Iqbal ◽

...

Keyword(s):

Deep Learning ◽

Brain Tumor ◽

Data Augmentation ◽

Synthetic Data ◽

Tumor Segmentation ◽

Brain Tumor Segmentation

Data Augmentation of Automotive LIDAR Point Clouds under Adverse Weather Situations

Sensors ◽

10.3390/s21134503 ◽

2021 ◽

Vol 21 (13) ◽

pp. 4503

Author(s):

Jose Roberto Vargas Rivero ◽

Thiemo Gerbich ◽

Boris Buschardt ◽

Jia Chen

Keyword(s):

Autonomous Vehicles ◽

Data Augmentation ◽

Synthetic Data ◽

Real Data ◽

Weather Conditions ◽

Point Clouds ◽

Training Data ◽

Driver Assistance Systems ◽

Solid Objects ◽

Adverse Weather

In contrast to previous works on data augmentation using LIDAR (Light Detection and Ranging), which mostly consider point clouds under good weather conditions, this paper uses point clouds which are affected by spray. Spray water can be a cause of phantom braking and understanding how to handle the extra detections caused by it is an important step in the development of ADAS (Advanced Driver Assistance Systems)/AV (Autonomous Vehicles) functions. The extra detections caused by spray cannot be safely removed without considering cases in which real solid objects may be present in the same region in which the detections caused by spray take place. As collecting real examples would be extremely difficult, the use of synthetic data is proposed. Real scenes are reconstructed virtually with an added extra object in the spray region, in a way that the detections caused by this obstacle match the characteristics a real object in the same position would have regarding intensity, echo number and occlusion. The detections generated by the obstacle are then used to augment the real data, obtaining, after occlusion effects are added, a good approximation of the desired training data. This data is used to train a classifier achieving an average F-Score of 92. The performance of the classifier is analyzed in detail based on the characteristics of the synthetic object: size, position, reflection, duration. The proposed method can be easily expanded to different kinds of obstacles and classifier types.

A Torch Without Light: Low-Light Imaging for Mobile Phones

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.a9557.109119 ◽

2019 ◽

Vol 9 (1) ◽

pp. 2042-2047

Keyword(s):

Synthetic Data ◽

Real Data ◽

Light Level ◽

Good Representation ◽

Low Light ◽

High Quality ◽

Data Set ◽

Learning Techniques ◽

Low Illumination ◽

Scene Content

Photography used to be a hobby that required equipment such as a professional camera. Today, photography has evolved to be a daily activity conducted on an unprecedented scale due to the adoption of camera into smartphones. Mobile phone cameras are on the way to completely replace other forms of camera due to their portability and quality. Millions of images are captured on mobile devices across the globe. These images are clear and crisp. But all these images are captured in daylight. Images taken in low illumination essentially turn out to be too dark to be comprehensible. Research shows that current solutions to this problem work for dim to moderate light level but fail in extreme low light. There are certain problems involved with these techniques. Firstly, image denoising relies on image priors limiting the situations on what it will work on. Other deep learning techniques work on synthetic data and cannot be proficient on real data. Secondly, Low light image enhancement assumes that images already contain a good representation of scene content. This paper proposes to capture low illumination images and transform them to high quality images using end to end fully convolutional neural network trained on our data set of raw images shot in low aperture and their corresponding high aperture raw images. As an outcome, we will be able to transform images to high quality and identify objects.

Wildfire Smoke Classification Based on Synthetic Images and Pixel- and Feature-Level Domain Adaptation

Sensors ◽

10.3390/s21237785 ◽

2021 ◽

Vol 21 (23) ◽

pp. 7785

Author(s):

Jun Mao ◽

Change Zheng ◽

Jiyan Yin ◽

Ye Tian ◽

Wenbin Cui

Keyword(s):

Deep Learning ◽

Domain Adaptation ◽

Synthetic Data ◽

Real Data ◽

Training Image ◽

Classification Model ◽

Environmental Diversity ◽

Wildfire Smoke ◽

Synthetic Images ◽

Adaptation Method

Training a deep learning-based classification model for early wildfire smoke images requires a large amount of rich data. However, due to the episodic nature of fire events, it is difficult to obtain wildfire smoke image data, and most of the samples in public datasets suffer from a lack of diversity. To address these issues, a method using synthetic images to train a deep learning classification model for real wildfire smoke was proposed in this paper. Firstly, we constructed a synthetic dataset by simulating a large amount of morphologically rich smoke in 3D modeling software and rendering the virtual smoke against many virtual wildland background images with rich environmental diversity. Secondly, to better use the synthetic data to train a wildfire smoke image classifier, we applied both pixel-level domain adaptation and feature-level domain adaptation. The CycleGAN-based pixel-level domain adaptation method for image translation was employed. On top of this, the feature-level domain adaptation method incorporated ADDA with DeepCORAL was adopted to further reduce the domain shift between the synthetic and real data. The proposed method was evaluated and compared on a test set of real wildfire smoke and achieved an accuracy of 97.39%. The method is applicable to wildfire smoke classification tasks based on RGB single-frame images and would also contribute to training image classification models without sufficient data.

Gait Activity Classification on Unbalanced Data from Inertial Sensors Using Shallow and Deep Learning

Sensors ◽

10.3390/s20174756 ◽

2020 ◽

Vol 20 (17) ◽

pp. 4756

Author(s):

Irvin Hussein Lopez-Nava ◽

Luis M. Valentín-Coronado ◽

Matias Garcia-Constantino ◽

Jesus Favela

Keyword(s):

Deep Learning ◽

Activity Recognition ◽

Large Scale ◽

Data Augmentation ◽

Inertial Sensors ◽

Synthetic Data ◽

Classification Performance ◽

Unbalanced Data ◽

Learning Approach ◽

Sampled Data

Activity recognition is one of the most active areas of research in ubiquitous computing. In particular, gait activity recognition is useful to identify various risk factors in people’s health that are directly related to their physical activity. One of the issues in activity recognition, and gait in particular, is that often datasets are unbalanced (i.e., the distribution of classes is not uniform), and due to this disparity, the models tend to categorize into the class with more instances. In the present study, two methods for classifying gait activities using accelerometer and gyroscope data from a large-scale public dataset were evaluated and compared. The gait activities in this dataset are: (i) going down an incline, (ii) going up an incline, (iii) walking on level ground, (iv) going down stairs, and (v) going up stairs. The proposed methods are based on conventional (shallow) and deep learning techniques. In addition, data were evaluated from three data treatments: original unbalanced data, sampled data, and augmented data. The latter was based on the generation of synthetic data according to segmented gait data. The best results were obtained with classifiers built with augmented data, with F-measure results of 0.812 (σ = 0.078) for the shallow learning approach, and of 0.927 (σ = 0.033) for the deep learning approach. In addition, the data augmentation strategy proposed to deal with the unbalanced problem resulted in increased classification performance using both techniques.

Towards Visible and Thermal Drone Monitoring with Convolutional Neural Networks

APSIPA Transactions on Signal and Information Processing ◽

10.1017/atsip.2018.30 ◽

2019 ◽

Vol 8 ◽

Cited By ~ 1

Author(s):

Ye Wang ◽

Yueru Chen ◽

Jongmoo Choi ◽

C.-C. Jay Kuo

Keyword(s):

Deep Learning ◽

Data Augmentation ◽

Tracking System ◽

Synthetic Data ◽

The Public ◽

Detection And Tracking ◽

Individual Module ◽

Augmentation Techniques ◽

Bounding Boxes ◽

Integrated Detection

This paper reports a visible and thermal drone monitoring system that integrates deep-learning-based detection and tracking modules. The biggest challenge in adopting deep learning methods for drone detection is the paucity of training drone images especially thermal drone images. To address this issue, we develop two data augmentation techniques. One is a model-based drone augmentation technique that automatically generates visible drone images with a bounding box label on the drone's location. The other is exploiting an adversarial data augmentation methodology to create thermal drone images. To track a small flying drone, we utilize the residual information between consecutive image frames. Finally, we present an integrated detection and tracking system that outperforms the performance of each individual module containing detection or tracking only. The experiments show that, even being trained on synthetic data, the proposed system performs well on real-world drone images with complex background. The USC drone detection and tracking dataset with user labeled bounding boxes is available to the public.