Using Conditional Generative Adversarial Networks to Boost the Performance of Machine Learning in Microbiome Datasets

ORGANIC (1).pdf

10.26434/chemrxiv.5309668.v1 ◽

2017 ◽

Author(s):

Benjamin Sanchez-Lengeling ◽

Carlos Outeiral ◽

Gabriel L. Guimaraes ◽

Alan Aspuru-Guzik

Keyword(s):

Machine Learning ◽

Learning Community ◽

Chemical Species ◽

Material Design ◽

Organic Photovoltaic ◽

Generative Adversarial Networks ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Adversarial Networks ◽

Photovoltaic Material

Molecular discovery seeks to generate chemical species tailored to very specific needs. In this paper, we present ORGANIC, a framework based on Objective-Reinforced Generative Adversarial Networks (ORGAN), capable of producing a distribution over molecular space that matches with a certain set of desirable metrics. This methodology combines two successful techniques from the machine learning community: a Generative Adversarial Network (GAN), to create non-repetitive sensible molecular species, and Reinforcement Learning (RL), to bias this generative distribution towards certain attributes. We explore several applications, from optimization of random physicochemical properties to candidates for drug discovery and organic photovoltaic material design.

Download Full-text

Machine Learning Data Center Workloads Using Generative Adversarial Networks

ACM SIGMETRICS Performance Evaluation Review ◽

10.1145/3439602.3439611 ◽

2020 ◽

Vol 48 (2) ◽

pp. 21-23

Author(s):

Boudewijn R. Haverkort ◽

Felix Finkbeiner ◽

Pieter-Tjerk de Boer

Keyword(s):

Machine Learning ◽

Data Center ◽

Generative Adversarial Networks ◽

Adversarial Networks ◽

Learning Data

Download Full-text

Handling Imbalanced Data in Intrusion Detection Systems using Generative Adversarial Networks

Research and Development on Information and Communication Technology ◽

10.32913/mic-ict-research.v2020.n1.894 ◽

2020 ◽

Vol 2020 (1) ◽

pp. 1-13

Author(s):

Ly Vu ◽

Quang Uy Nguyen

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

Detection System ◽

Imbalanced Data ◽

Original Data ◽

Machine Learning Techniques ◽

Generative Adversarial Networks ◽

Detection Systems ◽

Adversarial Networks ◽

Attack Data

Machine learning-based intrusion detection hasbecome more popular in the research community thanks to itscapability in discovering unknown attacks. To develop a gooddetection model for an intrusion detection system (IDS) usingmachine learning, a great number of attack and normal datasamples are required in the learning process. While normaldata can be relatively easy to collect, attack data is muchrarer and harder to gather. Subsequently, IDS datasets areoften dominated by normal data and machine learning modelstrained on those imbalanced datasets are ineffective in detect-ing attacks. In this paper, we propose a novel solution to thisproblem by using generative adversarial networks to generatesynthesized attack data for IDS. The synthesized attacks aremerged with the original data to form the augmented dataset.Three popular machine learning techniques are trained on theaugmented dataset. The experiments conducted on the threecommon IDS datasets and one our own dataset show thatmachine learning algorithms achieve better performance whentrained on the augmented dataset of the generative adversarialnetworks compared to those trained on the original datasetand other sampling techniques. The visualization techniquewas also used to analyze the properties of the synthesizeddata of the generative adversarial networks and the others.

Download Full-text

Counterfactual Examples for Data Augmentation: A Case Study

The International FLAIRS Conference Proceedings ◽

10.32473/flairs.v34i1.128503 ◽

2021 ◽

Vol 34 (1) ◽

Author(s):

Md Golam Moula Mehedi Hasan ◽

Douglas A. Talbert

Keyword(s):

Machine Learning ◽

Potential Application ◽

Data Augmentation ◽

Generative Adversarial Networks ◽

Application Area ◽

Learning Models ◽

Adversarial Networks ◽

Feature Values ◽

Machine Learning Models

Counterfactual explanations are gaining in popularity as a way of explaining machine learning models. Counterfactual examples are generally created to help interpret the decision of a model. In this case, if a model makes a certain decision for an instance, the counterfactual examples of that instance reverse the decision of the model. The counterfactual examples can be created by craftily changing particular feature values of the instance. Though counterfactual examples are generated to explain the decision of machine learning models, in this work, we explore another potential application area of counterfactual examples, whether counterfactual examples are useful for data augmentation. We demonstrate the efficacy of this approach on the widely used “Adult-Income” dataset. We consider several scenarios where we do not have enough data and use counterfactual examples to augment the dataset. We compare our approach with Generative Adversarial Networks approach for dataset augmentation. The experimental results show that our proposed approach can be an effective way to augment a dataset.

Download Full-text

Proposed Improvements for Automated Chemical Safety Evaluations Using In-Silico Techniques

10.20944/preprints202005.0408.v1 ◽

2020 ◽

Author(s):

Bryan Jordan

Keyword(s):

Machine Learning ◽

Drug Discovery ◽

Chemical Space ◽

Machine Learning Techniques ◽

Training Dataset ◽

Generative Adversarial Networks ◽

Chemical Safety ◽

Adversarial Networks ◽

Wide Range ◽

Traditional Drug

The vastness of chemical-space constrains traditional drug-discovery methods to the organic laws that are guiding the chemistry involved in filtering through candidates. Leveraging computing with machine-learning to intelligently generate compounds that meet a wide range of objectives can bring significant gains in time and effort needed to filter through a broad range of candidates. This paper details how the use of Generative-Adversarial-Networks, novel machine learning techniques to format the training dataset and the use of quantum computing offer new ways to expedite drug-discovery.

Download Full-text

Deep Convolutional Generative Modeling for Artificial Microstructure Development of Aluminum-Silicon Alloy

10.35940/ijdm.a1603.051121 ◽

2021 ◽

Vol 1 (1) ◽

pp. 1-6

Author(s):

Akshansh Mishra ◽

Tarushi Pathak

Keyword(s):

Machine Learning ◽

Generative Adversarial Networks ◽

Microstructure Development ◽

Machine Learning Technique ◽

Silicon Alloy ◽

Aluminum Silicon Alloy ◽

Adversarial Networks ◽

Generative Modeling ◽

Learning Technique ◽

The Given

Machine learning which is a sub-domain of an Artificial Intelligence which is finding various applications in manufacturing and material science sectors. In the present study, Deep Generative Modeling which a type of unsupervised machine learning technique has been adapted for the constructing the artificial microstructure of Aluminium-Silicon alloy. Deep Generative Adversarial Networks has been used for developing the artificial microstructure of the given microstructure image dataset. The results obtained showed that the developed models had learnt to replicate the lining near the certain images of the microstructures.

Download Full-text

Generating Energy Data for Machine Learning with Recurrent Generative Adversarial Networks

Energies ◽

10.3390/en13010130 ◽

2019 ◽

Vol 13 (1) ◽

pp. 130 ◽

Cited By ~ 4

Author(s):

Mohammad Navid Fekri ◽

Ananda Mohon Ghosh ◽

Katarina Grolinger

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Energy Consumption ◽

Smart Grid ◽

Time Series Data ◽

Security And Privacy ◽

Series Data ◽

Generative Adversarial Networks ◽

Generative Adversarial Network ◽

Adversarial Networks

The smart grid employs computing and communication technologies to embed intelligence into the power grid and, consequently, make the grid more efficient. Machine learning (ML) has been applied for tasks that are important for smart grid operation including energy consumption and generation forecasting, anomaly detection, and state estimation. These ML solutions commonly require sufficient historical data; however, this data is often not readily available because of reasons such as data collection costs and concerns regarding security and privacy. This paper introduces a recurrent generative adversarial network (R-GAN) for generating realistic energy consumption data by learning from real data. Generativea adversarial networks (GANs) have been mostly used for image tasks (e.g., image generation, super-resolution), but here they are used with time series data. Convolutional neural networks (CNNs) from image GANs are replaced with recurrent neural networks (RNNs) because of RNN’s ability to capture temporal dependencies. To improve training stability and increase quality of generated data, Wasserstein GANs (WGANs) and Metropolis-Hastings GAN (MH-GAN) approaches were applied. The accuracy is further improved by adding features created with ARIMA and Fourier transform. Experiments demonstrate that data generated by R-GAN can be used for training energy forecasting models.

Download Full-text

Usage of generative adversarial networks for edge prediction in ontology graphs

Informacionno-technologicheskij vestnik ◽

10.21499/2409-1650-2020-26-4-96-103 ◽

2020 ◽

pp. 96-103

Author(s):

О.П. Мосалов

Keyword(s):

Machine Learning ◽

Learning Model ◽

Computational Experiments ◽

Generative Adversarial Networks ◽

Adversarial Networks ◽

Machine Learning Model ◽

Parameter Values

Рассматривается модель машинного обучения для предсказания существования рёбер в графе онтологии, основанная на использовании генеративно-состязательной сети. Проведены вычислительные эксперименты для различных наборов значений гиперпараметров модели. Показано, что модель решает поставленную задачу. Сформулированы направления дальнейшего развития данного подхода. A machine learning model for edge existence prediction in an ontology graph, based on generative adversarial networks, is considered. Computational experiments for different sets of hyper parameter values are fulfilled. It is shown that the model solves the task. Further steps on this approach research are formulated.

Download Full-text

Machine learning-based microstructure prediction during laser sintering of alumina

Scientific Reports ◽

10.1038/s41598-021-89816-x ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Jianan Tang ◽

Xiao Geng ◽

Dongsheng Li ◽

Yunfeng Shi ◽

Jianhua Tong ◽

...

Keyword(s):

Machine Learning ◽

Materials Science ◽

Learning Algorithm ◽

Secondary Phase ◽

Generative Adversarial Networks ◽

Advanced Manufacturing ◽

Sem Micrographs ◽

Adversarial Networks ◽

Microstructure Prediction ◽

New Processing

AbstractPredicting material’s microstructure under new processing conditions is essential in advanced manufacturing and materials science. This is because the material’s microstructure hugely influences the material’s properties. We demonstrate an elegant machine learning algorithm that faithfully predicts the microstructure under new conditions, without the need of knowing the governing laws. We name this algorithm, RCWGAN-GP, which is regression-based conditional generative adversarial networks with Wasserstein loss function and gradient penalty. This algorithm was trained with experimental SEM micrographs from laser-sintered alumina under various laser powers. The RCWGAN-GP realistically regenerates the SEM micrographs under the trained laser powers. Impressively, it also faithfully predicts the alumina’s microstructure under unexplored laser powers. The predicted microstructure features, including the morphology of the sintered particles and the pores, match the experimental SEM micrographs very well. We further quantitatively examined the prediction accuracy of the RCWGAN-GP. We trained the algorithm with computer-created micrograph datasets of secondary-phase growth governed by the well-known Johnson–Mehl–Avrami (JMA) equation. The RCWGAN-GP accurately regenerates the micrographs at the trained time series, in terms of the grains’ shapes, sizes, and spatial distributions. More importantly, the predicted secondary phase fraction accurately follows the JMA curve.

Download Full-text

Generating Rainfall Data using GANs

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.b4958.129219 ◽

2019 ◽

Vol 9 (2) ◽

pp. 4124-4127

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Neural Nets ◽

Rainfall Data ◽

Generative Adversarial Networks ◽

Rainfall Prediction ◽

Adversarial Networks

Rainfall prediction is one of the major discussions in the meteorology because it is a major factor on which many things in the environment rely on. Neural Nets or any other machine learning algorithms need very large amount of data in order to achieve better accuracy but sometimes data can be scarce, this type of problems can be resolved by using Generative Adversarial Networks. Generative Adversarial Networks which are known for generating data by using the existing features from the old data, like generating images etc. There are many types of datasets which are scarce, rainfall data in one among them. So, the proposed system generates the rainfall data using GAN. The generated data is used for training the classifier, which predicts the rainfall.

Download Full-text