Scalable Approach to High Coverages on Oxides via Iterative Training of a Machine-Learning Algorithm

Mapping Intimacies ◽

10.26434/chemrxiv.10288514 ◽

2019 ◽

Author(s):

Andrew Medford ◽

Shengchun Yang ◽

Fuzhu Liu

Keyword(s):

Machine Learning ◽

Chemical Potential ◽

Learning Algorithm ◽

Absolute Error ◽

Low Energy ◽

Training Data ◽

High Coverage ◽

Metal Compounds ◽

Adsorption Energies ◽

The Stability

Understanding the interaction of multiple types of adsorbate molecules on solid surfaces is crucial to establishing the stability of catalysts under various chemical environments. Computational studies on the high coverage and mixed coverages of reaction intermediates are still challenging, especially for transition-metal compounds. In this work, we present a framework to predict differential adsorption energies and identify low-energy structures under high- and mixed-adsorbate coverages on oxide materials. The approach uses Gaussian process machine-learning models with quantified uncertainty in conjunction with an iterative training algorithm to actively identify the training set. The framework is demonstrated for the mixed adsorption of CHx, NHx and OHx species on the oxygen vacancy and pristine rutile TiO2(110) surface sites. The results indicate that the proposed algorithm is highly efficient at identifying the most valuable training data, and is able to predict differential adsorption energies with a mean absolute error of ~0.3 eV based on <25% of the total DFT data. The algorithm is also used to identify 76% of the low-energy structures based on <30% of the total DFT data, enabling construction of surface phase diagrams that account for high and mixed coverage as a function of the chemical potential of C, H, O, and N. Furthermore, the computational scaling indicates the algorithm scales nearly linearly (N1.12) as the number of adsorbates increases. This framework can be directly extended to metals, metal oxides, and other materials, providing a practical route toward the investigation of the behavior of catalysts under high-coverage conditions.

Download Full-text

Scalable Approach to High Coverages on Oxides via Iterative Training of a Machine-Learning Algorithm

10.26434/chemrxiv.10288514.v1 ◽

2019 ◽

Author(s):

Andrew Medford ◽

Shengchun Yang ◽

Fuzhu Liu

Keyword(s):

Machine Learning ◽

Chemical Potential ◽

Learning Algorithm ◽

Absolute Error ◽

Low Energy ◽

Training Data ◽

High Coverage ◽

Metal Compounds ◽

Adsorption Energies ◽

The Stability

Download Full-text

Water Quality Prediction Using Statistical Tool and Machine Learning Algorithm

Waste Management ◽

10.4018/978-1-7998-1210-4.ch029 ◽

2020 ◽

pp. 609-623

Author(s):

Arun Kumar Beerala ◽

Gobinath R. ◽

Shyamala G. ◽

Siribommala Manvitha

Keyword(s):

Machine Learning ◽

Learning Algorithm ◽

Training Data ◽

Machine Learning Techniques ◽

Statistical Tool ◽

Data Set ◽

Water Quality Prediction ◽

Living Things ◽

Sampling Locations ◽

Different Seasons

Water is the most valuable natural resource for all living things and the ecosystem. The quality of groundwater is changed due to change in ecosystem, industrialisation, and urbanisation, etc. In the study, 60 samples were taken and analysed for various physio-chemical parameters. The sampling locations were located using global positioning system (GPS) and were taken for two consecutive years for two different seasons, monsoon (Nov-Dec) and post-monsoon (Jan-Mar). In 2016-2017 and 2017-2018 pH, EC, and TDS were obtained in the field. Hardness and Chloride are determined using titration method. Nitrate and Sulphate were determined using Spectrophotometer. Machine learning techniques were used to train the data set and to predict the unknown values. The dominant elements of groundwater are as follows: Ca2, Mg2 for cation and Cl-, SO42, NO3− for anions. The regression value for the training data set was found to be 0.90596, and for the entire network, it was found to be 0.81729. The best performance was observed as 0.0022605 at epoch 223.

Download Full-text

Design-Oriented Multifidelity Fluid Simulation Using Machine Learned Fidelity Mapping

ASME 2019 Conference on Smart Materials, Adaptive Structures and Intelligent Systems ◽

10.1115/smasis2019-5515 ◽

2019 ◽

Cited By ~ 1

Author(s):

Kazuko Fuchi ◽

Eric M. Wolf ◽

David S. Makhija ◽

Nathan A. Wukie ◽

Christopher R. Schrock ◽

...

Keyword(s):

Machine Learning ◽

Learning Algorithm ◽

Fluid Simulation ◽

Machine Learning Algorithms ◽

Training Data ◽

Supervised Machine Learning ◽

High Fidelity ◽

Computational Domain ◽

Symmetry Properties ◽

High Fidelity Simulations

Abstract A machine learning algorithm that performs multifidelity domain decomposition is introduced. While the design of complex systems can be facilitated by numerical simulations, the determination of appropriate physics couplings and levels of model fidelity can be challenging. The proposed method automatically divides the computational domain into subregions and assigns required fidelity level, using a small number of high fidelity simulations to generate training data and low fidelity solutions as input data. Unsupervised and supervised machine learning algorithms are used to correlate features from low fidelity solutions to fidelity assignment. The effectiveness of the method is demonstrated in a problem of viscous fluid flow around a cylinder at Re ≈ 20. Ling et al. built physics-informed invariance and symmetry properties into machine learning models and demonstrated improved model generalizability. Along these lines, we avoid using problem dependent features such as coordinates of sample points, object geometry or flow conditions as explicit inputs to the machine learning model. Use of pointwise flow features generates large data sets from only one or two high fidelity simulations, and the fidelity predictor model achieved 99.5% accuracy at training points. The trained model was shown to be capable of predicting a fidelity map for a problem with an altered cylinder radius. A significant improvement in the prediction performance was seen when inputs are expanded to include multiscale features that incorporate neighborhood information.

Download Full-text

Extraction of Sea Ice Cover by Sentinel-1 SAR Based on SVM with Unsupervised Generation of Training Data

10.20944/preprints202005.0336.v1 ◽

2020 ◽

Author(s):

Xiaoming Li ◽

Yan Sun ◽

Qiang Zhang

Keyword(s):

Machine Learning ◽

Sea Ice ◽

Learning Algorithm ◽

Texture Features ◽

Open Water ◽

Ice Cover ◽

Training Data ◽

Support Vector ◽

Training Samples

In this paper, we focus on developing a novel method to extract sea ice cover (i.e., discrimination/classification of sea ice and open water) using Sentinel-1 (S1) cross-polarization (vertical-horizontal, VH or horizontal-vertical, HV) data in extra wide (EW) swath mode based on the machine learning algorithm support vector machine (SVM). The classification basis includes the S1 radar backscatter coefficients and texture features that are calculated from S1 data using the gray level co-occurrence matrix (GLCM). Different from previous methods where appropriate samples are manually selected to train the SVM to classify sea ice and open water, we proposed a method of unsupervised generation of the training samples based on two GLCM texture features, i.e. entropy and homogeneity, that have contrasting characteristics on sea ice and open water. We eliminate the most uncertainty of selecting training samples in machine learning and achieve automatic classification of sea ice and open water by using S1 EW data. The comparison shows good agreement between the SAR-derived sea ice cover using the proposed method and a visual inspection, of which the accuracy reaches approximately 90% - 95% based on a few cases. Besides this, compared with the analyzed sea ice cover data Ice Mapping System (IMS) based on 728 S1 EW images, the accuracy of extracted sea ice cover by using S1 data is more than 80%.

Download Full-text

Development and Validation of Machine-Learning Clear-Sky Detection Method Using 1-Min Irradiance Data and Sky Imagers at a Polluted Suburban Site, Xianghe

Remote Sensing ◽

10.3390/rs13183763 ◽

2021 ◽

Vol 13 (18) ◽

pp. 3763

Author(s):

Mengqi Liu ◽

Xiangao Xia ◽

Disong Fu ◽

Jinqiang Zhang

Keyword(s):

Machine Learning ◽

Learning Algorithm ◽

Qualitative Evaluation ◽

Absolute Error ◽

High Temporal Resolution ◽

Accuracy Score ◽

Energy Applications ◽

Clear Sky ◽

Polluted Sites ◽

Suburban Site

Clear-sky detection (CSD) is of critical importance in solar energy applications and surface radiative budget studies. Existing CSD methods are not sufficiently validated due to the lack of high-temporal resolution and long-term CSD ground observations, especially at polluted sites. Using five-year high resolution ground-based solar radiation data and visual inspected Total Sky Imager (TSI) measurements at polluted Xianghe, a suburban site, this study validated 17 existing CSD methods and developed a new CSD model based on a machine-learning algorithm (Random Forest: RF). The propagation of systematic errors from input data to the calculated global horizontal irradiance (GHI) is confirmed with Mean Absolute Error (MAE) increased by 99.7% (from 20.00 to 39.93 W·m−2). Through qualitative evaluation, the novel Bright-Sun method outperforms the other traditional CSD methods at Xianghe site, with high accuracy score 0.73 and 0.92 under clear and cloudy conditions, respectively. The RF CSD model developed by one-year irradiance and TSI data shows more robust performance, with clear/cloudy-sky accuracy score of 0.78/0.88. Overall, the Bright-Sun and RF CSD models perform satisfactorily at heavy polluted sites. Further analysis shows the RF CSD model built with only GHI-related parameters can still achieve a mean accuracy score of 0.81, which indicates RF CSD models have the potential in dealing with sites only providing GHI observations.

Download Full-text

Looking Beyond Adsorption Energies to Understand Interactions at Surface Using Machine Learning

10.26434/chemrxiv.14726184 ◽

2021 ◽

Author(s):

Sheena Agarwal ◽

Kavita Joshi

Keyword(s):

Machine Learning ◽

Small Molecules ◽

Density Functional ◽

Mean Absolute Error ◽

Absolute Error ◽

Face Centered Cubic ◽

Functional Theory ◽

Full Picture ◽

Adsorption Energies ◽

Face Centered

Abstract Identifying factors that influence interactions at the surface is still an active area of research. In this study, we present the importance of analyzing bondlength activation, while interpreting Density Functional Theory (DFT) results, as yet another crucial indicator for catalytic activity. We studied the adsorption of small molecules, such as O 2 , N 2 , CO, and CO 2 , on seven face-centered cubic (fcc) transition metal surfaces (M = Ag, Au, Cu, Ir, Rh, Pt, and Pd) and their commonly studied facets (100, 110, and 111). Through our DFT investigations, we highlight the absence of linear correlation between adsorption energies (E ads ) and bondlength activation (BL act ). Our study indicates the importance of evaluating both to develop a better understanding of adsorption at surfaces. We also developed a Machine Learning (ML) model trained on simple periodic table properties to predict both, E ads and BL act . Our ML model gives an accuracy of Mean Absolute Error (MAE) ∼ 0.2 eV for E ads predictions and 0.02 Å for BL act predictions. The systematic study of the ML features that affect E ads and BL act further reinforces the importance of looking beyond adsorption energies to get a full picture of surface interactions with DFT.

Download Full-text

Adaptive Context-Aware Energy Optimization for Services on Mobile Devices with Use of Machine Learning

Wireless Personal Communications ◽

10.1007/s11277-020-07657-9 ◽

2020 ◽

Vol 115 (3) ◽

pp. 1839-1867

Author(s):

Piotr Nawrocki ◽

Bartlomiej Sniezynski

Keyword(s):

Machine Learning ◽

Supervised Learning ◽

Mobile Devices ◽

Mobile Device ◽

Learning Algorithm ◽

Service Selection ◽

Mobile Systems ◽

Training Data ◽

Learning Mechanisms ◽

Network Connection

AbstractIn this paper we present an original adaptive task scheduling system, which optimizes the energy consumption of mobile devices using machine learning mechanisms and context information. The system learns how to allocate resources appropriately: how to schedule services/tasks optimally between the device and the cloud, which is especially important in mobile systems. Decisions are made taking the context into account (e.g. network connection type, location, potential time and cost of executing the application or service). In this study, a supervised learning agent architecture and service selection algorithm are proposed to solve this problem. Adaptation is performed online, on a mobile device. Information about the context, task description, the decision made and its results such as power consumption are stored and constitute training data for a supervised learning algorithm, which updates the knowledge used to determine the optimal location for the execution of a given type of task. To verify the solution proposed, appropriate software has been developed and a series of experiments have been conducted. Results show that as a result of the experience gathered and the learning process performed, the decision module has become more efficient in assigning the task to either the mobile device or cloud resources.

Download Full-text

Modeling and Optimizing a Chiller System Using a Machine Learning Algorithm

Energies ◽

10.3390/en12152860 ◽

2019 ◽

Vol 12 (15) ◽

pp. 2860 ◽

Cited By ~ 8

Author(s):

Jee-Heon Kim ◽

Nam-Chul Seong ◽

Wonchang Choi

Keyword(s):

Machine Learning ◽

Energy Consumption ◽

Prediction Accuracy ◽

Learning Algorithm ◽

Training Data ◽

Machine Learning Algorithm ◽

Air Conditioning System ◽

Energy Consumption Model ◽

Consumption Model ◽

Input Variables

This study was conducted to develop an energy consumption model of a chiller in a heating, ventilation, and air conditioning system using a machine learning algorithm based on artificial neural networks. The proposed chiller energy consumption model was evaluated for accuracy in terms of input layers that include the number of input variables, amount (proportion) of training data, and number of neurons. A standardized reference building was also modeled to generate operational data for the chiller system during extended cooling periods (warm weather months). The prediction accuracy of the chiller’s energy consumption was improved by increasing the number of input variables and adjusting the proportion of training data. By contrast, the effect of the number of neurons on the prediction accuracy was insignificant. The developed chiller model was able to predict energy consumption with 99.07% accuracy based on eight input variables, 60% training data, and 12 neurons.

Download Full-text

Secured Energy-Efficient Routing in Wireless Sensor Networks Using Machine Learning Algorithm

Deep Learning Strategies for Security Enhancement in Wireless Sensor Networks - Advances in Information Security, Privacy, and Ethics ◽

10.4018/978-1-7998-5068-7.ch002 ◽

2020 ◽

pp. 23-41

Author(s):

Ahona Ghosh ◽

Chiung Ching Ho ◽

Robert Bestak

Keyword(s):

Machine Learning ◽

Wireless Sensor Networks ◽

Sensor Networks ◽

Energy Efficient ◽

Learning Algorithm ◽

Low Energy ◽

Sensor Nodes ◽

Wireless Sensor ◽

Energy Efficient Routing

Wireless sensor networks consist of unattended small sensor nodes having low energy and low range of communication. It has been observed that if there is any system to periodically start and stop the sensors sensing activities, then it saves some energy, and thus, the network lifetime gets extended. According to the current literature, security and energy efficiency are the two main concerns to improve the quality of service during transmission of data in wireless sensor networks. Machine learning has proved its efficiency in developing efficient processes to handle complex problems in various network aspects. Routing in wireless sensor network is the process of finding the route for transmitting data among different sensor nodes according to the requirement. Machine learning has been used in a broad way for designing energy efficient routing protocols, and this chapter reviews the existing works in the said domain, which can be the guide to someone who wants to explore the area further.

Download Full-text

A Machine Learning Algorithm to Predict Hyperglycemic Cases Induced by PD-1/PD-L1 Inhibitors in the Real World (Preprint)

10.2196/preprints.27754 ◽

2021 ◽

Author(s):

Jincheng Yang

Keyword(s):

Machine Learning ◽

Adverse Reaction ◽

Prediction Model ◽

Clinical Decision Making ◽

Learning Algorithm ◽

Adverse Event Reporting System ◽

Training Data ◽

Support Vector ◽

Machine Learning Algorithm ◽

Total Drug

BACKGROUND Diabetes mellitus and cancer are amongst the leading causes of deaths worldwide; hyperglycemia plays a major contributory role in neoplastic transformation risk. Support Vector Machine (SVM) is a type of supervised learning method which analyzes data and recognizes patterns, mainly used for statistical classification and regression. OBJECTIVE From reported adverse events of PD-1 or PD-L1 (programmed death 1 or ligand 1) inhibitors in post-marketing monitoring, we aimed to construct an effective machine learning algorithm to predict the probability of hyperglycemic adverse reaction from PD-1/PD-L1 inhibitors treated patients efficiently and rapidly. METHODS Raw data was downloaded from US Food and Drug Administration Adverse Event Reporting System (FDA FAERS). Signal of relationship between drug and adverse reaction based on disproportionality analysis and Bayesian analysis. A multivariate pattern classification of SVM was used to construct classifier to separate adverse hyperglycemic reaction patients. A 10-fold-3-time cross validation for model setup within training data (80% data) output best parameter values in SVM within R software. The model was validated in each testing data (20% data) and two total drug data, with exactly predictor parameter variables: gamma and nu. RESULTS Total 95918 case files were downloaded from 7 relevant drugs (cemiplimab, avelumab, durvalumab, atezolizumab, pembrolizumab, ipilimumab, nivolumab). The number-type/number-optimization method was selected to optimize model. Both gamma and nu values correlated with case number showed high adjusted r2 in curve regressions (both r2 >0.95). Indexes of accuracy, F1 score, kappa and sensitivity were greatly improved from the prediction model in training data and two total drug data. CONCLUSIONS The SVM prediction model established here can non-invasively and precisely predict occurrence of hyperglycemic adverse drug reaction (ADR) in PD-1/PD-L1 inhibitors treated patients. Such information is vital to overcome ADR and to improve outcomes by distinguish high hyperglycemia-risk patients, and this machine learning algorithm can eventually add value onto clinical decision making. CLINICALTRIAL N/A

Download Full-text