A Bayesian Framework for Adsorption Energy Prediction on Bimetallic Alloy Catalysts

For high-throughput screening of materials for heterogeneous catalysis, scaling relations provide an efficient scheme to estimate the chemisorption energies of hydrogenated species. However, conditioning on a single descriptor ignores the model uncertainty and leads to suboptimal prediction of the chemisorption energy. In this paper, we extend the single-descriptor linear scaling relation to multi-descriptor linear regression models to leverage the correlation between the adsorption energies of any pair of adsorbates. With a large dataset, we use the Bayesian Information Criterion (BIC) as the model evidence to select the best linear regression model derived from non-informative priors. Furthermore, Gaussian Process Regression (GPR) based on a meaningful convolution of physical properties of the metal-adsorbate complex can be used to predict the baseline residual of the selected model. This integration of Bayesian model selection and Gaussian process regression, dubbed residual learning, can achieve performance comparable to the standard DFT error (0.1 eV) for most adsorbate systems. For sparse and small datasets, we propose an ad hoc Bayesian Model Averaging (BMA) approach to make robust predictions. With this Bayesian framework, we significantly reduce the model uncertainty and improve the prediction accuracy. The potential of the framework for high-throughput catalytic materials exploration in a realistic setting is illustrated using large and small, dense and sparse simulated datasets generated from a public database of bimetallic alloys available at Catalysis-Hub.org.
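The model-selection and averaging steps described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the data are synthetic, the function names (`fit_ols`, `bic`, `rank_models`, `bma_predict`) are hypothetical, the BIC uses the Gaussian-likelihood form up to a constant, and the averaging weights use the common approximation proportional to exp(-ΔBIC/2), which may differ from the paper's ad hoc BMA scheme. The GPR residual step is omitted.

```python
from itertools import combinations

import numpy as np


def fit_ols(X, y):
    """Least-squares fit with an intercept; returns coefficients and RSS."""
    A = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    rss = float(np.sum((y - A @ coef) ** 2))
    return coef, rss


def bic(rss, n, k):
    """Gaussian-likelihood BIC, up to an additive constant (k = fitted parameters)."""
    return n * np.log(rss / n) + k * np.log(n)


def rank_models(X, y):
    """Fit every non-empty descriptor subset, sort by BIC, attach BMA weights."""
    n, p = X.shape
    models = []
    for size in range(1, p + 1):
        for cols in combinations(range(p), size):
            coef, rss = fit_ols(X[:, list(cols)], y)
            models.append({"cols": cols, "coef": coef, "bic": bic(rss, n, size + 1)})
    models.sort(key=lambda m: m["bic"])
    # Assumed weighting: w_m proportional to exp(-0.5 * (BIC_m - BIC_min)).
    delta = np.array([m["bic"] for m in models]) - models[0]["bic"]
    weights = np.exp(-0.5 * delta)
    weights /= weights.sum()
    for m, w in zip(models, weights):
        m["weight"] = float(w)
    return models


def bma_predict(models, X_new):
    """Weighted average of the predictions of all candidate models."""
    preds = [
        m["weight"] * (m["coef"][0] + X_new[:, list(m["cols"])] @ m["coef"][1:])
        for m in models
    ]
    return np.sum(preds, axis=0)


# Synthetic stand-in for adsorption energies: the target depends on
# descriptors 0 and 1 only; descriptor 2 is irrelevant.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
y = 0.5 + 0.8 * X[:, 0] - 0.3 * X[:, 1] + rng.normal(scale=0.1, size=200)

ranked = rank_models(X, y)
print("lowest-BIC descriptor subset:", ranked[0]["cols"])
print("BMA prediction at the origin:", bma_predict(ranked, np.zeros((1, 3))))
```

Selecting `ranked[0]` corresponds to picking a single best model, while `bma_predict` hedges across all candidates, which is the more cautious choice when the data are too sparse to single out one model.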

A Bayesian framework for adsorption energy prediction on bimetallic alloy catalysts

npj Computational Materials ◽

10.1038/s41524-020-00447-8 ◽

2020 ◽

Vol 6 (1) ◽

Author(s):

Osman Mamun ◽

Kirsten T. Winther ◽

Jacob R. Boes ◽

Thomas Bligaard

Keyword(s):

Linear Regression ◽

Gaussian Process ◽

High Throughput ◽

Adsorption Energy ◽

Model Uncertainty ◽

Bayesian Model ◽

Bayesian Model Averaging ◽

Gaussian Process Regression ◽

Bayesian Framework ◽

Information Criteria

Multi-kernel Gaussian process regression and Bayesian model averaging based nonlinear state estimation and quality prediction of multiphase batch processes

2013 American Control Conference ◽

10.1109/acc.2013.6580690 ◽

2013 ◽

Author(s):

Jie Yu ◽

Kuilin Chen ◽

Junichi Mori ◽

Mudassir M. Rashid

Keyword(s):

Gaussian Process ◽

State Estimation ◽

Bayesian Model ◽

Bayesian Model Averaging ◽

Model Averaging ◽

Gaussian Process Regression ◽

Batch Processes ◽

Quality Prediction ◽

Nonlinear State Estimation ◽

Nonlinear State

Gaussian Process Regression and Bayesian Model Averaging: An Alternative Approach to Modeling Spatial Phenomena

Geographical Analysis ◽

10.1111/gean.12083 ◽

2015 ◽

Vol 48 (1) ◽

pp. 82-111 ◽

Cited By ~ 8

Author(s):

Jacob Dearmon ◽

Tony E. Smith

Keyword(s):

Gaussian Process ◽

Bayesian Model ◽

Bayesian Model Averaging ◽

Model Averaging ◽

Gaussian Process Regression ◽

Alternative Approach

Advancing Style Analysis and Risk Modeling by Incorporating Model Uncertainty with Bayesian Model Averaging

SSRN Electronic Journal ◽

10.2139/ssrn.2346295 ◽

2013 ◽

Author(s):

Hao (David) Zhou

Keyword(s):

Model Uncertainty ◽

Bayesian Model ◽

Bayesian Model Averaging ◽

Model Averaging ◽

Style Analysis ◽

Risk Modeling

BMA‐Mod: A Bayesian model averaging strategy for determining dose‐response relationships in the presence of model uncertainty

Biometrical Journal ◽

10.1002/bimj.201700211 ◽

2018 ◽

Vol 61 (5) ◽

pp. 1141-1159 ◽

Cited By ~ 2

Author(s):

A. Lawrence Gould

Keyword(s):

Dose Response ◽

Model Uncertainty ◽

Bayesian Model ◽

Bayesian Model Averaging ◽

Model Averaging

Bayesian Model Averaging to Account for Model Uncertainty in Estimates of a Vaccine's Effectiveness

10.1101/2021.05.12.21257126 ◽

2021 ◽

Author(s):

Carlos R Oliveira ◽

Eugene D Shapiro ◽

Daniel M Weinberger

Keyword(s):

Model Selection ◽

Model Uncertainty ◽

Bayesian Model ◽

Bayesian Model Averaging ◽

Model Averaging ◽

Selection Methods ◽

Final Model ◽

Negative Case ◽

Confounder Selection ◽

Control Study

Vaccine effectiveness (VE) studies are often conducted after the introduction of new vaccines to ensure they provide protection in real-world settings. Although susceptible to confounding, the test-negative case-control study design is the most efficient method to assess VE post-licensure. Control of confounding is often needed during the analyses, which is most efficiently done through multivariable modeling. When a large number of potential confounders are being considered, it can be challenging to know which variables need to be included in the final model. This paper highlights the importance of considering model uncertainty by re-analyzing a Lyme VE study using several confounder selection methods. We propose an intuitive Bayesian Model Averaging (BMA) framework for this task and compare the performance of BMA to that of traditional single-best-model-selection methods. We demonstrate how BMA can be advantageous in situations when there is uncertainty about model selection by systematically considering alternative models and increasing transparency.

Local Marginal Analysis of Spatial Data: A Gaussian Process Regression Approach with Bayesian Model and Kernel Averaging

Spatial Econometrics: Qualitative and Limited Dependent Variables - Advances in Econometrics ◽

10.1108/s0731-905320160000037018 ◽

2016 ◽

pp. 297-342

Author(s):

Jacob Dearmon ◽

Tony E. Smith

Keyword(s):

Gaussian Process ◽

Spatial Data ◽

Bayesian Model ◽

Gaussian Process Regression ◽

Marginal Analysis ◽

Regression Approach

Inferring cellular regulatory networks with Bayesian model averaging for linear regression (BMALR)

Molecular BioSystems ◽

10.1039/c4mb00053f ◽

2014 ◽

Vol 10 (8) ◽

pp. 2023-2030 ◽

Cited By ~ 6

Author(s):

Xun Huang ◽

Zhike Zi

Keyword(s):

Linear Regression ◽

Molecular Interactions ◽

Computational Efficiency ◽

Bayesian Model ◽

Prediction Accuracy ◽

Regulatory Networks ◽

Bayesian Model Averaging ◽

Model Averaging ◽

High Prediction ◽

High Computational Efficiency

A new method that uses Bayesian model averaging for linear regression to infer molecular interactions in biological systems with high prediction accuracy and high computational efficiency.

Using Bayesian model averaging to improve ground motion predictions

Geophysical Journal International ◽

10.1093/gji/ggz486 ◽

2019 ◽

Vol 220 (2) ◽

pp. 1368-1378

Author(s):

M Bertin ◽

S Marin ◽

C Millet ◽

C Berge-Thierry

Keyword(s):

Ground Motion ◽

Model Uncertainty ◽

Bayesian Model ◽

Bayesian Model Averaging ◽

Selection Process ◽

Hazard Analysis ◽

Model Averaging ◽

Likelihood Estimation ◽

Predictive Performance ◽

Out Of Sample

Summary: In low-seismicity areas such as Europe, seismic records do not cover the whole range of variable configurations required for seismic hazard analysis. Usually, a set of empirical models established in such contexts (the Mediterranean Basin, northeast U.S.A., Japan, etc.) is considered through a logic-tree-based selection process. This approach relies mainly on the scientist’s expertise and ignores the uncertainty in model selection. An important potential consequence of neglecting model uncertainty is that we assign more precision to our inference than the data warrant, which leads to overconfident decisions. In this paper, we investigate the Bayesian model averaging (BMA) approach using nine ground-motion prediction equations (GMPEs) derived from several databases. BMA has become an important tool for dealing with model uncertainty, especially in empirical settings with a large number of potential models and a relatively limited number of observations. Two numerical techniques for implementing BMA, one based on the Markov chain Monte Carlo method and one on maximum likelihood estimation, are presented and applied to around 1000 records from the RESORCE-2013 database. In the example considered, BMA is shown to provide both a hierarchy of GMPEs and improved out-of-sample predictive performance.
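As a rough illustration of how maximum likelihood can yield BMA weights and a model hierarchy, the sketch below fits mixture weights over a fixed set of model predictions with an EM loop. This is an assumption for illustration only, not the procedure of the paper: the function name `em_bma_weights`, the shared Gaussian error variance, and the synthetic data are all hypothetical.

```python
import numpy as np


def em_bma_weights(preds, y, n_iter=200):
    """Estimate BMA-style weights by maximum likelihood using EM.

    preds: array of shape (n_obs, n_models), one column per candidate model.
    y:     array of shape (n_obs,), the observations.
    Assumes a Gaussian error with a single variance shared by all models.
    """
    n, m = preds.shape
    w = np.full(m, 1.0 / m)
    resid2 = (y[:, None] - preds) ** 2
    sigma2 = max(float(np.var(y - preds.mean(axis=1))), 1e-12)
    for _ in range(n_iter):
        # E-step: responsibility of each model for each observation.
        # The Gaussian normalizing constant is shared, so it cancels.
        log_r = np.log(np.maximum(w, 1e-300)) - 0.5 * resid2 / sigma2
        log_r -= log_r.max(axis=1, keepdims=True)
        r = np.exp(log_r)
        r /= r.sum(axis=1, keepdims=True)
        # M-step: update weights and the shared error variance.
        w = r.mean(axis=0)
        sigma2 = max(float(np.sum(r * resid2) / n), 1e-12)
    return w, sigma2


# Synthetic example: model 0 is unbiased, model 1 carries a constant offset.
rng = np.random.default_rng(0)
truth = np.linspace(0.0, 5.0, 300)
obs = truth + rng.normal(scale=0.1, size=300)
candidates = np.column_stack([
    truth + rng.normal(scale=0.1, size=300),
    truth + 2.0 + rng.normal(scale=0.1, size=300),
])
weights, variance = em_bma_weights(candidates, obs)
print("weights:", weights, "shared variance:", variance)
```

Sorting the candidate models by the fitted weights gives a ranking of the kind the summary calls a hierarchy, and the weighted combination of predictions can then be checked on held-out records.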
