Recursive Gaussian Process Regression Model for Adaptive Quality Monitoring in Batch Processes

SAMPL6 Challenge Results from pKa Predictions Based on a General Gaussian Process Model

10.26434/chemrxiv.6406505.v2 ◽

2018 ◽

Author(s):

Caitlin C. Bannan ◽

David Mobley ◽

A. Geoff Skillman

Keyword(s):

Gaussian Process ◽

Process Model ◽

Molecular Graph ◽

Gaussian Process Regression ◽

Ionization State ◽

Training Set ◽

Physiochemical Properties ◽

Quantile Plots ◽

Physical And Chemical ◽

Good Agreement

<div>A variety of fields would benefit from accurate pK<sub>a</sub> predictions, especially drug design due to the affect a change in ionization state can have on a molecules physiochemical properties.</div><div>Participants in the recent SAMPL6 blind challenge were asked to submit predictions for microscopic and macroscopic pK<sub>a</sub>s of 24 drug like small molecules.</div><div>We recently built a general model for predicting pK<sub>a</sub>s using a Gaussian process regression trained using physical and chemical features of each ionizable group.</div><div>Our pipeline takes a molecular graph and uses the OpenEye Toolkits to calculate features describing the removal of a proton.</div><div>These features are fed into a Scikit-learn Gaussian process to predict microscopic pK<sub>a</sub>s which are then used to analytically determine macroscopic pK<sub>a</sub>s.</div><div>Our Gaussian process is trained on a set of 2,700 macroscopic pK<sub>a</sub>s from monoprotic and select diprotic molecules.</div><div>Here, we share our results for microscopic and macroscopic predictions in the SAMPL6 challenge.</div><div>Overall, we ranked in the middle of the pack compared to other participants, but our fairly good agreement with experiment is still promising considering the challenge molecules are chemically diverse and often polyprotic while our training set is predominately monoprotic.</div><div>Of particular importance to us when building this model was to include an uncertainty estimate based on the chemistry of the molecule that would reflect the likely accuracy of our prediction. </div><div>Our model reports large uncertainties for the molecules that appear to have chemistry outside our domain of applicability, along with good agreement in quantile-quantile plots, indicating it can predict its own accuracy.</div><div>The challenge highlighted a variety of means to improve our model, including adding more polyprotic molecules to our training set and more carefully considering what functional groups we do or do not identify as ionizable. </div>

Download Full-text

Adaptive Soft Sensor Development Based on Online Ensemble Gaussian Process Regression for Nonlinear Time-Varying Batch Processes

Industrial & Engineering Chemistry Research ◽

10.1021/acs.iecr.5b01495 ◽

2015 ◽

Vol 54 (30) ◽

pp. 7320-7345 ◽

Cited By ~ 35

Author(s):

Huaiping Jin ◽

Xiangguang Chen ◽

Li Wang ◽

Kai Yang ◽

Lei Wu

Keyword(s):

Gaussian Process ◽

Gaussian Process Regression ◽

Soft Sensor ◽

Batch Processes ◽

Time Varying ◽

Sensor Development

Download Full-text

Multi-kernel Gaussian process regression and Bayesian model averaging based nonlinear state estimation and quality prediction of multiphase batch processes

2013 American Control Conference ◽

10.1109/acc.2013.6580690 ◽

2013 ◽

Author(s):

Jie Yu ◽

Kuilin Chen ◽

Junichi Mori ◽

Mudassir M. Rashid

Keyword(s):

Gaussian Process ◽

State Estimation ◽

Bayesian Model ◽

Bayesian Model Averaging ◽

Model Averaging ◽

Gaussian Process Regression ◽

Batch Processes ◽

Quality Prediction ◽

Nonlinear State Estimation ◽

Nonlinear State

Download Full-text

A residual-based Gaussian process model framework for finite element model updating

Computers & Structures ◽

10.1016/j.compstruc.2015.05.003 ◽

2015 ◽

Vol 156 ◽

pp. 149-159 ◽

Cited By ~ 28

Author(s):

Hua-Ping Wan ◽

Wei-Xin Ren

Keyword(s):

Finite Element ◽

Finite Element Model ◽

Gaussian Process ◽

Process Model ◽

Model Updating ◽

Element Model ◽

Finite Element Model Updating ◽

Model Framework ◽

Gaussian Process Model

Download Full-text

A hybrid process model for EDM based on finite-element method and Gaussian process regression

The International Journal of Advanced Manufacturing Technology ◽

10.1007/s00170-014-5989-y ◽

2014 ◽

Vol 74 (9-12) ◽

pp. 1197-1211 ◽

Cited By ~ 17

Author(s):

Wuyi Ming ◽

Guojun Zhang ◽

He Li ◽

Jianwen Guo ◽

Zhen Zhang ◽

...

Keyword(s):

Finite Element Method ◽

Finite Element ◽

Gaussian Process ◽

Process Model ◽

Gaussian Process Regression ◽

Hybrid Process ◽

Element Method

Download Full-text

Similarities and differences in spatial and non-spatial cognitive maps

10.1101/2020.01.21.914556 ◽

2020 ◽

Cited By ~ 2

Author(s):

Charley M. Wu ◽

Eric Schulz ◽

Mona M. Garvert ◽

Björn Meder ◽

Nicolas W. Schuck

Keyword(s):

Gaussian Process ◽

Process Model ◽

Gaussian Process Regression ◽

Cognitive Maps ◽

Spatial Task ◽

Cognitive Map ◽

Learning Curves ◽

Conceptual Domain ◽

Subject Design ◽

Spatial Domains

AbstractLearning and generalization in spatial domains is often thought to rely on a “cognitive map”, representing relationships between spatial locations. Recent research suggests that this same neural machinery is also recruited for reasoning about more abstract, conceptual forms of knowledge. Yet, to what extent do spatial and conceptual reasoning share common computational principles, and what are the implications for behavior? Using a within-subject design we studied how participants used spatial or conceptual distances to generalize and search for correlated rewards in successive multi-armed bandit tasks. Participant behavior indicated sensitivity to both spatial and conceptual distance, and was best captured using a Bayesian model of generalization that formalized distance-dependent generalization and uncertainty-guided exploration as a Gaussian Process regression with a radial basis function kernel. The same Gaussian Process model best captured human search decisions and judgments in both domains, and could simulate realistic learning curves, where we found equivalent levels of generalization in spatial and conceptual tasks. At the same time, we also find characteristic differences between domains. Relative to the spatial domain, participants showed reduced levels of uncertainty-directed exploration and increased levels of random exploration in the conceptual domain. Participants also displayed a one-directional transfer effect, where experience in the spatial task boosted performance in the conceptual task, but not vice versa. While confidence judgments indicated that participants were sensitive to the uncertainty of their knowledge in both tasks, they did not or could not leverage their estimates of uncertainty to guide exploration in the conceptual task. These results support the notion that value-guided learning and generalization recruit cognitive-map dependent computational mechanisms in spatial and conceptual domains. Yet both behavioral and model-based analyses suggest domain specific differences in how these representations map onto actions.Author summaryThere is a resurgence of interest in “cognitive maps” based on recent evidence that the hippocampal-entorhinal system encodes both spatial and non-spatial relational information, with far-reaching implications for human behavior. Yet little is known about the commonalities and differences in the computational principles underlying human learning and decision making in spatial and non-spatial domains. We use a within-subject design to examine how humans search for either spatially or conceptually correlated rewards. Using a Bayesian learning model, we find evidence for the same computational mechanisms of generalization across domains. While participants were sensitive to expected rewards and uncertainty in both tasks, how they leveraged this knowledge to guide exploration was different: participants displayed less uncertainty-directed and more random exploration in the conceptual domain. Moreover, experience with the spatial task improved conceptual performance, but not vice versa. These results provide important insights about the degree of overlap between spatial and conceptual cognition.

Download Full-text

SAMPL6 Challenge Results from pKa Predictions Based on a General Gaussian Process Model

10.26434/chemrxiv.6406505.v1 ◽

2018 ◽

Author(s):

Caitlin C. Bannan ◽

David L. Mobley ◽

Geoff Skillman

Keyword(s):

Gaussian Process ◽

Process Model ◽

Molecular Graph ◽

Gaussian Process Regression ◽

Ionization State ◽

Training Set ◽

Physiochemical Properties ◽

Quantile Plots ◽

Physical And Chemical ◽

Good Agreement

<div>A variety of fields would benefit from accurate pK<sub>a</sub> predictions, especially drug design due to the affect a change in ionization state can have on a molecules physiochemical properties.</div><div>Participants in the recent SAMPL6 blind challenge were asked to submit predictions for microscopic and macroscopic pK<sub>a</sub>s of 24 drug like small molecules.</div><div>We recently built a general model for predicting pK<sub>a</sub>s using a Gaussian process regression trained using physical and chemical features of each ionizable group.</div><div>Our pipeline takes a molecular graph and uses the OpenEye Toolkits to calculate features describing the removal of a proton.</div><div>These features are fed into a Scikit-learn Gaussian process to predict microscopic pK<sub>a</sub>s which are then used to analytically determine macroscopic pK<sub>a</sub>s.</div><div>Our Gaussian process is trained on a set of 2,700 macroscopic pK<sub>a</sub>s from monoprotic and select diprotic molecules.</div><div>Here, we share our results for microscopic and macroscopic predictions in the SAMPL6 challenge.</div><div>Overall, we ranked in the middle of the pack compared to other participants, but our fairly good agreement with experiment is still promising considering the challenge molecules are chemically diverse and often polyprotic while our training set is predominately monoprotic.</div><div>Of particular importance to us when building this model was to include an uncertainty estimate based on the chemistry of the molecule that would reflect the likely accuracy of our prediction. </div><div>Our model reports large uncertainties for the molecules that appear to have chemistry outside our domain of applicability, along with good agreement in quantile-quantile plots, indicating it can predict its own accuracy.</div><div>The challenge highlighted a variety of means to improve our model, including adding more polyprotic molecules to our training set and more carefully considering what functional groups we do or do not identify as ionizable. </div>

Download Full-text

Minimum Mode Saddle Point Searches Using Gaussian Process Regression with Inverse-Distance Covariance Function

10.26434/chemrxiv.9994868.v1 ◽

2019 ◽

Author(s):

Olli-Pekka Koistinen ◽

Vilhjálmur Ásgeirsson ◽

Aki Vehtari ◽

Hannes Jónsson

Keyword(s):

Saddle Point ◽

Gaussian Process ◽

Process Model ◽

Energy Surface ◽

Saddle Points ◽

Gaussian Process Regression ◽

Dissociative Adsorption ◽

Initial Guess ◽

Initial State ◽

Model Based

The minimum mode following method can be used to find saddle points on an energy surface by following a direction guided by the lowest curvature mode. Such calculations are often started close to a minimum on the energy surface to find out which transitions can occur from an initial state of the system, but it is also common to start from the vicinity of a first order saddle point making use of an initial guess based on intuition or more approximate calculations. In systems where accurate evaluations of the energy and its gradient are computationally intensive, it is important to exploit the information of the previous evaluations to enhance the performance. Here, we show that the number of evaluations required for convergence to the saddle point can be significantly reduced by making use of an approximate energy surface obtained by a Gaussian process model based on inverse inter-atomic distances, evaluating accurate energy and gradient at the saddle point of the approximate surface and then correcting the model based on the new information. The performance of the method is tested with start points chosen randomly in the vicinity of saddle points for dissociative adsorption of an H2 molecule on the Cu(110) Surface and three gas phase chemical reactions.<br>

Download Full-text

Multivariate Gaussian and Student-t process regression for multi-output prediction

Neural Computing and Applications ◽

10.1007/s00521-019-04687-8 ◽

2019 ◽

Vol 32 (8) ◽

pp. 3005-3028 ◽

Cited By ~ 4

Author(s):

Zexun Chen ◽

Bo Wang ◽

Alexander N. Gorban

Keyword(s):

Gaussian Process ◽

Process Model ◽

Gaussian Process Regression ◽

Real Data ◽

Investment Strategies ◽

Unified Framework ◽

Multivariate Distributions ◽

Air Quality Prediction ◽

Multivariate Gaussian ◽

The Matrix

AbstractGaussian process model for vector-valued function has been shown to be useful for multi-output prediction. The existing method for this model is to reformulate the matrix-variate Gaussian distribution as a multivariate normal distribution. Although it is effective in many cases, reformulation is not always workable and is difficult to apply to other distributions because not all matrix-variate distributions can be transformed to respective multivariate distributions, such as the case for matrix-variate Student-t distribution. In this paper, we propose a unified framework which is used not only to introduce a novel multivariate Student-t process regression model (MV-TPR) for multi-output prediction, but also to reformulate the multivariate Gaussian process regression (MV-GPR) that overcomes some limitations of the existing methods. Both MV-GPR and MV-TPR have closed-form expressions for the marginal likelihoods and predictive distributions under this unified framework and thus can adopt the same optimization approaches as used in the conventional GPR. The usefulness of the proposed methods is illustrated through several simulated and real-data examples. In particular, we verify empirically that MV-TPR has superiority for the datasets considered, including air quality prediction and bike rent prediction. At last, the proposed methods are shown to produce profitable investment strategies in the stock markets.

Download Full-text