Efficient ensemble generation for uncertain correlated parameters in atmospheric chemical models

2021
Vol 14 (9)
pp. 5583-5605
Author(s):
Annika Vogel
Hendrik Elbern

Abstract. Atmospheric chemical forecasts heavily rely on various model parameters, which are often insufficiently known, such as emission rates and deposition velocities. However, a reliable estimation of the resulting uncertainties with an ensemble of forecasts is impaired by the high dimensionality of the system. This study presents a novel approach, which restricts the problem to a low-dimensional subspace spanned by the leading uncertainties. It is based on the idea that the forecast model acts as a dynamical system inducing multivariate correlations of model uncertainties. This enables an efficient perturbation of high-dimensional model parameters according to their leading coupled uncertainties. The specific algorithm presented in this study is designed for parameters that depend on local environmental conditions and consists of three major steps: (1) an efficient assessment of various sources of model uncertainties spanned by independent sensitivities, (2) an efficient extraction of leading coupled uncertainties using eigenmode decomposition, and (3) an efficient generation of perturbations for high-dimensional parameter fields by the Karhunen–Loève expansion. Due to their perceived simulation challenge, the method has been applied to biogenic emissions of five trace gases, considering state-dependent sensitivities to local atmospheric and terrestrial conditions. Rapidly decreasing eigenvalues indicate that highly correlated uncertainties of regional biogenic emissions can be represented by a low number of dominant components. Depending on the required level of detail, leading parameter uncertainties with dimensions of 𝒪(10⁶) can be represented by a low number of about 10 ensemble members. This demonstrates the suitability of the algorithm for efficient ensemble generation for high-dimensional atmospheric chemical parameters.
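As a minimal, hypothetical illustration of step (3), the sketch below applies a truncated Karhunen–Loève expansion to just two correlated parameters with variance s2 and correlation rho (names and values are illustrative, not from the study). For strongly correlated parameters the leading eigenmode alone captures most of the total variance, which is why a small number of perturbations can suffice.

```python
import math
import random

def kl_perturbations(s2, rho, n_members, n_modes=1, seed=0):
    """Draw ensemble perturbations from a truncated Karhunen-Loeve expansion.

    For two parameters with equal variance s2 and correlation rho, the
    covariance eigenpairs are analytic: lambda_1 = s2*(1+rho) along
    (1,1)/sqrt(2) and lambda_2 = s2*(1-rho) along (1,-1)/sqrt(2).
    """
    rng = random.Random(seed)
    lams = [s2 * (1 + rho), s2 * (1 - rho)]
    vecs = [(1 / math.sqrt(2), 1 / math.sqrt(2)),
            (1 / math.sqrt(2), -1 / math.sqrt(2))]
    members = []
    for _ in range(n_members):
        xi = [rng.gauss(0, 1) for _ in range(n_modes)]
        # Truncated KL expansion: keep only the leading n_modes terms
        pert = [sum(math.sqrt(lams[k]) * xi[k] * vecs[k][i] for k in range(n_modes))
                for i in range(2)]
        members.append(pert)
    return members, lams

members, lams = kl_perturbations(s2=1.0, rho=0.8, n_members=10)
explained = lams[0] / sum(lams)  # variance fraction captured by the leading mode
```

With rho = 0.8 the leading mode already explains 90 % of the total variance, so truncating the expansion after one term loses little while halving the number of random draws per member.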


2021
Author(s):
Kevin J. Wischnewski
Simon B. Eickhoff
Viktor K. Jirsa
Oleksandr V. Popovych

Abstract. Simulating the resting-state brain dynamics via mathematical whole-brain models requires an optimal selection of parameters, which determine the model's capability to replicate empirical data. Since parameter optimization via a grid search (GS) becomes unfeasible for high-dimensional models, we evaluate several alternative approaches to maximize the correspondence between simulated and empirical functional connectivity. A dense GS serves as a benchmark to assess the performance of four optimization schemes: the Nelder-Mead Algorithm (NMA), Particle Swarm Optimization (PSO), Covariance Matrix Adaptation Evolution Strategy (CMAES) and Bayesian Optimization (BO). To compare them, we employ an ensemble of coupled phase oscillators built upon individual empirical structural connectivity of 105 healthy subjects. We determine optimal model parameters from two- and three-dimensional parameter spaces and show that the overall fitting quality of the tested methods can compete with the GS. There are, however, marked differences in the required computational resources and stability properties, which we also investigate before proposing CMAES and BO as efficient alternatives to a high-dimensional GS. For the three-dimensional case, these methods generated results similar to those of the GS, but within less than 6% of the computation time. Our results contribute to an efficient validation of models for personalized simulations of brain dynamics.
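The cost argument against a dense GS can be made concrete with a toy sketch (the objective and budgets below are illustrative, and random search merely stands in for the cheaper optimizers tested in the study): a dense grid needs resolution**n_dims evaluations, which grows exponentially with dimension, while a budget-limited search samples far fewer points.

```python
import itertools
import random

# Toy stand-in for a goodness-of-fit measure: higher is better, peak at (0.5, 0.3).
def fit(params):
    x, y = params
    return 1.0 - (x - 0.5) ** 2 - (y - 0.3) ** 2

# Dense grid search: cost is resolution**n_dims evaluations (21**2 = 441 here;
# the same resolution in three dimensions would already need 9261).
grid = [i / 20 for i in range(21)]
gs_points = list(itertools.product(grid, repeat=2))
gs_best = max(gs_points, key=fit)

# Budget-limited random search as a cheap baseline optimizer: ~14 % of the GS budget.
rng = random.Random(1)
rs_points = [(rng.random(), rng.random()) for _ in range(60)]
rs_best = max(rs_points, key=fit)
```

Dedicated schemes such as NMA, PSO, CMAES or BO spend a similarly small budget far more deliberately than random sampling, which is why they can match a dense GS at a fraction of its cost.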


Entropy
2021
Vol 23 (4)
pp. 399
Author(s):
Anna Pajor

Formal Bayesian comparison of two competing models, based on the posterior odds ratio, amounts to estimating the Bayes factor, which equals the ratio of the two models' marginal data densities. In models with a large number of parameters and/or latent variables, these densities are given by high-dimensional integrals, which are often computationally infeasible to evaluate. Therefore, other methods of evaluating the Bayes factor are needed. In this paper, a new method of estimating the Bayes factor is proposed. Simulation examples confirm the good performance of the proposed estimators. Finally, these new estimators are used to formally compare different hybrid Multivariate Stochastic Volatility–Multivariate Generalized Autoregressive Conditional Heteroskedasticity (MSV-MGARCH) models, which have a large number of latent variables. The empirical results show, among other things, that the validity of reducing the hybrid MSV-MGARCH model to the MGARCH specification depends on the analyzed data set as well as on prior assumptions about model parameters.
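For intuition about what the Bayes factor is, the following toy example (a coin-flip comparison unrelated to the MSV-MGARCH setting) computes it exactly from the two marginal data densities. This is only possible here because the marginalization integrals are analytic; in the high-dimensional models discussed above they are not, which is exactly what motivates the estimators proposed in the paper.

```python
from math import factorial

# Toy data: a particular sequence of n coin flips containing k heads.
# M1: fair coin (p = 0.5)      -> marginal likelihood 0.5**n
# M2: p ~ Uniform(0, 1)        -> marginal likelihood
#     integral of p**k * (1-p)**(n-k) dp = k! (n-k)! / (n+1)!
def bayes_factor(k, n):
    m1 = 0.5 ** n
    m2 = factorial(k) * factorial(n - k) / factorial(n + 1)
    return m1 / m2  # BF_12: evidence for the fair coin over the flexible model

bf = bayes_factor(k=6, n=10)  # mildly unbalanced data
```

With 6 heads in 10 flips the factor favors the simpler fair-coin model, while extreme data such as 10 heads in 10 flips flips the evidence toward the flexible model; the Bayes factor thus penalizes unneeded flexibility automatically.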


2019
Vol 79 (11)
Author(s):
Sascha Caron
Tom Heskes
Sydney Otten
Bob Stienen

Abstract. Constraining the parameters of physical models with more than 5-10 parameters is a widespread problem in fields like particle physics and astronomy. The generation of data to explore this parameter space often requires large amounts of computational resources. The commonly used solution of reducing the number of relevant physical parameters hampers the generality of the results. In this paper we show that this problem can be alleviated by the use of active learning. We illustrate this with examples from high energy physics, a field where simulations are often expensive and parameter spaces are high-dimensional. We show that the active learning techniques query-by-committee and query-by-dropout-committee allow for the identification of model points in interesting regions of high-dimensional parameter spaces (e.g. around decision boundaries). This makes it possible to constrain model parameters more efficiently than is currently done with the most common sampling algorithms and to train better performing machine learning models on the same amount of data. Code implementing the experiments in this paper can be found on GitHub.
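A minimal query-by-committee sketch in a 1-D toy setting (the data, the jackknife committee construction, and the threshold learner are all illustrative assumptions, not the paper's setup) shows the mechanism: the committee's votes split most at points near the decision boundary, so those points are queried first.

```python
def fit_threshold(labeled):
    # Toy learner: put the boundary midway between the largest class-0
    # and smallest class-1 example.
    lo = max(x for x, y in labeled if y == 0)
    hi = min(x for x, y in labeled if y == 1)
    return (lo + hi) / 2

def query_by_committee(labeled, pool):
    # Jackknife committee: one member per left-out labeled example, so each
    # member sees a slightly different dataset and learns a slightly
    # different boundary.
    committee = [fit_threshold(labeled[:i] + labeled[i + 1:])
                 for i in range(len(labeled))]

    def disagreement(x):
        votes = sum(x > t for t in committee)
        return min(votes, len(committee) - votes)  # largest for an even split

    return max(pool, key=disagreement)

labeled = [(0.1, 0), (0.2, 0), (0.3, 0), (0.7, 1), (0.8, 1), (0.9, 1)]
pool = [0.05, 0.5, 0.95]
queried = query_by_committee(labeled, pool)
```

The committee agrees unanimously on the extreme pool points but splits on the point between the two classes, so that point is selected; in the paper's setting the analogous points are model points near the exclusion boundary of the physical parameter space.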


Sensors
2021
Vol 21 (4)
pp. 1280
Author(s):
Hyeonseok Lee
Sungchan Kim

Explaining the predictions of deep neural networks makes them more understandable and trusted, enabling their use in various mission-critical tasks. Recent progress in the learning capability of networks has primarily been due to the enormous number of model parameters, so that it is usually hard to interpret their operations, as opposed to classical white-box models. For this purpose, generating saliency maps is a popular approach to identifying the important input features used for the model prediction. Existing explanation methods typically use only the output of the last convolution layer of the model to generate a saliency map, lacking the information included in intermediate layers. The corresponding explanations are therefore coarse and of limited accuracy. Although accuracy can be improved by iteratively refining a saliency map, this is too time-consuming to be practical. To address these problems, we propose a novel approach to explaining the model prediction by developing an attentive surrogate network using knowledge distillation. The surrogate network aims to generate a fine-grained saliency map corresponding to the model prediction using meaningful regional information presented over all network layers. Experiments demonstrated that the saliency maps are the result of spatially attentive features learned from the distillation, making them useful for fine-grained classification tasks. Moreover, the proposed method runs at 24.3 frames per second, which is orders of magnitude faster than existing methods.
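The idea of enriching a coarse last-layer map with intermediate-layer detail can be sketched as a weighted combination of feature maps. This is a generic saliency-combination sketch with toy arrays and weights, not the paper's surrogate network:

```python
def combine_maps(feature_maps, weights):
    """Weighted sum of same-sized 2-D feature maps, then ReLU and normalization
    to [0, 1], as is common when turning activations into a saliency map."""
    h, w = len(feature_maps[0]), len(feature_maps[0][0])
    out = [[0.0] * w for _ in range(h)]
    for fmap, wk in zip(feature_maps, weights):
        for i in range(h):
            for j in range(w):
                out[i][j] += wk * fmap[i][j]
    clipped = [[max(v, 0.0) for v in row] for row in out]
    peak = max(max(row) for row in clipped) or 1.0  # avoid division by zero
    return [[v / peak for v in row] for row in clipped]

coarse = [[1.0, 1.0], [0.0, 0.0]]  # e.g. last-layer evidence: broad, low detail
fine   = [[1.0, 0.0], [0.0, 0.0]]  # e.g. intermediate-layer evidence: sharper
saliency = combine_maps([coarse, fine], [0.5, 0.5])
```

The combined map keeps the coarse region but sharpens it where the finer layer concentrates its activation, which is the effect the surrogate network learns end to end rather than by a fixed weighting.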


Energies
2020
Vol 13 (17)
pp. 4290
Author(s):
Dongmei Zhang
Yuyang Zhang
Bohou Jiang
Xinwei Jiang
Zhijiang Kang

Reservoir history matching is a well-known inverse problem for production prediction, in which the numerous uncertain parameters of a reservoir numerical model are optimized by minimizing the misfit between the simulated and historical production data. The Gaussian Process (GP) has shown promising performance for assisted history matching because it is an efficient nonparametric, nonlinear model with few model parameters to be tuned automatically. The recently introduced approach combining Gaussian Process proxy models with Variogram Analysis of Response Surface-based sensitivity analysis (GP-VARS) uses forward and inverse GP-based proxy models together with the VARS-based sensitivity analysis to optimize the high-dimensional reservoir parameters. However, the inverse GP solution (GPIS) in GP-VARS is unsatisfactory, especially for numerous reservoir parameters, because the mapping from low-dimensional misfits to high-dimensional uncertain reservoir parameters can be poorly modeled by a GP. To improve the performance of GP-VARS, in this paper we propose Gaussian Process proxy models with Latent Variable Models and VARS-based sensitivity analysis (GPLVM-VARS), in which a Gaussian Process Latent Variable Model (GPLVM)-based inverse solution (GPLVMIS) replaces the GP-based GPIS, with the inputs and outputs of GPIS reversed. The experimental results demonstrate the effectiveness of the proposed GPLVM-VARS in terms of accuracy and complexity. The source code of the proposed GPLVM-VARS is available at https://github.com/XinweiJiang/GPLVM-VARS.
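A forward GP proxy of the kind GP-VARS builds can be sketched in a few lines of pure Python: fit a GP with an RBF kernel to a few (parameter, misfit) pairs and predict the misfit at new parameter values. The data, kernel length scale, and noise level below are illustrative assumptions, not values from the paper.

```python
import math

def rbf(a, b, length=1.0):
    # Squared-exponential (RBF) kernel for scalar inputs.
    return math.exp(-0.5 * ((a - b) / length) ** 2)

def solve(A, b):
    # Naive Gaussian elimination with partial pivoting (fine for tiny systems).
    n = len(A)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[piv] = M[piv], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (M[r][n] - sum(M[r][c] * x[c] for c in range(r + 1, n))) / M[r][r]
    return x

def gp_predict(x_train, y_train, x_new, noise=1e-6):
    # GP posterior mean: k_*^T (K + noise*I)^{-1} y
    K = [[rbf(a, b) + (noise if i == j else 0.0)
          for j, b in enumerate(x_train)] for i, a in enumerate(x_train)]
    alpha = solve(K, y_train)
    return sum(rbf(x_new, xi) * ai for xi, ai in zip(x_train, alpha))

x_train, y_train = [0.0, 1.0, 2.0], [0.0, 1.0, 4.0]  # toy (parameter, misfit) pairs
pred = gp_predict(x_train, y_train, 1.0)
```

The proxy is cheap to evaluate once fitted, which is what makes GP surrogates attractive when every true misfit evaluation requires a full reservoir simulation; the GPLVM variant addresses the harder inverse direction, mapping misfits back to parameters.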


Author(s):
Marvin Hardt
Thomas Bergs

Abstract. Analyzing the chip formation process by means of the finite element method (FEM) is an established procedure for understanding the cutting process. For a realistic simulation, different input models are required, among which the material model is crucial. To determine the underlying material model parameters, inverse methods have found increasing acceptance within the last decade. The calculated model parameters exhibit good validity within the domain of investigation but suffer from non-uniqueness. To overcome this drawback, the literature suggests either enlarging the domain of experimental investigation or using more process observables as validation parameters. This paper presents a novel approach merging both suggestions: a fully automated procedure in conjunction with multiple process observables is used to investigate the non-uniqueness of material model parameters in the domain of cutting simulations. The underlying approach is twofold: first, the accuracy of the process observables evaluated from FE simulations is enhanced by establishing an automated routine. Second, the number of process observables considered in the inverse approach is increased. For this purpose, the cutting force, cutting normal force, chip temperature, chip thickness, and chip radius are taken into account. It was shown that multiple parameter sets of the material model can result in almost identical simulation results in terms of the simulated process observables and the local material loads.
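The non-uniqueness finding can be illustrated with a toy flow-stress law sigma = A + B * eps**n (the numbers below are hypothetical, not the paper's material model): two parameter sets with very different hardening exponents are forced to agree at the ends of a limited strain range and then differ by well under 1 % everywhere inside it, so an inverse fit against data from that range cannot distinguish them.

```python
def sigma(A, B, n, eps):
    # Simple power-law flow stress (illustrative stand-in for a material model).
    return A + B * eps ** n

lo, hi = 0.1, 0.3                  # strain range covered by the "experiment"
A1, B1, n1 = 300.0, 500.0, 0.30    # reference parameter set
n2 = 0.50                          # force a very different hardening exponent

# Choose A2, B2 so the second curve matches the first at both ends of the range:
B2 = (sigma(A1, B1, n1, hi) - sigma(A1, B1, n1, lo)) / (hi ** n2 - lo ** n2)
A2 = sigma(A1, B1, n1, lo) - B2 * lo ** n2

strains = [lo + (hi - lo) * k / 100 for k in range(101)]
max_rel_dev = max(abs(sigma(A2, B2, n2, e) - sigma(A1, B1, n1, e))
                  / sigma(A1, B1, n1, e) for e in strains)
```

Enlarging the strain range or adding independent observables (temperatures, chip geometry) breaks this degeneracy, which is precisely the paper's argument for combining both suggestions.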


2021
Vol 99 (Supplement_3)
pp. 305-307
Author(s):
Andre C Araujo
Leonardo Gloria
Paulo Abreu
Fabyano Silva
Marcelo Rodrigues
...  

Abstract. Hamiltonian Monte Carlo (HMC) is an algorithm of the Markov chain Monte Carlo (MCMC) family that uses Hamiltonian dynamics to propose samples following a target distribution. This algorithm enables a more effective and consistent exploration of the target distribution and handles correlated parameters better. Therefore, Bayesian-HMC is a promising alternative for estimating individual parameters of complex functions such as nonlinear models, especially when using small datasets. Our objective was to estimate genetic parameters for milk traits defined based on nonlinear model parameters predicted using the Bayesian-HMC algorithm. A total of 64,680 milk yield test-day records from 2,624 first, second, and third lactations of Saanen and Alpine goats were used. First, the Wood model was fitted to the data. Second, lactation persistency (LP), peak time (PT), peak yield (PY), and total milk yield [estimated from zero to 50 (TMY50), 100 (TMY100), 150 (TMY150), 200 (TMY200), 250 (TMY250), and 300 (TMY300) days in milk] were predicted for each animal and parity based on the output of the first step (the individual phenotypic parameters of the Wood model). Thereafter, these predicted phenotypes were used to estimate genetic parameters for each trait. In general, the heritability estimates across lactations ranged from 0.10 to 0.20 for LP, 0.04 to 0.07 for PT, 0.26 to 0.27 for PY, and 0.21 to 0.28 for TMY (considering the different intervals). Lower heritabilities were obtained for the nonlinear function parameters (A, b and l) than for the traits predicted from them (except PT), especially for the first and second lactations (range: 0.09 to 0.18). Higher heritability estimates were obtained for the third-lactation traits. To the best of our knowledge, this study is the first attempt to use the HMC algorithm to fit a nonlinear model in animal breeding. The two-step method proposed here allowed us to estimate genetic parameters for all traits evaluated.
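The phenotype-prediction step (the second step above) can be sketched for the Wood curve y(t) = a * t**b * exp(-c * t), with illustrative parameter values rather than the study's estimates: peak time b/c and peak yield follow analytically from setting dy/dt = 0, and total milk yield is obtained by numerical integration.

```python
import math

def wood(a, b, c, t):
    # Wood lactation curve: incomplete-gamma-shaped daily yield.
    return a * t ** b * math.exp(-c * t)

def predicted_traits(a, b, c, days=300, step=0.5):
    pt = b / c                    # peak time: where dy/dt = 0
    py = wood(a, b, c, pt)        # peak yield: the curve evaluated at peak time
    # Total milk yield over [0, days] by the trapezoidal rule.
    n = int(days / step)
    ts = [k * step for k in range(n + 1)]
    tmy = sum((wood(a, b, c, t0) + wood(a, b, c, t1)) / 2 * step
              for t0, t1 in zip(ts, ts[1:]))
    return pt, py, tmy

# Hypothetical Wood parameters for one animal and parity (not from the study):
pt, py, tmy = predicted_traits(a=2.0, b=0.25, c=0.004)
```

In the study these traits are computed per animal and parity from the individually fitted (HMC-sampled) Wood parameters and then passed to the genetic-parameter estimation, which is why the approach is described as two-step.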

