Online Model Selection Based on the Variational Bayes

2001 ◽  
Vol 13 (7) ◽  
pp. 1649-1681 ◽  
Author(s):  
Masa-aki Sato

The Bayesian framework provides a principled way of model selection. This framework estimates a probability distribution over an ensemble of models, and the prediction is done by averaging over the ensemble of models. Accordingly, the uncertainty of the models is taken into account, and complex models with more degrees of freedom are penalized. However, integration over model parameters is often intractable, and some approximation scheme is needed. Recently, a powerful approximation scheme, called the variational bayes (VB) method, has been proposed. This approach defines the free energy for a trial probability distribution, which approximates a joint posterior probability distribution over model parameters and hidden variables. The exact maximization of the free energy gives the true posterior distribution. The VB method uses factorized trial distributions. The integration over model parameters can be done analytically, and an iterative expectation-maximization-like algorithm, whose convergence is guaranteed, is derived. In this article, we derive an online version of the VB algorithm and prove its convergence by showing that it is a stochastic approximation for finding the maximum of the free energy. By combining sequential model selection procedures, the online VB method provides a fully online learning method with a model selection mechanism. In preliminary experiments using synthetic data, the online VB method was able to adapt the model structure to dynamic environments.

Author(s):  
Luis D. Couto ◽  
Dong Zhang ◽  
Antti Aitio ◽  
Scott Moura ◽  
David Howey

Abstract This paper addresses the parameter estimation problem for lithium-ion battery pack models comprising cells in series. This valuable information can be exploited in fault diagnostics to estimate the number of cells that are exhibiting abnormal behaviour, e.g. large resistances or small capacities. In particular, we use a Bayesian approach to estimate the parameters of a two-cell arrangement modelled using equivalent circuits. Although our modeling framework has been extensively reported in the literature, its structural identifiability properties have not been reported yet to the best of the authors’ knowledge. Moreover, most contributions in the literature tackle the estimation problem through point-wise estimates assuming Gaussian noise using e.g. least-squares methods (maximum likelihood estimation) or Kalman filters (maximum a posteriori estimation). In contrast, we apply methods that are suitable for nonlinear and non-Gaussian estimation problems and estimate the full posterior probability distribution of the parameters. We study how the model structure, available measurements and prior knowledge of the model parameters impact the underlying posterior probability distribution that is recovered for the parameters. For two cells in series, a bimodal distribution is obtained whose modes are centered around the real values of the parameters for each cell. Therefore, bounds on the model parameters for a battery pack can be derived.


2017 ◽  
Vol 10 (11) ◽  
pp. 4079-4098 ◽  
Author(s):  
Anu Kauppi ◽  
Pekka Kolmonen ◽  
Marko Laine ◽  
Johanna Tamminen

Abstract. We discuss uncertainty quantification for aerosol-type selection in satellite-based atmospheric aerosol retrieval. The retrieval procedure uses precalculated aerosol microphysical models stored in look-up tables (LUTs) and top-of-atmosphere (TOA) spectral reflectance measurements to solve the aerosol characteristics. The forward model approximations cause systematic differences between the modelled and observed reflectance. Acknowledging this model discrepancy as a source of uncertainty allows us to produce more realistic uncertainty estimates and assists the selection of the most appropriate LUTs for each individual retrieval.This paper focuses on the aerosol microphysical model selection and characterisation of uncertainty in the retrieved aerosol type and aerosol optical depth (AOD). The concept of model evidence is used as a tool for model comparison. The method is based on Bayesian inference approach, in which all uncertainties are described as a posterior probability distribution. When there is no single best-matching aerosol microphysical model, we use a statistical technique based on Bayesian model averaging to combine AOD posterior probability densities of the best-fitting models to obtain an averaged AOD estimate. We also determine the shared evidence of the best-matching models of a certain main aerosol type in order to quantify how plausible it is that it represents the underlying atmospheric aerosol conditions.The developed method is applied to Ozone Monitoring Instrument (OMI) measurements using a multiwavelength approach for retrieving the aerosol type and AOD estimate with uncertainty quantification for cloud-free over-land pixels. Several larger pixel set areas were studied in order to investigate the robustness of the developed method. We evaluated the retrieved AOD by comparison with ground-based measurements at example sites. We found that the uncertainty of AOD expressed by posterior probability distribution reflects the difficulty in model selection. The posterior probability distribution can provide a comprehensive characterisation of the uncertainty in this kind of problem for aerosol-type selection. As a result, the proposed method can account for the model error and also include the model selection uncertainty in the total uncertainty budget.


2017 ◽  
Author(s):  
Anu Kauppi ◽  
Pekka Kolmonen ◽  
Marko Laine ◽  
Johanna Tamminen

Abstract. We discuss uncertainty quantification for aerosol type selection in satellite-based atmospheric aerosol retrieval. The retrieval procedure uses pre-calculated aerosol microphysical models stored in look-up tables (LUTs) and top of atmosphere spectral reflectance measurements to solve the aerosol characteristics. The forward model approximations cause systematic differences between the modeled and observed reflectance. Acknowledging this model discrepancy as a source of uncertainty allows us to produce more realistic uncertainty estimates and assists the selection of the most appropriate LUTs for each individual retrieval. This paper focuses on the aerosol microphysical model selection and characterization of uncertainty in the retrieved aerosol type and aerosol optical depth (AOD). The concept of model evidence is used as a tool for model comparison. The method is based on Bayesian inference approach where all uncertainties are described as a posterior probability distribution. When there is no single best matching aerosol microphysical model we use a statistical technique based on Bayesian model averaging to combine AOD posterior probability densities of the best fitting models to obtain an averaged AOD estimate. We also determine the shared evidence of the best matching models of a certain main aerosol type in order to quantify how plausible each main aerosol type is in representing the underlying atmospheric aerosol conditions. The developed method is applied to Ozone Monitoring Instrument (OMI) measurements using multi-wavelength approach for retrieving the aerosol type and AOD estimate with uncertainty quantification for cloud-free over-land pixels. Several larger pixel set areas were studied in order to investigate robustness of the developed method. We evaluated the retrieved AOD by comparison with ground-based measurements at example sites. We found that the uncertainty of AOD expressed by posterior probability distribution reflects the difficulty in model selection. The posterior probability distribution can provide a comprehensive characterization of the uncertainty in this kind of problem for aerosol type selection. As a result, the proposed method can account for the model error and also include the model selection uncertainty in the total uncertainty budget.


Author(s):  
Christopher J. Arthurs ◽  
Nan Xiao ◽  
Philippe Moireau ◽  
Tobias Schaeffter ◽  
C. Alberto Figueroa

AbstractA major challenge in constructing three dimensional patient specific hemodynamic models is the calibration of model parameters to match patient data on flow, pressure, wall motion, etc. acquired in the clinic. Current workflows are manual and time-consuming. This work presents a flexible computational framework for model parameter estimation in cardiovascular flows that relies on the following fundamental contributions. (i) A Reduced-Order Unscented Kalman Filter (ROUKF) model for data assimilation for wall material and simple lumped parameter network (LPN) boundary condition model parameters. (ii) A constrained least squares augmentation (ROUKF-CLS) for more complex LPNs. (iii) A “Netlist” implementation, supporting easy filtering of parameters in such complex LPNs. The ROUKF algorithm is demonstrated using non-invasive patient-specific data on anatomy, flow and pressure from a healthy volunteer. The ROUKF-CLS algorithm is demonstrated using synthetic data on a coronary LPN. The methods described in this paper have been implemented as part of the CRIMSON hemodynamics software package.


2013 ◽  
Vol 20 (6) ◽  
pp. 1071-1078 ◽  
Author(s):  
E. Piegari ◽  
R. Di Maio ◽  
A. Avella

Abstract. Reasonable prediction of landslide occurrences in a given area requires the choice of an appropriate probability distribution of recurrence time intervals. Although landslides are widespread and frequent in many parts of the world, complete databases of landslide occurrences over large periods are missing and often such natural disasters are treated as processes uncorrelated in time and, therefore, Poisson distributed. In this paper, we examine the recurrence time statistics of landslide events simulated by a cellular automaton model that reproduces well the actual frequency-size statistics of landslide catalogues. The complex time series are analysed by varying both the threshold above which the time between events is recorded and the values of the key model parameters. The synthetic recurrence time probability distribution is shown to be strongly dependent on the rate at which instability is approached, providing a smooth crossover from a power-law regime to a Weibull regime. Moreover, a Fano factor analysis shows a clear indication of different degrees of correlation in landslide time series. Such a finding supports, at least in part, a recent analysis performed for the first time of an historical landslide time series over a time window of fifty years.


2015 ◽  
Vol 23 (2) ◽  
pp. 102-111 ◽  
Author(s):  
Radosław Cellmer ◽  
Katarzyna Szczepankowska

Abstract The regularities and relations between real estate prices and the factors that shape them may be presented in the form of statistical models, thanks to which the diagnosis and prediction of prices is possible. A formal description of empirical observation presented in the form of regressive models also offers a possibility for creating certain phenomena in a virtual dimension. Market phenomena cannot be fully described with the use of determinist models, which clarify only a part of price variation. The predicted price is, in this situation, a special case of implementing a random function. Assuming that other implementations are also possible, regressive models may constitute a basis for simulation, which results in the procurement of a future image of the market. Simulation may refer both to real estate prices and transaction prices. The basis for price simulation may be familiarity with the structure of the analyzed market data. Assuming that this structure has a static character, simulation of real estate prices is performed on the basis of familiarity with the probability distribution and a generator of random numbers. The basis for price simulation is familiarity with model parameters and probability distribution of the random factor. The study presents the core and theoretical description of a transaction simulation on the real estate market, as well as the results of an experiment regarding transaction prices of office real estate located within the area of the city of Olsztyn. The result of the study is a collection of virtual real properties with known features and simulated prices, constituting a reflection of market processes which may take place in the near future. Comparison between the simulated characteristic and actual transactions in turn allows the correctness of the description of reality by the model to be verified.


2020 ◽  
Vol 17 (173) ◽  
pp. 20200886
Author(s):  
L. Mihaela Paun ◽  
Mitchel J. Colebank ◽  
Mette S. Olufsen ◽  
Nicholas A. Hill ◽  
Dirk Husmeier

This study uses Bayesian inference to quantify the uncertainty of model parameters and haemodynamic predictions in a one-dimensional pulmonary circulation model based on an integration of mouse haemodynamic and micro-computed tomography imaging data. We emphasize an often neglected, though important source of uncertainty: in the mathematical model form due to the discrepancy between the model and the reality, and in the measurements due to the wrong noise model (jointly called ‘model mismatch’). We demonstrate that minimizing the mean squared error between the measured and the predicted data (the conventional method) in the presence of model mismatch leads to biased and overly confident parameter estimates and haemodynamic predictions. We show that our proposed method allowing for model mismatch, which we represent with Gaussian processes, corrects the bias. Additionally, we compare a linear and a nonlinear wall model, as well as models with different vessel stiffness relations. We use formal model selection analysis based on the Watanabe Akaike information criterion to select the model that best predicts the pulmonary haemodynamics. Results show that the nonlinear pressure–area relationship with stiffness dependent on the unstressed radius predicts best the data measured in a control mouse.


2011 ◽  
Vol 391-392 ◽  
pp. 1017-1021
Author(s):  
Ru Zhang ◽  
Yan Fen Wu ◽  
Ping Hu

Six binary silane systems were chosen to calculate the activity coefficients (γ) and free energies of mixing (ΔGm). These systems included: methyldichlorosilane + methyltrichlorosilane, methyldichlorosilane + methylvinyldichlorosilane, methyldichlorosilane + toluene, methyltrichlorosilane + methylvinyldichlorosilane, methyltrichlorosilane + toluene, methylvinyldichlorosilane + toluene. Based on the Antoine constants, critical parameters of the pure components and Wilson model parameters, γ and ΔGmwere calculated. The influence factors of these thermodynamic properties were also discussed.


Sign in / Sign up

Export Citation Format

Share Document