scholarly journals Correcting a bias in a climate model with an augmented emulator

2020 ◽  
Vol 13 (5) ◽  
pp. 2487-2509
Author(s):  
Doug McNeall ◽  
Jonny Williams ◽  
Richard Betts ◽  
Ben Booth ◽  
Peter Challenor ◽  
...  

Abstract. A key challenge in developing flagship climate model configurations is the process of setting uncertain input parameters at values that lead to credible climate simulations. Setting these parameters traditionally relies heavily on insights from those involved in parameterisation of the underlying climate processes. Given the many degrees of freedom and computational expense involved in evaluating such a selection, this can be imperfect leaving open questions about whether any subsequent simulated biases result from mis-set parameters or wider structural model errors (such as missing or partially parameterised processes). Here, we present a complementary approach to identifying plausible climate model parameters, with a method of bias correcting subcomponents of a climate model using a Gaussian process emulator that allows credible values of model input parameters to be found even in the presence of a significant model bias. A previous study (McNeall et al., 2016) found that a climate model had to be run using land surface input parameter values from very different, almost non-overlapping, parts of parameter space to satisfactorily simulate the Amazon and other forests respectively. As the forest fraction of modelled non-Amazon forests was broadly correct at the default parameter settings and the Amazon too low, that study suggested that the problem most likely lay in the model's treatment of non-plant processes in the Amazon region. This might be due to modelling errors such as missing deep rooting in the Amazon in the land surface component of the climate model, to a warm–dry bias in the Amazon climate of the model or a combination of both. In this study, we bias correct the climate of the Amazon in the climate model from McNeall et al. (2016) using an “augmented” Gaussian process emulator, where temperature and precipitation, variables usually regarded as model outputs, are treated as model inputs alongside land surface input parameters. A sensitivity analysis finds that the forest fraction is nearly as sensitive to climate variables as it is to changes in its land surface parameter values. Bias correcting the climate in the Amazon region using the emulator corrects the forest fraction to tolerable levels in the Amazon at many candidates for land surface input parameter values, including the default ones, and increases the valid input space shared with the other forests. We need not invoke a structural model error in the land surface model, beyond having too dry and hot a climate in the Amazon region. The augmented emulator allows bias correction of an ensemble of climate model runs and reduces the risk of choosing poor parameter values because of an error in a subcomponent of the model. We discuss the potential of the augmented emulator to act as a translational layer between model subcomponents, simplifying the process of model tuning when there are compensating errors and helping model developers discover and prioritise model errors to target.

2019 ◽  
Author(s):  
Doug McNeall ◽  
Jonny Williams ◽  
Richard Betts ◽  
Ben Booth ◽  
Peter Challenor ◽  
...  

Abstract. A key challenge in developing flagship climate model configurations is the process of setting uncertain input parameters at values that lead to credible climate simulations. Setting these parameters traditionally relies heavily on insights from those involved in parameterisation of the underlying climate processes. Given the many degrees of freedom and computational expense involved in evaluating such a selection, this can be imperfect leaving open questions about whether any subsequent simulated biases result from mis-set parameters or wider structural model errors (such as missing or partially parameterised processes). Here we present a complementary approach to identifying plausible climate model parameters, with a method of bias correcting subcomponents of a climate model using a Gaussian process emulator that allows credible values of model input parameters to be found even in the presence of a significant model bias. A previous study (McNeall et al., 2016) found that a climate model had to be run using land surface input parameter values from very different, almost non-overlapping parts of parameter space to satisfactorily simulate the Amazon and other forests respectively. As the forest fraction of modelled non-Amazon forests was broadly correct at the default parameter settings and the Amazon too low, that study suggested that the problem most likely lay in the model's treatment of non-plant processes in the Amazon region. This might be due to (1) modelling errors such as missing deep-rooting in the Amazon in the land surface component of the climate model, (2) a warm-dry bias in the Amazon climate of the model, or a combination of both. In this study we bias correct the climate of the Amazon in a climate model using an augmented Gaussian process emulator, where temperature and precipitation, variables usually regarded as model outputs, are treated as model inputs alongside regular land surface input parameters. A sensitivity analysis finds that the forest fraction is nearly as sensitive to climate variables as changes in its land surface parameter values. Bias correcting the climate in the Amazon region using the emulator corrects the forest fraction to tolerable levels in the Amazon at many candidates for land surface input parameter values, including the default ones, and increases the valid input space shared with the other forests. We need not invoke a structural model error in the land surface model, beyond having too dry and hot a climate in the Amazon region. The augmented emulator allows bias correction of an ensemble of climate model runs and reduces the risk of choosing poor parameter values because of an error in a subcomponent of the model. We discuss the potential of the augmented emulator to act as a translational layer between model subcomponents, simplifying the process of model tuning when there are compensating errors, and helping model developers discover and prioritise model errors to target.


2021 ◽  
Author(s):  
Jonas Voigt ◽  
Keith-Noah Jurke ◽  
Julius Schultz ◽  
Ulrich Römer ◽  
Jens Friedrichs

Abstract In this work, we consider a parallel compressor model (PCM), which decomposes a compressor encountering non-uniform inflow into a distorted and an undistorted subcompressor, respectively, to determine its overall operating point. The main advantage of PCM modeling is a significantly reduced computational workload. At the same time, modeling errors are introduced, which need to be quantified together with model input uncertainties. Therefore, we introduce a probabilistic setting where unknown parameters are modeled as random variables. We carry out a global sensitivity analysis, which allows to reduce the complexity of the probabilistic model, by setting unimportant input parameters to their nominal values. This analysis attributes portions of the model output variance (the fan efficiency for instance) to particular input parameters or input parameter combinations, through so-called Sobol coefficients. We further include a parameter describing the PCM inflow averaging process into the analysis, which allows to determine the influence of specific modeling choices onto the predicted efficiency. Efficient sampling methods are needed to estimate the sensitivity coefficients with a reasonable computational effort. A key advantage of the global approach is that nonlinear effects are fully taken into account, the necessity of which will be demonstrated by our numerical examples. The model is also compared to CFD reference simulations to quantify structural model errors. This comparison is based on area validation metrics comparing the stochastic distribution functions of the probabilistic PCM model and the reference data.


2005 ◽  
Vol 6 (6) ◽  
pp. 1063-1072 ◽  
Author(s):  
Steven A. Margulis ◽  
Jongyoun Kim ◽  
Terri Hogue

Abstract Future operational frameworks for estimating surface turbulent fluxes over the necessary spatial and temporal scales will undoubtedly require the use of remote sensing products. Techniques used to estimate surface fluxes from radiometric surface temperature generally fall into two categories: retrieval-based and data assimilation approaches. Up to this point, there has been little comparison between retrieval- and assimilation-based techniques. In this note, the triangle retrieval method is compared to a variational data assimilation approach for estimating surface turbulent fluxes from radiometric surface temperature observations. Results from a set of synthetic experiments and an application using real data from the First International Satellite Land Surface Climatology Project (ISLSCP) Field Experiment (FIFE) site indicate that the assimilation approach performs slightly better than the triangle method because of the robustness of the estimation to measurement errors and parsimony of the system model, which leads to fewer sources of structural model errors. Future comparison work using retrieval and data assimilation algorithms will provide more insight into the optimal approach for diagnosis of land surface fluxes using remote sensing observations.


2014 ◽  
Vol 28 (1) ◽  
pp. 272-291 ◽  
Author(s):  
Daniela Dalmonech ◽  
Sönke Zaehle ◽  
Gregor J. Schürmann ◽  
Victor Brovkin ◽  
Christian Reick ◽  
...  

Abstract The capacity of earth system models (ESMs) to make reliable projections of future atmospheric CO2 and climate is strongly dependent on the ability of the land surface model to adequately simulate the land carbon (C) cycle. Defining “adequate” performance of the land model requires an understanding of the contributions of climate model and land model errors to the land C cycle. Here, a benchmarking framework is applied based on significant, observed characteristics of the land C cycle for the contemporary period, for which sufficient evaluation data are available, to test the ability of the JSBACH land surface component of the Max Planck Institute Earth System Model (MPI-ESM) to simulate land C trends. Particular attention is given to the role of potential effects caused by climate biases, and therefore investigation is made of the results of model configurations in which JSBACH is interactively “coupled” to atmosphere and ocean components and of an “uncoupled” configuration, where JSBACH is driven by reconstructed meteorology. The ability of JSBACH to simulate the observed phase of phenology and seasonal C fluxes is not strongly affected by climate biases. Contrarily, noticeable differences in the simulated gross primary productivity and land C stocks emerge between coupled and uncoupled configurations, leading to significant differences in the decadal terrestrial C balance and its sensitivity to climate. These differences are strongly controlled by climate biases of the MPI-ESM, in particular those affecting soil moisture. To effectively characterize model performance, the potential effects of climate biases on the land C dynamics need to be considered during the development and calibration of land surface models.


2020 ◽  
Author(s):  
Jonas Van Breedam ◽  
Philippe Huybrechts ◽  
Michel Crucifix

<p>Fully coupled state-of-the-art Atmosphere-Ocean General Circulation Models are the best tool to investigate feedbacks between the different components of the climate system on a decadal to centennial timescale. On millennial to multi-millennial timescales, Earth System Models of Intermediate Complexity are used to explore the feedbacks in the climate system between the ice sheets, the atmosphere and the ocean. Those fully coupled models, even at coarser resolution, are computationally very expensive and other techniques have been proposed to simulate ice sheet-climate interactions on a million-year timescale. The asynchronous coupling technique proposes to run a climate model for a few decades and subsequently an ice sheet model for a few millennia. Another, more efficient method is the use of a matrix look-up table where climate model runs are performed for end-members and intermediate climatic states are linearly interpolated.</p><p>In this study, a novel coupling approach is presented where a Gaussian Process emulator applied to the climate model HadSM3 is coupled to the ice sheet model AISMPALEO. We have tested the sensitivity of the formulation of the ice sheet parameter and of the coupling time to the evolution of the ice sheet over time. Additionally, we used different lapse rate adjustments between the relatively coarse climate model and the much finer ice sheet model topography. It is shown that the ice sheet evolution over a million year timescale is strongly sensitive to the choice of the coupling time and to the implementation of the lapse rate adjustment. With the new coupling procedure, we provide a more realistic and computationally efficient framework for ice sheet-climate interactions on a multi-million year timescale.</p><p> </p>


2016 ◽  
Author(s):  
Doug McNeall ◽  
Jonny Williams ◽  
Ben Booth ◽  
Richard Betts ◽  
Peter Challenor ◽  
...  

Abstract. We use observations of forest fraction to constrain carbon cycle and land surface input parameters of the reduced resolution global climate model, FAMOUS. Using a history matching approach along with a computationally cheap statistical proxy (emulator) of the climate model, we compare an ensemble of simulations of forest fraction with observations, and rule out parameter settings where the forests are poorly simulated. Regions of parameter space where FAMOUS best simulates the Amazon forest fraction are incompatible with the regions where FAMOUS best simulates other forests, indicating a structural error in the model. Using observations of the Amazon forest to constrain input parameters leads to very different conclusions about the acceptable values of input parameters than using the other forests. We characterise the structural model discrepancy, and explore the consequences of ignoring it in a history matching exercise. We use sensitivity analysis to find the parameters which have most impact on simulator error. We use the emulator to simulate the forest fraction at the best set of parameters implied by matching the model to the Amazon, and to other major forests in turn. We can find parameters that lead to a realistic forest fraction in the Amazon, but using the Amazon alone to tune the simulator would result in a significant overestimate of forest fraction in the other forests. Conversely, using the other forests to calibrate the model leads to a larger underestimate of the Amazon forest fraction. Finally, we perform a history matching exercise using credible estimates for simulator discrepancy and observational uncertainty terms. We are unable to constrain the parameters individually, but just under half of joint parameter space is ruled out as being incompatible with forest observations. We discuss the possible sources of the discrepancy in the simulated Amazon, including missing processes in the land surface component, and a bias in the climatology of the Amazon.


2021 ◽  
Author(s):  
Jonas Van Breedam ◽  
Philippe Huybrechts ◽  
Michel Crucifix

Abstract. On multi-million year timescales, fully coupled ice sheet – climate simulations are hampered by computational limitations, even at coarser resolutions and when considering asynchronous coupling schemes. In this study, a novel coupling method CLISEMv1.0 (CLimate-Ice Sheet EMulator version 1.0) is presented where a Gaussian process emulator is applied to the climate model HadSM3 coupled to the ice sheet model AISMPALEO. The temperature and precipitation fields from HadSM3 are emulated to feed the mass balance model from AISMPALEO. The sensitivity of the evolution of the ice sheet over time is tested to the number of predefined ice sheet geometries the emulator is calibrated on, to the formulation of the ice sheet parameter (being either ice sheet volume, either ice sheet area, or both) and to the coupling time. Sensitivity experiments are conducted to explore the uncertainty introduced by the emulator. Additionally, different lapse rate adjustments are used between the relatively coarse climate model and the much finer ice sheet model topography. It is shown that the ice sheet evolution over a million-year timescale is strongly sensitive to the definition of the ice sheet parameter and to the number of predefined ice sheet geometries. With the new coupling procedure, we provide a computationally efficient framework for simulating ice sheet-climate interactions on a multi-million year timescale that allows for a large number of sensitivity tests.


2014 ◽  
Vol 7 (1) ◽  
pp. 1197-1244
Author(s):  
A. Valade ◽  
P. Ciais ◽  
N. Vuichard ◽  
N. Viovy ◽  
N. Huth ◽  
...  

Abstract. Agro-Land Surface Models (agro-LSM) have been developed from the integration of specific crop processes into large-scale generic land surface models that allow calculating the spatial distribution and variability of energy, water and carbon fluxes within the soil-vegetation-atmosphere continuum. When developing agro-LSM models, a particular attention must be given to the effects of crop phenology and management on the turbulent fluxes exchanged with the atmosphere, and the underlying water and carbon pools. A part of the uncertainty of Agro-LSM models is related to their usually large number of parameters. In this study, we quantify the parameter-values uncertainty in the simulation of sugar cane biomass production with the agro-LSM ORCHIDEE-STICS, using a multi-regional approach with data from sites in Australia, La Réunion and Brazil. In ORCHIDEE-STICS, two models are chained: STICS, an agronomy model that calculates phenology and management, and ORCHIDEE, a land surface model that calculates biomass and other ecosystem variables forced by STICS' phenology. First, the parameters that dominate the uncertainty of simulated biomass at harvest date are determined through a screening of 67 different parameters of both STICS and ORCHIDEE on a multi-site basis. Secondly, the uncertainty of harvested biomass attributable to those most sensitive parameters is quantified and specifically attributed to either STICS (phenology, management) or to ORCHIDEE (other ecosystem variables including biomass) through distinct Monte-Carlo runs. The uncertainty on parameter values is constrained using observations by calibrating the model independently at seven sites. In a third step, a sensitivity analysis is carried out by varying the most sensitive parameters to investigate their effects at continental scale. A Monte-Carlo sampling method associated with the calculation of Partial Ranked Correlation Coefficients is used to quantify the sensitivity of harvested biomass to input parameters on a continental scale across the large regions of intensive sugar cane cultivation in Australia and Brazil. Ten parameters driving most of the uncertainty in the ORCHIDEE-STICS modeled biomass at the 7 sites are identified by the screening procedure. We found that the 10 most sensitive parameters control phenology (maximum rate of increase of LAI) and root uptake of water and nitrogen (root profile and root growth rate, nitrogen stress threshold) in STICS, and photosynthesis (optimal temperature of photosynthesis, optimal carboxylation rate), radiation interception (extinction coefficient), and transpiration and respiration (stomatal conductance, growth and maintenance respiration coefficients) in ORCHIDEE. We find that the optimal carboxylation rate and photosynthesis temperature parameters contribute most to the uncertainty in harvested biomass simulations at site scale. The spatial variation of the ranked correlation between input parameters and modeled biomass at harvest is well explained by rain and temperature drivers, suggesting climate-mediated different sensitivities of modeled sugar cane yield to the model parameters, for Australia and Brazil. This study reveals the spatial and temporal patterns of uncertainty variability for a highly parameterized agro-LSM and calls for more systematic uncertainty analyses of such models.


2014 ◽  
Vol 7 (3) ◽  
pp. 1225-1245 ◽  
Author(s):  
A. Valade ◽  
P. Ciais ◽  
N. Vuichard ◽  
N. Viovy ◽  
A. Caubel ◽  
...  

Abstract. Agro-land surface models (agro-LSM) have been developed from the integration of specific crop processes into large-scale generic land surface models that allow calculating the spatial distribution and variability of energy, water and carbon fluxes within the soil–vegetation–atmosphere continuum. When developing agro-LSM models, particular attention must be given to the effects of crop phenology and management on the turbulent fluxes exchanged with the atmosphere, and the underlying water and carbon pools. A part of the uncertainty of agro-LSM models is related to their usually large number of parameters. In this study, we quantify the parameter-values uncertainty in the simulation of sugarcane biomass production with the agro-LSM ORCHIDEE–STICS, using a multi-regional approach with data from sites in Australia, La Réunion and Brazil. In ORCHIDEE–STICS, two models are chained: STICS, an agronomy model that calculates phenology and management, and ORCHIDEE, a land surface model that calculates biomass and other ecosystem variables forced by STICS phenology. First, the parameters that dominate the uncertainty of simulated biomass at harvest date are determined through a screening of 67 different parameters of both STICS and ORCHIDEE on a multi-site basis. Secondly, the uncertainty of harvested biomass attributable to those most sensitive parameters is quantified and specifically attributed to either STICS (phenology, management) or to ORCHIDEE (other ecosystem variables including biomass) through distinct Monte Carlo runs. The uncertainty on parameter values is constrained using observations by calibrating the model independently at seven sites. In a third step, a sensitivity analysis is carried out by varying the most sensitive parameters to investigate their effects at continental scale. A Monte Carlo sampling method associated with the calculation of partial ranked correlation coefficients is used to quantify the sensitivity of harvested biomass to input parameters on a continental scale across the large regions of intensive sugarcane cultivation in Australia and Brazil. The ten parameters driving most of the uncertainty in the ORCHIDEE–STICS modeled biomass at the 7 sites are identified by the screening procedure. We found that the 10 most sensitive parameters control phenology (maximum rate of increase of LAI) and root uptake of water and nitrogen (root profile and root growth rate, nitrogen stress threshold) in STICS, and photosynthesis (optimal temperature of photosynthesis, optimal carboxylation rate), radiation interception (extinction coefficient), and transpiration and respiration (stomatal conductance, growth and maintenance respiration coefficients) in ORCHIDEE. We find that the optimal carboxylation rate and photosynthesis temperature parameters contribute most to the uncertainty in harvested biomass simulations at site scale. The spatial variation of the ranked correlation between input parameters and modeled biomass at harvest is well explained by rain and temperature drivers, suggesting different climate-mediated sensitivities of modeled sugarcane yield to the model parameters, for Australia and Brazil. This study reveals the spatial and temporal patterns of uncertainty variability for a highly parameterized agro-LSM and calls for more systematic uncertainty analyses of such models.


2007 ◽  
Vol 135 (12) ◽  
pp. 4077-4096 ◽  
Author(s):  
Jean-Christophe Golaz ◽  
Vincent E. Larson ◽  
James A. Hansen ◽  
David P. Schanen ◽  
Brian M. Griffin

Abstract Every cloud parameterization contains structural model errors. The source of these errors is difficult to pinpoint because cloud parameterizations contain nonlinearities and feedbacks. To elucidate these model inadequacies, this paper uses a general-purpose ensemble parameter estimation technique. In principle, the technique is applicable to any parameterization that contains a number of adjustable coefficients. It optimizes or calibrates parameter values by attempting to match predicted fields to reference datasets. Rather than striving to find the single best set of parameter values, the output is instead an ensemble of parameter sets. This ensemble provides a wealth of information. In particular, it can help uncover model deficiencies and structural errors that might not otherwise be easily revealed. The calibration technique is applied to an existing single-column model (SCM) that parameterizes boundary layer clouds. The SCM is a higher-order turbulence closure model. It is closed using a multivariate probability density function (PDF) that represents subgrid-scale variability. Reference datasets are provided by large-eddy simulations (LES) of a variety of cloudy boundary layers. The calibration technique locates some model errors in the SCM. As a result, empirical modifications are suggested. These modifications are evaluated with independent datasets and found to lead to an overall improvement in the SCM’s performance.


Sign in / Sign up

Export Citation Format

Share Document