Response to Interactive comment on “Consistent assimilation of multiple data streams in a carbon cycle data assimilation system” by Natasha MacBean et al. by Anonymous Referee #2

Abstract. Data assimilation methods provide a rigorous statistical framework for constraining the parametric uncertainty of land surface models (LSMs), with the aim of improving our predictive capability as well as identifying areas in which the models need improvement. The increase in the number of available datasets in recent years allows us to address different aspects of the model at a variety of spatial and temporal scales. However, combining data streams in a DA system is not a trivial task. In this study we highlight some of the challenges surrounding multiple data stream assimilation, with a particular focus on the carbon cycle component of LSMs. We examine the impact of biases and inconsistencies between the observations and the model (resulting in non Gaussian error distributions) and the impact of non-linearity in model dynamics. In addition we explore the differences between performing a simultaneous assimilation (in which all data streams are included in one optimisation) and a step-wise approach (in which each data steam is assimilated sequentially), given the assumptions inherent to the inversion algorithm chosen for this study. We demonstrate some of these issues by assimilating synthetic observations into two simple models: the first a simplified version of the carbon cycle processes represented in many LSMs, and the second a non-linear toy model. We further discuss these experimental results in the context of recent studies in the carbon cycle data assimilation literature, and finally we provide some perspectives and advice to other land surface modellers wishing to use multiple data streams to constrain their models.

Download Full-text

A new stepwise carbon cycle data assimilation system using multiple data streams to constrain the simulated land surface carbon cycle

Geoscientific Model Development ◽

10.5194/gmd-9-3321-2016 ◽

2016 ◽

Vol 9 (9) ◽

pp. 3321-3346 ◽

Cited By ~ 32

Author(s):

Philippe Peylin ◽

Cédric Bacour ◽

Natasha MacBean ◽

Sébastien Leonard ◽

Peter Rayner ◽

...

Keyword(s):

Data Assimilation ◽

Carbon Cycle ◽

Data Streams ◽

Land Surface ◽

Atmospheric Co2 ◽

Stepwise Approach ◽

Modis Ndvi ◽

Optimized Model ◽

Data Assimilation System ◽

Assimilation System

Abstract. Large uncertainties in land surface models (LSMs) simulations still arise from inaccurate forcing, poor description of land surface heterogeneity (soil and vegetation properties), incorrect model parameter values and incomplete representation of biogeochemical processes. The recent increase in the number and type of carbon cycle-related observations, including both in situ and remote sensing measurements, has opened a new road to optimize model parameters via robust statistical model–data integration techniques, in order to reduce the uncertainties of simulated carbon fluxes and stocks. In this study we present a carbon cycle data assimilation system that assimilates three major data streams, namely the Moderate Resolution Imaging Spectroradiometer (MODIS)-Normalized Difference Vegetation Index (NDVI) observations of vegetation activity, net ecosystem exchange (NEE) and latent heat (LE) flux measurements at more than 70 sites (FLUXNET), as well as atmospheric CO2 concentrations at 53 surface stations, in order to optimize the main parameters (around 180 parameters in total) of the Organizing Carbon and Hydrology in Dynamics Ecosystems (ORCHIDEE) LSM (version 1.9.5 used for the Coupled Model Intercomparison Project Phase 5 (CMIP5) simulations). The system relies on a stepwise approach that assimilates each data stream in turn, propagating the information gained on the parameters from one step to the next. Overall, the ORCHIDEE model is able to achieve a consistent fit to all three data streams, which suggests that current LSMs have reached the level of development to assimilate these observations. The assimilation of MODIS-NDVI (step 1) reduced the growing season length in ORCHIDEE for temperate and boreal ecosystems, thus decreasing the global mean annual gross primary production (GPP). Using FLUXNET data (step 2) led to large improvements in the seasonal cycle of the NEE and LE fluxes for all ecosystems (i.e., increased amplitude for temperate ecosystems). The assimilation of atmospheric CO2, using the general circulation model (GCM) of the Laboratoire de Météorologie Dynamique (LMDz; step 3), provides an overall constraint (i.e., constraint on large-scale net CO2 fluxes), resulting in an improvement of the fit to the observed atmospheric CO2 growth rate. Thus, the optimized model predicts a land C (carbon) sink of around 2.2 PgC yr−1 (for the 2000–2009 period), which is more compatible with current estimates from the Global Carbon Project (GCP) than the prior value. The consistency of the stepwise approach is evaluated with back-compatibility checks. The final optimized model (after step 3) does not significantly degrade the fit to MODIS-NDVI and FLUXNET data that were assimilated in the first two steps, suggesting that a stepwise approach can be used instead of the more “challenging” implementation of a simultaneous optimization in which all data streams are assimilated together. Most parameters, including the scalar of the initial soil carbon pool size, changed during the optimization with a large error reduction. This work opens new perspectives for better predictions of the land carbon budgets.

Download Full-text

A new step-wise Carbon Cycle Data Assimilation System using multiple data streams to constrain the simulated land surface carbon cycle

10.5194/gmd-2016-13 ◽

2016 ◽

Cited By ~ 1

Author(s):

P. Peylin ◽

C. Bacour ◽

N. MacBean ◽

S. Leonard ◽

P. J. Rayner ◽

...

Keyword(s):

Data Assimilation ◽

Carbon Cycle ◽

Data Streams ◽

Land Surface ◽

Atmospheric Co2 ◽

Model Parameters ◽

Modis Ndvi ◽

Optimized Model ◽

Data Assimilation System ◽

Assimilation System

Abstract. Large uncertainties in Land surface models (LSMs) simulations still arise from inaccurate forcing, incorrect model parameter values and incomplete representation of biogeochemical processes. The recent increase in the number and type of carbon cycle related observations, including both in situ and remote sensing measurements, has opened a new road to optimize model parameters via robust statistical model-data integration techniques, in order to reduce the simulated carbon fluxes and stocks uncertainties. In this study we present a Carbon Cycle Data Assimilation System (CCDAS) that assimilates three major data streams, namely MODIS-NDVI observations of vegetation activity, net ecosystem exchange (NEE) and latent heat (LE) flux measurements at more than 70 sites (FLUXNET), and atmospheric CO2 concentrations at 53 surface stations, in order to optimize the main parameters of the ORCHIDEE LSM (around 180 parameters in total). The system relies on a step-wise approach that assimilates each data stream in turn, propagating the information gained on the parameters from one step to the next. Overall, the ORCHIDEE model is able to achieve a consistent fit to all three data streams, which suggests that current LSMs have reached the level of development to assimilate these observations. The assimilation of MODIS-NDVI (step 1) reduced the growing season length in ORCHIDEE for temperate and boreal ecosystems, thus decreasing the global mean annual gross primary production (GPP). Using FLUXNET data (step 2) led to large improvements in the seasonal cycle of the NEE and LE fluxes for all ecosystems (i.e., increased amplitude for temperate ecosystems). The assimilation of atmospheric CO2, using the atmospheric transport model LMDz (step 3), provides an overall constraint (i.e., constraint on large scale net CO2 fluxes), resulting in an improvement of the fit to the observed atmospheric CO2 growth rate. Thus the optimized model predicts a land C sink of around 2.2 PgC.yr−1 (for the 2000–2009 period), which is more compatible with current estimates from the Global Carbon Project (GCP) than the prior value. The consistency of the step-wise approach is evaluated with backcompatibility checks. The final optimized model (after step 3) does not significantly degrade the fit to MODIS-NDVI and FLUXNET data that were assimilated in the first two steps, suggesting that a stepwise approach can be used instead of the more “challenging” implementation of a simultaneous optimization in which all data streams are assimilated together. Most parameters, including the scalar of the initial soil carbon pool size, changed during the optimization with a large error reduction. This work opens new perspectives for better predictions of the land carbon budgets.

Download Full-text

Consistent assimilation of multiple data streams in a carbon cycle data assimilation system

Geoscientific Model Development ◽

10.5194/gmd-9-3569-2016 ◽

2016 ◽

Vol 9 (10) ◽

pp. 3569-3588 ◽

Cited By ~ 28

Author(s):

Natasha MacBean ◽

Philippe Peylin ◽

Frédéric Chevallier ◽

Marko Scholze ◽

Gregor Schürmann

Keyword(s):

Data Assimilation ◽

Carbon Cycle ◽

Data Streams ◽

Land Surface ◽

Data Stream ◽

Correlated Errors ◽

Multiple Data ◽

Model Dynamics ◽

Error Distributions ◽

Multiple Data Streams

Abstract. Data assimilation methods provide a rigorous statistical framework for constraining parametric uncertainty in land surface models (LSMs), which in turn helps to improve their predictive capability and to identify areas in which the representation of physical processes is inadequate. The increase in the number of available datasets in recent years allows us to address different aspects of the model at a variety of spatial and temporal scales. However, combining data streams in a DA system is not a trivial task. In this study we highlight some of the challenges surrounding multiple data stream assimilation for the carbon cycle component of LSMs. We give particular consideration to the assumptions associated with the type of inversion algorithm that are typically used when optimising global LSMs – namely, Gaussian error distributions and linearity in the model dynamics. We explore the effect of biases and inconsistencies between the observations and the model (resulting in non-Gaussian error distributions), and we examine the difference between a simultaneous assimilation (in which all data streams are included in one optimisation) and a step-wise approach (in which each data stream is assimilated sequentially) in the presence of non-linear model dynamics. In addition, we perform a preliminary investigation into the impact of correlated errors between two data streams for two cases, both when the correlated observation errors are included in the prior observation error covariance matrix, and when the correlated errors are ignored. We demonstrate these challenges by assimilating synthetic observations into two simple models: the first a simplified version of the carbon cycle processes represented in many LSMs and the second a non-linear toy model. Finally, we provide some perspectives and advice to other land surface modellers wishing to use multiple data streams to constrain their model parameters.

Download Full-text

Constraining a land surface model with multiple observations by application of the MPI-Carbon Cycle Data Assimilation System

10.5194/gmd-2015-263 ◽

2016 ◽

Cited By ~ 3

Author(s):

G. J. Schürmann ◽

T. Kaminski ◽

C. Köstler ◽

N. Carvalhais ◽

M. Voßbeck ◽

...

Keyword(s):

Data Assimilation ◽

Carbon Cycle ◽

Land Surface ◽

Data Stream ◽

Atmospheric Co2 ◽

Surface Model ◽

Terrestrial Biosphere ◽

Multiple Observations ◽

Data Assimilation System ◽

Assimilation System

Abstract. We describe the Max Planck Institute Carbon Cycle Data Assimilation System (MPI-CCDAS) built around the tangent-linear version of the land surface scheme of the MPI-Earth System Model v1 (JSBACH). The simulated terrestrial biosphere processes (phenology and carbon balance) were constrained by observations of the fraction of photosynthetically active radiation (TIP-FAPAR product) and by observations of atmospheric CO2 at a global set of monitoring stations for the years 2005–2009. The system successfully, and computationally efficiently, improved average foliar area and northern extra-tropical seasonality of foliar area when constrained by TIP-FAPAR. Global net and gross carbon fluxes were improved when constrained by atmospheric CO2, although the system tended to underestimate tropical productivity. Assimilating both data streams jointly allowed the MPI-CCDAS to match both observations (TIP-FAPAR and atmospheric CO2) equally well as the single data stream assimilation cases, therefore overall increasing the appropriateness of the resultant parameter values and biosphere dynamics. Our study thus highlights the role of the TIP-FAPAR product in stabilising the underdetermined atmospheric inversion problem and demonstrates the value of multiple-data stream assimilation for the simulation of terrestrial biosphere dynamics. The constraint on regional gross and net CO2 flux patterns is limited through the parametrisation of the biosphere model. We expect improvement on that aspect through a refined initialisation strategy and inclusion of further biosphere observations as constraints.

Download Full-text