A Neural Network-Based Observation Operator for Coupled Ocean-Acoustic Variational Data Assimilation

Author(s):  
Andrea Storto ◽  
Giovanni De Magistris ◽  
Silvia Falchetti ◽  
Paolo Oddo

<p>Variational data assimilation requires implementing the tangent-linear and adjoint (TA/AD) version of any operator. This intrinsically hampers the use of complicated observations. Here, we assess a new data-driven approach to assimilate acoustic underwater propagation measurements (Transmission Loss, TL) into a regional ocean forecasting system. TL measurements depend on the underlying sound speed fields, mostly temperature, and their inversion would require heavy coding of the TA/AD of an acoustic underwater propagation model. In this study, the non-linear version of the acoustic model is applied to an ensemble of perturbed oceanic conditions. TL outputs are used to formulate both a statistical linear operator based on Canonical Correlation Analysis (CCA), and a neural network-based (NN) operator. For the latter, two linearization strategies are compared, the best performing one relying on reverse-mode automatic differentiation. The new observation operator is applied in data assimilation experiments over the Ligurian Sea (Mediterranean Sea), using the Observing System Simulation Experiments (OSSE) methodology to assess the impact of TL observations onto oceanic fields. TL observations are extracted from a nature run with perturbed surface boundary conditions and stochastic ocean physics. Sensitivity analysis and forecast experiments show not only the highest accuracy of the NN reconstruction of TL when compared to CCA, but also that its use in the assimilation of TL observations is able to significantly improve the upper ocean forecast skills. The use of the NN observation operator is computationally affordable, and its general formulation appears promising for the adjoint-free assimilation of any remote sensing observing network.</p>

Author(s):  
Andrea Storto ◽  
Giovanni De Magistris ◽  
Silvia Falchetti ◽  
Paolo Oddo

AbstractVariational data assimilation requires implementing the tangent-linear and adjoint (TA/AD) version of any operator. This intrinsically hampers the use of complicated observations. Here, we assess a new data-driven approach to assimilate acoustic underwater propagation measurements (Transmission Loss, TL) into a regional ocean forecasting system. TL measurements depend on the underlying sound speed fields, mostly temperature, and their inversion would require heavy coding of the TA/AD of an acoustic underwater propagation model. In this study, the non-linear version of the acoustic model is applied to an ensemble of perturbed oceanic conditions. TL outputs are used to formulate both a statistical linear operator based on canonical correlation analysis (CCA), and a neural network-based (NN) operator. For the latter, two linearization strategies are compared, the best-performing one relying on reverse-mode automatic differentiation. The new observation operator is applied in data assimilation experiments over the Ligurian Sea (Mediterranean Sea), using the Observing System Simulation Experiments (OSSE) methodology to assess the impact of TL observations onto oceanic fields. TL observations are extracted from a nature run with perturbed surface boundary conditions and stochastic ocean physics. Sensitivity analyses indicate that the NN reconstruction of TL is significantly better than CCA. Both CCA and NN are able to improve the upper ocean skill scores in forecast experiments, with NN outperforming CCA on the average. The use of the NN observation operator is computationally affordable, and its general formulation appears promising for the adjoint-free assimilation of any remote sensing observing network.


2018 ◽  
Vol 146 (2) ◽  
pp. 447-465 ◽  
Author(s):  
Mark Buehner ◽  
Ping Du ◽  
Joël Bédard

Abstract Two types of approaches are commonly used for estimating the impact of arbitrary subsets of observations on short-range forecast error. The first was developed for variational data assimilation systems and requires the adjoint of the forecast model. Comparable approaches were developed for use with the ensemble Kalman filter and rely on ensembles of forecasts. In this study, a new approach for computing observation impact is proposed for ensemble–variational data assimilation (EnVar). Like standard adjoint approaches, the adjoint of the data assimilation procedure is implemented through the iterative minimization of a modified cost function. However, like ensemble approaches, the adjoint of the forecast step is obtained by using an ensemble of forecasts. Numerical experiments were performed to compare the new approach with the standard adjoint approach in the context of operational deterministic NWP. Generally similar results are obtained with both approaches, especially when the new approach uses covariance localization that is horizontally advected between analysis and forecast times. However, large differences in estimated impacts are obtained for some surface observations. Vertical propagation of the observation impact is noticeably restricted with the new approach because of vertical covariance localization. The new approach is used to evaluate changes in observation impact as a result of the use of interchannel observation error correlations for radiance observations. The estimated observation impact in similarly configured global and regional prediction systems is also compared. Overall, the new approach should provide useful estimates of observation impact for data assimilation systems based on EnVar when an adjoint model is not available.


Author(s):  
Nemesio Rodriguez-Fernandez ◽  
Patricia de Rosnay ◽  
Clement Albergel ◽  
Philippe Richaume ◽  
Filipe Aires ◽  
...  

The assimilation of Soil Moisture and Ocean Salinity (SMOS) data into the ECMWF (European Centre for Medium Range Weather Forecasts) H-TESSEL (Hydrology revised - Tiled ECMWF Scheme for Surface Exchanges over Land) model is presented. SMOS soil moisture (SM) estimates have been produced specifically by training a neural network with SMOS brightness temperatures as input and H-TESSEL model SM simulations as reference. This can help the assimilation of SMOS information in several ways: (1) the neural network soil moisture (NNSM) data have a similar climatology to the model, (2) no global bias is present with respect to the model even if regional differences can exist. Experiments performing joint data assimilation (DA) of NNSM, 2 metre air temperature and relative humidity or NNSM-only DA are discussed. The resulting SM was evaluated against a large number of in situ measurements of SM obtaining similar results to those of the model with no assimilation, even if significant differences were found from site to site. In addition, atmospheric forecasts initialized with H-TESSEL runs (without DA) or with the analysed SM were compared to measure of the impact of the satellite information. Although, NNSM DA has an overall neutral impact in the forecast in the Tropics, a significant positive impact was found in other areas and periods, especially in regions with limited in situ information. The joint NNSM, T2m and RH2m DA improves the forecast for all the seasons in the Southern Hemisphere. The impact is mostly due to T2m and RH2m, but SMOS NN DA alone also improves the forecast in July- September. In the Northern Hemisphere, the joint NNSM, T2m and RH2m DA improves the forecast in April-September, while NNSM alone has a significant positive effect in July-September. Furthermore, forecasting skill maps show that SMOS NNSM improves the forecast in North America and in Northern Asia for up to 72 hours lead time.


2021 ◽  
Author(s):  
Ronan Fablet ◽  
Bertrand Chapron ◽  
Lucas Drumetz ◽  
Etienne Memin ◽  
Olivier Pannekoucke ◽  
...  

<p>This paper addresses representation learning for the resolution of inverse problems  with geophysical dynamics. Among others, examples of inverse problems of interest include space-time interpolation, short-term forecasting, conditional simulation w.r.t. available observations, downscaling problems… From a methodological point of view, we rely on a variational data assimilation framework. Data assimilation (DA) aims to reconstruct the time evolution of some state given a series of  observations, possibly noisy and irregularly-sampled. Here, we investigate DA from a machine learning point of view backed by an underlying variational representation.  Using automatic differentiation tools embedded in deep learning frameworks, we introduce end-to-end neural network architectures for variational data assimilation. It comprises two key components: a variational model and a gradient-based solver both implemented as neural networks. A key feature of the proposed end-to-end learning architecture is that we may train the neural networks models using both supervised and unsupervised strategies. We first illustrate applications to the reconstruction of Lorenz-63 and Lorenz-96 systems from partial and noisy observations. Whereas the gain issued from the supervised learning setting emphasizes the relevance of groundtruthed observation dataset for real-world case-studies, these results also suggest new means to design data assimilation models from data. Especially, they suggest that learning task-oriented representations of the underlying dynamics may be beneficial. We further discuss applications to short-term forecasting and sampling design along with preliminary results for the reconstruction of sea surface currents from satellite altimetry data. </p><p>This abstract is supported by a preprint available online: https://arxiv.org/abs/2007.12941</p>


2005 ◽  
Vol 133 (4) ◽  
pp. 829-843 ◽  
Author(s):  
Milija Zupanski ◽  
Dusanka Zupanski ◽  
Tomislava Vukicevic ◽  
Kenneth Eis ◽  
Thomas Vonder Haar

A new four-dimensional variational data assimilation (4DVAR) system is developed at the Cooperative Institute for Research in the Atmosphere (CIRA)/Colorado State University (CSU). The system is also called the Regional Atmospheric Modeling Data Assimilation System (RAMDAS). In its present form, the 4DVAR system is employing the CSU/Regional Atmospheric Modeling System (RAMS) nonhydrostatic primitive equation model. The Weather Research and Forecasting (WRF) observation operator is used to access the observations, adopted from the WRF three-dimensional variational data assimilation (3DVAR) algorithm. In addition to the initial conditions adjustment, the RAMDAS includes the adjustment of model error (bias) and lateral boundary conditions through an augmented control variable definition. Also, the control variable is defined in terms of the velocity potential and streamfunction instead of the horizontal winds. The RAMDAS is developed after the National Centers for Environmental Prediction (NCEP) Eta 4DVAR system, however with added improvements addressing its use in a research environment. Preliminary results with RAMDAS are presented, focusing on the minimization performance and the impact of vertical correlations in error covariance modeling. A three-dimensional formulation of the background error correlation is introduced and evaluated. The Hessian preconditioning is revisited, and an alternate algebraic formulation is presented. The results indicate a robust minimization performance.


2015 ◽  
Vol 8 (3) ◽  
pp. 669-696 ◽  
Author(s):  
G. Descombes ◽  
T. Auligné ◽  
F. Vandenberghe ◽  
D. M. Barker ◽  
J. Barré

Abstract. The specification of state background error statistics is a key component of data assimilation since it affects the impact observations will have on the analysis. In the variational data assimilation approach, applied in geophysical sciences, the dimensions of the background error covariance matrix (B) are usually too large to be explicitly determined and B needs to be modeled. Recent efforts to include new variables in the analysis such as cloud parameters and chemical species have required the development of the code to GENerate the Background Errors (GEN_BE) version 2.0 for the Weather Research and Forecasting (WRF) community model. GEN_BE allows for a simpler, flexible, robust, and community-oriented framework that gathers methods used by some meteorological operational centers and researchers. We present the advantages of this new design for the data assimilation community by performing benchmarks of different modeling of B and showing some of the new features in data assimilation test cases. As data assimilation for clouds remains a challenge, we present a multivariate approach that includes hydrometeors in the control variables and new correlated errors. In addition, the GEN_BE v2.0 code is employed to diagnose error parameter statistics for chemical species, which shows that it is a tool flexible enough to implement new control variables. While the generation of the background errors statistics code was first developed for atmospheric research, the new version (GEN_BE v2.0) can be easily applied to other domains of science and chosen to diagnose and model B. Initially developed for variational data assimilation, the model of the B matrix may be useful for variational ensemble hybrid methods as well.


2007 ◽  
Vol 135 (6) ◽  
pp. 2339-2354 ◽  
Author(s):  
Pierre Gauthier ◽  
Monique Tanguay ◽  
Stéphane Laroche ◽  
Simon Pellerin ◽  
Josée Morneau

Abstract On 15 March 2005, the Meteorological Service of Canada (MSC) proceeded to the implementation of a four-dimensional variational data assimilation (4DVAR) system, which led to significant improvements in the quality of global forecasts. This paper describes the different elements of MSC’s 4DVAR assimilation system, discusses some issues encountered during the development, and reports on the overall results from the 4DVAR implementation tests. The 4DVAR system adopted an incremental approach with two outer iterations. The simplified model used in the minimization has a horizontal resolution of 170 km and its simplified physics includes vertical diffusion, surface drag, orographic blocking, stratiform condensation, and convection. One important element of the design is its modularity, which has permitted continued progress on the three-dimensional variational data assimilation (3DVAR) component (e.g., addition of new observation types) and the model (e.g., computational and numerical changes). This paper discusses some numerical problems that occur in the vicinity of the Poles where the semi-Lagrangian scheme becomes unstable when there is a simultaneous occurrence of converging meridians and strong wind gradients. These could be removed by filtering the winds in the zonal direction before they are used to estimate the upstream position in the semi-Lagrangian scheme. The results show improvements in all aspects of the forecasts over all regions. The impact is particularly significant in the Southern Hemisphere where 4DVAR is able to extract more information from satellite data. In the Northern Hemisphere, 4DVAR accepts more asynoptic data, in particular coming from profilers and aircrafts. The impact noted is also positive and the short-term forecasts are particularly improved over the west coast of North America. Finally, the dynamical consistency of the 4DVAR global analyses leads to a significant impact on regional forecasts. Experimentation has shown that regional forecasts initiated directly from a 4DVAR global analysis are improved with respect to the regional forecasts resulting from the regional 3DVAR analysis.


2020 ◽  
Vol 242 ◽  
pp. 104980
Author(s):  
Yaodeng Chen ◽  
Shan Guo ◽  
Deming Meng ◽  
Hongli Wang ◽  
Dongmei Xu ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document