Quasi-Newton full-waveform inversion with a projected Hessian matrix

Geophysics ◽  
2012 ◽  
Vol 77 (5) ◽  
pp. R207-R216 ◽  
Author(s):  
Yong Ma ◽  
Dave Hale

We present a method, in realistic-size full-waveform inversion (FWI), to explicitly construct a projected Hessian matrix and its inverse matrix, which we subsequently used to solve FWI with a quasi-Newton method. Newton’s method is practically unfeasible in solving realistic-size FWI problems because of the prohibitive cost (computing time and memory consumption) of calculating the Hessian matrix and the inverse Hessian. Therefore, the Gauss-Newton method and various quasi-Newton methods are proposed to approximate a Hessian matrix. Particularly, current quasi-Newton FWI (QNFWI) commonly uses the limited-memory BFGS (L-BFGS) method, which, however, only implicitly approximates an inverse Hessian. We repose FWI as a sparse optimization problem in a sparse model space, which contains substantially fewer model parameters that are constrained by structures of the model. With respect to fewer parameters in the sparse model, we can avoid the “limited-memory” approximation and are able to explicitly compute and store a projected Hessian matrix that saves the computational time and required memory. We constructed such a projected Hessian matrix by adapting the classic BFGS method to a projected BFGS (P-BFGS) method in the sparse space. Using the projected Hessian matrix and its inverse, we can apply the P-BFGS method to solve FWI with a quasi-Newton method. In QNFWI with P-BFGS because we invert for a sparse model with much fewer parameters, the memory required to compute the projected Hessian is negligible compared to either forward modeling or gradient calculation. QNFWI with P-BFGS converges in fewer iterations than conjugate-gradient based methods and QNFWI with L-BFGS.

Geophysics ◽  
2012 ◽  
Vol 77 (4) ◽  
pp. R189-R198 ◽  
Author(s):  
Yong Ma ◽  
Dave Hale ◽  
Bin Gong ◽  
Zhaobo (Joe) Meng

Multiple problems, including high computational cost, spurious local minima, and solutions with no geologic sense, have prevented widespread application of full waveform inversion (FWI), especially FWI of seismic reflections. These problems are fundamentally related to a large number of model parameters and to the absence of low frequencies in recorded seismograms. Instead of inverting for all the parameters in a dense model, image-guided full waveform inversion inverts for a sparse model space that contains far fewer parameters. We represent a model with a sparse set of values, and from these values, we use image-guided interpolation (IGI) and its adjoint operator to compute finely and uniformly sampled models that can fit recorded data in FWI. Because of this sparse representation, image-guided FWI updates more blocky models, and this blockiness in the model space mitigates the absence of low frequencies in recorded data. Moreover, IGI honors imaged structures, so image-guided FWI built in this way yields models that are geologically sensible.


2020 ◽  
Author(s):  
Dirk-Philip van Herwaarden ◽  
Christian Boehm ◽  
Michael Afanasiev ◽  
Solvi Thrastarson ◽  
Lion Krischer ◽  
...  

<p>We present an evolutionary full-waveform inversion based on dynamic mini-batch optimization, which naturally exploits redundancies in observed data from different sources and allows the model to evolve along with the amount of available information in the data.</p><p>Quasi-random subsets (mini-batches) of sources are used to approximate the misfit and the gradient of the complete dataset. The size of the mini-batch is dynamically controlled by the desired quality of the approximation of the full gradient. Within each mini-batch, redundancy is minimized by selecting sources with the largest angular differences between their respective gradients, and spatial coverage is maximized by selecting candidate events with Mitchell’s best-candidate algorithm. Information from sources included in a previous mini-batch is incorporated into each gradient calculation through a quasi-Newton approximation of the Hessian, and a consistent misfit measure is achieved through the inclusion of a control group of sources.</p><p>By design, the dynamic mini-batch approach has several main advantages: (1) The use of mini-batches with adaptive sizes minimizes the number of redundant simulations per iteration, thus potentially leading to significant computational savings. (2) Curvature information is accumulated and used during the inversion, using a stochastic quasi-Newton method. (3) Data from new events or different time windows can seamlessly be incorporated during the iterations, thereby enabling an evolutionary mode of full-waveform inversion.</p><p>To illustrate our method, we start an inversion for upper mantle structure beneath the African plate. Starting from a smooth 1-D background model for a dataset recorded in the years 1990 to 1995, we then sequentially add more and more recent data into the inversion and show how the model can evolve as a function of data coverage. The mini-batch sampling approach allows us to incorporate data from several hundred earthquakes without increasing the computational burden, thereby going significantly beyond previous regional-scale full-waveform inversions.</p>


Geophysics ◽  
2013 ◽  
Vol 78 (5) ◽  
pp. WC113-WC121 ◽  
Author(s):  
Nishant Kamath ◽  
Ilya Tsvankin

Although full-waveform inversion (FWI) has shown significant promise in reconstructing heterogeneous velocity fields, most existing methodologies are limited to acoustic models. We extend FWI to multicomponent (PP and PS) data from anisotropic media, with the current implementation limited to a stack of horizontal, homogeneous VTI (transversely isotropic with a vertical symmetry axis) layers. The algorithm is designed to estimate the interval vertical P- and S-wave velocities ([Formula: see text] and [Formula: see text]) and Thomsen parameters [Formula: see text] and [Formula: see text] from long-spread PP and PSV reflections. The forward-modeling operator is based on the anisotropic reflectivity technique, and the inversion is performed in the time domain using the gradient (Gauss-Newton) method. We employ nonhyperbolic semblance analysis and Dix-type equations to build the initial model. To identify the medium parameters constrained by the data, we perform eigenvalue/eigenvector decomposition of the approximate Hessian matrix for a VTI layer embedded between isotropic media. Analysis of the eigenvectors shows that the parameters [Formula: see text], [Formula: see text], [Formula: see text], and [Formula: see text] (density is assumed to be known) can be resolved not only by joint inversion of PP and PS data, but also with PP reflections alone. Although the inversion becomes more stable with increasing spreadlength-to-depth ([Formula: see text]) ratio, the parameters of the three-layer model are constrained even by PP data acquired on conventional spreads ([Formula: see text]). For multilayered VTI media, the sensitivity of the objective function to the interval parameters decreases with depth. Still, it is possible to resolve [Formula: see text], [Formula: see text], [Formula: see text], and [Formula: see text] for the deeper layers using PP-waves, if the ratio [Formula: see text] for the bottom of the layer reaches two. Mode-converted waves provide useful additional constraints for FWI, which become essential for smaller spreads. The insights gained here by examining horizontally layered models should help guide the inversion for heterogeneous TI media.


Geophysics ◽  
2016 ◽  
Vol 81 (6) ◽  
pp. R399-R415 ◽  
Author(s):  
Jizhong Yang ◽  
Yuzhu Liu ◽  
Liangguo Dong

Density is known to be difficult to reconstruct in multiparameter full-waveform inversion (FWI). This difficulty results from the similarity of the diffraction patterns of velocity and density at small scattering angles. In addition, the sensitivities of seismic data with respect to velocity and density have different orders of magnitude which make the inversion ill-conditioned. The inverse Hessian has been shown to mitigate the coupling effects and rescale the magnitudes of different parameters, such that reliable updates for all parameters are available. We have investigated the possibility of simultaneous estimations of velocity and density in acoustic media using the truncated Gauss-Newton method. The model updates are calculated using a matrix-free conjugate gradient solution of the Gauss-Newton normal equation. The gradients of the misfit function with respect to the model parameters and the Hessian-vector products are computed using an improved scattering-integral approach. To give some insights into the trade-off effects between velocity and density, and the imaging resolution in FWI, the sensitivity kernels of both parameters are numerically calculated in homogeneous background models, and their spatial distributions and characteristics are analyzed. The synthetic experiments on a canonical inclusion model and the 2004 BP model confirm that, in cases in which the Gauss-Newton approximate Hessian, especially its off-diagonal blocks, is accurately taken into account, the truncated Gauss-Newton method can effectively mitigate the trade-off effects between velocity and density and provide accelerated convergence rate. Hence, well-resolved velocity and density models are expected.


2018 ◽  
Vol 8 (2) ◽  
Author(s):  
Katherine Flórez ◽  
Sergio Alberto Abreo Carrillo ◽  
Ana Beatriz Ramírez Silva

Full Waveform Inversion (FWI) schemes are gradually becoming more common in the oil and gas industry, as a new tool for studying complex geological zones, based on their reliability for estimating velocity models. FWI is a non-linear inversion method that iteratively estimates subsurface characteristics such as seismic velocity, starting from an initial velocity model and the preconditioned data acquired. Blended sources have been used in marine seismic acquisitions to reduce acquisition costs, reducing the number of times that the vessel needs to cross the exploration delineation trajectory. When blended or simultaneous without previous de-blending or separation, stage data are used in the reconstruction of the velocity model with the FWI method, and the computational time is reduced. However, blended data implies overlapping single shot-gathers, producing interference that affects the result of seismic approaches, such as FWI or seismic image migration. In this document, an encoding strategy is developed, which reduces the overlap areas within the blended data to improve the final velocity model with the FWI method.


Sign in / Sign up

Export Citation Format

Share Document