Adjoint-based exact Hessian computation

Author(s):  
Shin-ichi Ito ◽  
Takeru Matsuda ◽  
Yuto Miyatake

Abstract
We consider a scalar function depending on a numerical solution of an initial value problem, and its second-derivative (Hessian) matrix with respect to the initial value. The need to extract information from the Hessian or to solve a linear system having the Hessian as its coefficient matrix arises in many research fields such as optimization, Bayesian estimation, and uncertainty quantification. For memory efficiency, these tasks often employ a Krylov subspace method that does not need to hold the Hessian matrix explicitly and only requires the multiplication of the Hessian with a given vector. One way to obtain an approximation of such a Hessian-vector multiplication is to integrate the so-called second-order adjoint system numerically. However, the error in the approximation can be significant even if the numerical integration of the second-order adjoint system is sufficiently accurate. This paper presents a novel algorithm that computes the intended Hessian-vector multiplication exactly and efficiently. To this end, we give a new, concise derivation of the second-order adjoint system and show that the intended multiplication can be computed exactly by applying a particular numerical method to the second-order adjoint system. In the discussion, symplectic partitioned Runge–Kutta methods play an essential role.
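For contrast with the exact computation this abstract describes, the following is a minimal sketch of the naive matrix-free alternative: approximating the Hessian-vector product by a central difference of gradients. The quadratic-plus-sine objective and all names are illustrative, not from the paper, and the sketch exhibits exactly the kind of discretization error that an exact adjoint-based scheme avoids.

```python
import numpy as np

# Toy objective f(x) = 0.5 x^T A x + sin(x_0); its gradient and Hessian
# are known in closed form, so the matrix-free product can be checked.
A = np.array([[4.0, 1.0], [1.0, 3.0]])

def grad(x):
    return A @ x + np.array([np.cos(x[0]), 0.0])

def hessian_vector_product(x, v, eps=1e-6):
    # Central difference of the gradient along v:
    #   H(x) v ~ (grad(x + eps*v) - grad(x - eps*v)) / (2*eps)
    return (grad(x + eps * v) - grad(x - eps * v)) / (2 * eps)

x = np.array([0.3, -0.7])
v = np.array([1.0, 2.0])
H_exact = A + np.diag([-np.sin(x[0]), 0.0])
hv = hessian_vector_product(x, v)
# hv agrees with H_exact @ v only up to finite-difference truncation error
```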

Author(s):  
Liheng Wu ◽  
Andreas Müller ◽  
Jian S. Dai

Higher order loop constraints play a key role in the local mobility, singularity, and dynamic analysis of closed-loop linkages. Recently, closed forms of higher order kinematic constraints have been obtained using nested Lie products in screw coordinates, which are purely algebraic operations. However, the complexity of the expressions makes higher order analysis complicated and highly reliant on computer implementations. In this paper, matrix expressions of the first- and second-order kinematic constraints, i.e., those involving the Jacobian and Hessian matrices, are formulated explicitly for single-loop linkages in terms of screw coordinates. For overconstrained linkages, which possess self-stress, the first- and second-order constraints reduce to a set of quadratic forms. The test for the order of mobility relies on solutions of the higher order constraints. Second-order mobility analysis boils down to testing properties of the coefficient matrix of the quadratic forms (i.e., the Hessian) rather than solving them. Thus, the second-order analysis is simplified.
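The reduction from solving quadratic forms to inspecting the Hessian's properties can be illustrated with a toy definiteness test (plain linear algebra, not the paper's screw-coordinate formulation; the helper name is hypothetical): a symmetric coefficient matrix that is definite admits no nontrivial real solution of v^T H v = 0.

```python
import numpy as np

def has_nontrivial_zero(H, tol=1e-10):
    # v^T H v = 0 has a nontrivial real solution iff the symmetric part
    # of H is NOT definite (i.e., it is indefinite or singular).
    eigs = np.linalg.eigvalsh((H + H.T) / 2)
    definite = eigs.min() > tol or eigs.max() < -tol
    return not definite

H_pd  = np.array([[2.0, 0.0], [0.0, 1.0]])   # positive definite
H_ind = np.array([[1.0, 0.0], [0.0, -1.0]])  # indefinite
print(has_nontrivial_zero(H_pd), has_nontrivial_zero(H_ind))  # False True
```

Checking eigenvalue signs replaces solving the quadratic equations themselves, which mirrors the simplification the abstract describes.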


Author(s):  
Sheng-Wei Chen ◽  
Chun-Nan Chou ◽  
Edward Y. Chang

For training fully-connected neural networks (FCNNs), we propose a practical approximate second-order method comprising: 1) an approximation of the Hessian matrix and 2) a conjugate gradient (CG) based method. Our proposed approximate Hessian matrix is memory-efficient and can be applied to any FCNN whose activation and criterion functions are twice differentiable. We devise a CG-based method incorporating a rank-one approximation to derive Newton directions for training FCNNs, which significantly reduces both space and time complexity. This CG-based method can be employed to solve any linear system whose coefficient matrix is Kronecker-factored, symmetric, and positive definite. Empirical studies show the efficacy and efficiency of our proposed method.
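The role of the CG solver can be sketched in a few lines: CG finds the Newton direction from Hx = b using only matrix-vector products, never materializing H. This is generic textbook CG with a small SPD stand-in matrix, not the paper's Kronecker-factored variant.

```python
import numpy as np

def cg(matvec, b, tol=1e-10, max_iter=100):
    # Conjugate gradient needing only products `matvec(v)` with a
    # symmetric positive definite matrix, never the matrix itself.
    x = np.zeros_like(b)
    r = b - matvec(x)
    p = r.copy()
    rs = r @ r
    for _ in range(max_iter):
        Hp = matvec(p)
        alpha = rs / (p @ Hp)
        x += alpha * p
        r -= alpha * Hp
        rs_new = r @ r
        if np.sqrt(rs_new) < tol:
            break
        p = r + (rs_new / rs) * p
        rs = rs_new
    return x

H = np.array([[4.0, 1.0], [1.0, 3.0]])  # SPD stand-in for the Hessian
b = np.array([1.0, 2.0])
x = cg(lambda v: H @ v, b)  # solves H x = b matrix-free
```

For an n-by-n system, CG converges in at most n iterations in exact arithmetic, which is why avoiding an explicit Hessian dominates the space savings.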


2015 ◽  
Vol 12 (11) ◽  
pp. 4584-4592
Author(s):  
Zhongming Teng ◽  
Linzhang Lu ◽  
Xiaoqian Niu

2018 ◽  
Vol 5 (1) ◽  
pp. 102-112 ◽  
Author(s):  
Shekhar Singh Negi ◽  
Syed Abbas ◽  
Muslim Malik

Abstract
By using a generalized Opial-type inequality on time scales, a new oscillation criterion is given for a singular initial-value problem of a second-order dynamic equation on time scales. Some oscillatory results for its generalizations are also presented. An example with various time scales is given to illustrate the analytical findings.


Author(s):  
Yuka Hashimoto ◽  
Takashi Nodera

Abstract
The Krylov subspace method has been investigated and refined for approximating the behaviors of finite- or infinite-dimensional linear operators. It has been used for approximating eigenvalues, solutions of linear equations, and operator functions acting on vectors. Recently, for time-series data analysis, much attention has been paid to the Krylov subspace method as a viable method for estimating the multiplication of a vector by an unknown linear operator, referred to as a transfer operator. In this paper, we investigate a convergence analysis of Krylov subspace methods for estimating operator-vector multiplications.
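The basic building block behind such estimates is the Arnoldi iteration, which constructs an orthonormal Krylov basis from repeated operator-vector multiplications. Below is a generic sketch of standard Arnoldi (not the paper's specific estimator); the relation matvec(Q[:, j]) = Q @ H[:, j] is what lets a small Hessenberg matrix stand in for the large operator.

```python
import numpy as np

def arnoldi(matvec, b, k):
    # Build an orthonormal basis Q of the Krylov subspace
    # span{b, A b, ..., A^k b} and the (k+1)-by-k Hessenberg matrix H
    # with A Q[:, :k] = Q H (modified Gram-Schmidt orthogonalization).
    n = b.shape[0]
    Q = np.zeros((n, k + 1))
    H = np.zeros((k + 1, k))
    Q[:, 0] = b / np.linalg.norm(b)
    for j in range(k):
        w = matvec(Q[:, j])
        for i in range(j + 1):
            H[i, j] = Q[:, i] @ w
            w -= H[i, j] * Q[:, i]
        H[j + 1, j] = np.linalg.norm(w)
        if H[j + 1, j] < 1e-12:       # breakdown: invariant subspace found
            return Q[:, :j + 1], H[:j + 1, :j]
        Q[:, j + 1] = w / H[j + 1, j]
    return Q, H

rng = np.random.default_rng(0)
A = rng.standard_normal((5, 5))
b = rng.standard_normal(5)
Q, H = arnoldi(lambda v: A @ v, b, 3)
```

Projecting onto Q reduces questions about A (eigenvalues, linear solves, operator functions) to questions about the small matrix H, which is the mechanism the convergence analysis studies.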


Author(s):  
Jonas Dünnebacke ◽  
Stefan Turek ◽  
Christoph Lohmann ◽  
Andriy Sokolov ◽  
Peter Zajac

We discuss how “parallel-in-space & simultaneous-in-time” Newton-multigrid approaches can be designed to improve the scaling behavior of the spatial parallelism by reducing latency costs. The idea is to solve many time steps at once, thereby solving fewer but larger systems. These large systems are reordered and interpreted as a space-only problem, leading to a multigrid algorithm with semi-coarsening in space and line smoothing in the time direction. The smoother is further improved by embedding it as a preconditioner in a Krylov subspace method. As a prototypical application, we concentrate on scalar partial differential equations (PDEs) with up to many thousands of time steps, which are discretized in time and space by finite difference and finite element methods, respectively. For linear PDEs, the resulting method is closely related to multigrid waveform relaxation and its theoretical framework. In our parabolic test problems, the numerical behavior of this multigrid approach is robust w.r.t. the spatial and temporal grid sizes and the number of simultaneously treated time steps. Moreover, we illustrate how corresponding time-simultaneous fixed-point and Newton-type solvers can be derived for nonlinear nonstationary problems that require the described solution of linearized problems in each outer nonlinear step. As the main result, we are able to generate much larger problem sizes to be treated by a large number of cores, so that the combination of the robustly scaling multigrid solvers with a larger degree of parallelism allows a faster solution procedure for nonstationary problems.
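The "solve many time steps at once" idea can be sketched for implicit Euler on the 1D heat equation: stacking N steps of (I - dt*L) u^{n+1} = u^n yields one block lower-bidiagonal system. The toy dense solve below only illustrates the reordering; the paper applies multigrid with semi-coarsening and line smoothing to such systems, and all sizes here are illustrative.

```python
import numpy as np

m, N, dt = 8, 4, 0.01
# 1D Dirichlet Laplacian on m interior points of the unit interval
L = (np.diag(-2.0 * np.ones(m)) + np.diag(np.ones(m - 1), 1)
     + np.diag(np.ones(m - 1), -1)) * (m + 1) ** 2
A = np.eye(m) - dt * L                 # one implicit Euler step matrix
u0 = np.sin(np.pi * np.arange(1, m + 1) / (m + 1))

# Block system: A on the diagonal blocks, -I on the subdiagonal blocks,
# right-hand side carries the initial value only in the first block row.
B = np.kron(np.eye(N), A) - np.kron(np.eye(N, k=-1), np.eye(m))
rhs = np.concatenate([u0] + [np.zeros(m)] * (N - 1))
U = np.linalg.solve(B, rhs).reshape(N, m)   # all N time steps at once
```

One large solve replaces N small sequential solves; in the parallel setting this trades N rounds of communication latency for a single, larger system that the time-simultaneous multigrid handles.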

