Efficient gradient computation for optimization of hyperparameters

Physics in Medicine and Biology ◽

10.1088/1361-6560/ac4442 ◽

2021 ◽

Author(s):

Jingyan Xu ◽

Frédéric Noo

Keyword(s):

Automatic Differentiation ◽

Learning Strategy ◽

Computation Time ◽

Proximal Mapping ◽

Convex Problems ◽

Numerical Studies ◽

Convex Problem ◽

Convex Objective Function ◽

Gradient Computation ◽

Initial Results

Abstract We are interested in learning the hyperparameters in a convex objective function in a supervised setting. The complex relationship between the input data to the convex problem and the desirable hyperparameters can be modeled by a neural network; the hyperparameters and the data then drive the convex minimization problem, whose solution is then compared to training labels. In our previous work [1], we evaluated a prototype of this learning strategy in an optimization-based sinogram smoothing plus FBP reconstruction framework. A question arising in this setting is how to efficiently compute (backpropagate) the gradient from the solution of the optimization problem, to the hyperparameters to enable end-to-end training. In this work, we first develop general formulas for gradient backpropagation for a subset of convex problems, namely the proximal mapping. To illustrate the value of the general formulas and to demonstrate how to use them, we consider the specific instance of 1-D quadratic smoothing (denoising) whose solution admits a dynamic programming (DP) algorithm. The general formulas lead to another DP algorithm for exact computation of the gradient of the hyperparameters. Our numerical studies demonstrate a 55%- 65% computation time savings by providing a custom gradient instead of relying on automatic differentiation in deep learning libraries. While our discussion focuses on 1-D quadratic smoothing, our initial results (not presented) support the statement that the general formulas and the computational strategy apply equally well to TV or Huber smoothing problems on simple graphs whose solutions can be computed exactly via DP.

Download Full-text

Reverse Automatic Differentiation for Optimum Design: From Adjoint State Assembly to Gradient Computation

Optimization Methods and Software ◽

10.1080/10556780310001610501 ◽

2003 ◽

Vol 18 (5) ◽

pp. 615-627 ◽

Cited By ~ 17

Author(s):

Francois Courty ◽

Alain Dervieux ◽

Bruno Koobus ◽

Laurent Hascoët

Keyword(s):

Optimum Design ◽

Automatic Differentiation ◽

Adjoint State ◽

Gradient Computation

Download Full-text

Image Registration with Automatic Computation of Gradients

The Insight Journal ◽

10.54294/ov202k ◽

2008 ◽

Author(s):

Eli Kahn ◽

Lawrence Staib

Keyword(s):

Image Registration ◽

Gradient Descent ◽

Automatic Differentiation ◽

Optimization Problems ◽

Objective Functions ◽

Software Library ◽

Symbolic Form ◽

Automatic Computation ◽

Gradient Computation

Many image registration algorithms are formulated as optimization problems with a gradient descent based solver, One difficulty with designing and implementing such methods is in the implementation of the gradient computation. This process can be time-consuming and error-prone. In addition some functions do not have gradients that can be expressed in symbolic form. Automatic differentiation is useful for computing gradients of complicated objective functions. It moves the burden of computing gradients from the programmer to the computer. So far, AD has not been exploited for use in image registration. This paper describes a software library the authors have developed to automate the process of computing gradients of registration objective functions. This can alleviate the job of registration designers somewhat and potentially make it easier to design better registration algorithms.

Download Full-text

Approximate supervised learning of quantum gates via ancillary qubits

International Journal of Quantum Information ◽

10.1142/s021974991840004x ◽

2018 ◽

Vol 16 (08) ◽

pp. 1840004

Author(s):

Luca Innocenti ◽

Leonardo Banchi ◽

Sougato Bose ◽

Alessandro Ferraro ◽

Mauro Paternostro

Keyword(s):

Supervised Learning ◽

Gradient Descent ◽

Automatic Differentiation ◽

Learning Strategy ◽

Stochastic Gradient ◽

Stochastic Gradient Descent ◽

Quantum Gates ◽

Half Adder ◽

Qubit Operations

We present strategies for the training of a qubit network aimed at the ancilla-assisted synthesis of multi-qubit gates based on a set of restricted resources. By assuming the availability of only time-independent single and two-qubit interactions, we introduce and describe a supervised learning strategy implemented through momentum-stochastic gradient descent with automatic differentiation methods. We demonstrate the effectiveness of the scheme by discussing the implementation of nontrivial three qubit operations, including a QFT and a half-adder gate.

Download Full-text

A MODEL AND METHOD FOR SOLVING A DYNAMIC TRANSPORT PROBLEM

Journal of Business Economics and Management ◽

10.3846/16111699.2004.9636070 ◽

2004 ◽

Vol 5 (2) ◽

pp. 67-76

Author(s):

Gediminas Davulis

Keyword(s):

Optimization Methods ◽

Fixed Time ◽

Linear Constraints ◽

Network Development ◽

Network State ◽

Convex Problem ◽

Convex Objective Function ◽

Universal Solution ◽

Solution Methods ◽

Cargo Transportation

The problem of optimal development of transport network is considered. We have to define a plan of network development, i.e. a network state at fixed time moments possible the scope of allocated resources such that the total expenses for reconstruction of the network and construction of its new elements as well as for passenger and cargo transportation be the lowest. Thus the problem considered can be described by the optimization model with a non‐linear non‐convex objective function and linear constraints of special structures. Since that is a non‐convex problem with a lot of extreme therefore one can expect to find only an approximate solution, close to a global one, at best. There is no effective and universal solution methods for this problem even in the sense of a local solution. This paper discusses a method for solving the problem using the synthesis of static section, that allows us to decompose dynamic problem into the set of static problems of a smaller volume, and contour optimization methods. The experimental calculation confirm that the proposed method is suitable for solving problem represented in the paper.

Download Full-text

3-D chemistry-transport model Polair: numerical issues, validation and automatic-differentiation strategy

Atmospheric Chemistry and Physics Discussions ◽

10.5194/acpd-4-1371-2004 ◽

2004 ◽

Vol 4 (2) ◽

pp. 1371-1392 ◽

Cited By ~ 15

Author(s):

V. Mallet ◽

B. Sportisse

Keyword(s):

Automatic Differentiation ◽

Transport Model ◽

Three Dimensional ◽

Sensitivity Analyses ◽

Short Paper ◽

Chemistry Transport Model ◽

Linear Mode ◽

Differentiation Strategy ◽

Numerical Studies ◽

Made In

Abstract. We briefly present in this short paper some issues related to the development and the validation of the three-dimensional chemistry-transport model Polair. Numerical studies have been performed in order to let Polair be an efficient and robust solver. This paper summarizes and comments choices that were made in this respect. Simulations of relevant photochemical episodes were led to assess the validity of the model. The results can be considered as a validation, which allows next studies to focus on fine modeling issues. A major feature of Polair is the availability of a tangent linear mode and an adjoint mode entirely generated by automatic differentiation. Tangent linear and adjoint modes grant the opportunity to perform detailed sensitivity analyses and data assimilation. This paper shows how inverse modeling is achieved with Polair.

Download Full-text

Automatic Differentiation Based Nonlinear Reduced-Order Model with Geometric Gradient Computation Capability

AIAA Scitech 2021 Forum ◽

10.2514/6.2021-1475 ◽

2021 ◽

Author(s):

Jeffrey Thomas ◽

Earl H. Dowell

Keyword(s):

Automatic Differentiation ◽

Reduced Order Model ◽

Order Model ◽

Reduced Order ◽

Gradient Computation

Download Full-text

Analysis and Control of Complex Flows in U-Bends Using Computational Fluid Dynamics

Volume 1A, Symposia: Advances in Fluids Engineering Education; Turbomachinery Flow Predictions and Optimization; Applications in CFD; Bio-Inspired Fluid Mechanics; Droplet-Surface Interactions; CFD Verification and Validation; Development and Applications of Immersed Boundary Methods; DNS, LES, and Hybrid RANS/LES Methods ◽

10.1115/fedsm2014-21557 ◽

2014 ◽

Cited By ~ 1

Author(s):

Yiğitcan Güden ◽

Mehmet Metin Yavuz

Keyword(s):

Fluid Dynamic ◽

Computation Time ◽

Radius Of Curvature ◽

Complex Flow ◽

Complex Flows ◽

Dean Number ◽

Mean Velocity ◽

Dean Vortices ◽

Numerical Studies ◽

And Control

Analysis and control of flow structure in U-bends are crucial since U-bends are used in many different engineering applications. As a flow parameter in U-bends, the ratio of inertial and centrifugal forces to viscous forces is called as Dean number. The increase of Dean number destabilizes the flow and leads to a three-dimensional flow consisting of stream wise parallel counter-rotating vortices (Dean vortices) stacked along the curved wall. Due to the curvature in U-bends, the flow development involves complex flow structures including Dean vortices and high levels of turbulence that are not seen in straight duct flows. These are quite critical in considering noise problems and structural failure of the ducts. In this work, computational fluid dynamic (CFD) models are developed using ANSYS FLUENT to simulate these complex flows patterns in square sectioned U-bend with a radius of curvature Rc/D=0.65. The predictions of mean velocity profiles on different angular positions of the U-bend are compared against the experimental results available in the literature and previous numerical studies. Performance of six different turbulence models are evaluated, namely: the standard k-ε, the k-ε Realizable, the k-ε RNG, the k-ω SST, the Reynolds Stress Model (RSM) and the Scale-Adaptive Simulation Model (SAS), to propose the best numerical approach with increasing the accuracy of the solutions while reducing the computation time. Numerical results show remarkable improvements with respect to previous numerical studies and good agreement with the available experimental data. The best turbulence model for this application is proposed considering both the computation time and the result accuracy. In addition, different flow control techniques are still under investigation to eliminate Dean vortices and to reduce turbulence levels in U-bends.

Download Full-text

Electron Probe Analysis of Vertebrate Smooth Muscle: Distribution of Ca and Ci

Proceedings, annual meeting, Electron Microscopy Society of America ◽

10.1017/s0424820100091652 ◽

1976 ◽

Vol 34 ◽

pp. 334-335 ◽

Cited By ~ 1

Author(s):

Avril V. Somlyo ◽

H. Shuman ◽

A.P. Somlyo

Keyword(s):

Smooth Muscle ◽

Electron Probe ◽

Preliminary Report ◽

Cell Damage ◽

Electron Probe Analysis ◽

Perinuclear Space ◽

Probe Analysis ◽

High K ◽

Low K ◽

Initial Results

This is a preliminary report of electron probe analysis of rabbit portal-anterior mesenteric vein (PAMV) smooth muscle cryosectioned without fixation or cryoprotection. The instrumentation and method of electron probe quantitation used (1) and our initial results with cardiac (2) and skeletal (3) muscle have been presented elsewhere.In preparations depolarized with high K (K2SO4) solution, significant calcium peaks were detected over the sarcoplasmic reticulum (Fig 1 and 2) and the continuous perinuclear space. In some of the fibers there were also significant (up to 200 mM/kg dry wt) calcium peaks over the mitochondria. However, in smooth muscle that was not depolarized, high mitochondrial Ca was found in fibers that also contained elevated Na and low K (Fig 3). Therefore, the possibility that these Ca-loaded mitochondria are indicative of cell damage remains to be ruled out.

Download Full-text