Efficient gradient computation for optimization of hyperparameters

Author(s):  
Jingyan Xu ◽  
Frédéric Noo

Abstract We are interested in learning the hyperparameters in a convex objective function in a supervised setting. The complex relationship between the input data to the convex problem and the desirable hyperparameters can be modeled by a neural network; the hyperparameters and the data then drive the convex minimization problem, whose solution is then compared to training labels. In our previous work [1], we evaluated a prototype of this learning strategy in an optimization-based sinogram smoothing plus FBP reconstruction framework. A question arising in this setting is how to efficiently compute (backpropagate) the gradient from the solution of the optimization problem, to the hyperparameters to enable end-to-end training. In this work, we first develop general formulas for gradient backpropagation for a subset of convex problems, namely the proximal mapping. To illustrate the value of the general formulas and to demonstrate how to use them, we consider the specific instance of 1-D quadratic smoothing (denoising) whose solution admits a dynamic programming (DP) algorithm. The general formulas lead to another DP algorithm for exact computation of the gradient of the hyperparameters. Our numerical studies demonstrate a 55%- 65% computation time savings by providing a custom gradient instead of relying on automatic differentiation in deep learning libraries. While our discussion focuses on 1-D quadratic smoothing, our initial results (not presented) support the statement that the general formulas and the computational strategy apply equally well to TV or Huber smoothing problems on simple graphs whose solutions can be computed exactly via DP.

2003 ◽  
Vol 18 (5) ◽  
pp. 615-627 ◽  
Author(s):  
Francois Courty ◽  
Alain Dervieux ◽  
Bruno Koobus ◽  
Laurent Hascoët

2008 ◽  
Author(s):  
Eli Kahn ◽  
Lawrence Staib

Many image registration algorithms are formulated as optimization problems with a gradient descent based solver, One difficulty with designing and implementing such methods is in the implementation of the gradient computation. This process can be time-consuming and error-prone. In addition some functions do not have gradients that can be expressed in symbolic form. Automatic differentiation is useful for computing gradients of complicated objective functions. It moves the burden of computing gradients from the programmer to the computer. So far, AD has not been exploited for use in image registration. This paper describes a software library the authors have developed to automate the process of computing gradients of registration objective functions. This can alleviate the job of registration designers somewhat and potentially make it easier to design better registration algorithms.


2018 ◽  
Vol 16 (08) ◽  
pp. 1840004
Author(s):  
Luca Innocenti ◽  
Leonardo Banchi ◽  
Sougato Bose ◽  
Alessandro Ferraro ◽  
Mauro Paternostro

We present strategies for the training of a qubit network aimed at the ancilla-assisted synthesis of multi-qubit gates based on a set of restricted resources. By assuming the availability of only time-independent single and two-qubit interactions, we introduce and describe a supervised learning strategy implemented through momentum-stochastic gradient descent with automatic differentiation methods. We demonstrate the effectiveness of the scheme by discussing the implementation of nontrivial three qubit operations, including a QFT and a half-adder gate.


2004 ◽  
Vol 5 (2) ◽  
pp. 67-76
Author(s):  
Gediminas Davulis

The problem of optimal development of transport network is considered. We have to define a plan of network development, i.e. a network state at fixed time moments possible the scope of allocated resources such that the total expenses for reconstruction of the network and construction of its new elements as well as for passenger and cargo transportation be the lowest. Thus the problem considered can be described by the optimization model with a non‐linear non‐convex objective function and linear constraints of special structures. Since that is a non‐convex problem with a lot of extreme therefore one can expect to find only an approximate solution, close to a global one, at best. There is no effective and universal solution methods for this problem even in the sense of a local solution. This paper discusses a method for solving the problem using the synthesis of static section, that allows us to decompose dynamic problem into the set of static problems of a smaller volume, and contour optimization methods. The experimental calculation confirm that the proposed method is suitable for solving problem represented in the paper.


2004 ◽  
Vol 4 (2) ◽  
pp. 1371-1392 ◽  
Author(s):  
V. Mallet ◽  
B. Sportisse

Abstract. We briefly present in this short paper some issues related to the development and the validation of the three-dimensional chemistry-transport model Polair. Numerical studies have been performed in order to let Polair be an efficient and robust solver. This paper summarizes and comments choices that were made in this respect. Simulations of relevant photochemical episodes were led to assess the validity of the model. The results can be considered as a validation, which allows next studies to focus on fine modeling issues. A major feature of Polair is the availability of a tangent linear mode and an adjoint mode entirely generated by automatic differentiation. Tangent linear and adjoint modes grant the opportunity to perform detailed sensitivity analyses and data assimilation. This paper shows how inverse modeling is achieved with Polair.


Author(s):  
Yiğitcan Güden ◽  
Mehmet Metin Yavuz

Analysis and control of flow structure in U-bends are crucial since U-bends are used in many different engineering applications. As a flow parameter in U-bends, the ratio of inertial and centrifugal forces to viscous forces is called as Dean number. The increase of Dean number destabilizes the flow and leads to a three-dimensional flow consisting of stream wise parallel counter-rotating vortices (Dean vortices) stacked along the curved wall. Due to the curvature in U-bends, the flow development involves complex flow structures including Dean vortices and high levels of turbulence that are not seen in straight duct flows. These are quite critical in considering noise problems and structural failure of the ducts. In this work, computational fluid dynamic (CFD) models are developed using ANSYS FLUENT to simulate these complex flows patterns in square sectioned U-bend with a radius of curvature Rc/D=0.65. The predictions of mean velocity profiles on different angular positions of the U-bend are compared against the experimental results available in the literature and previous numerical studies. Performance of six different turbulence models are evaluated, namely: the standard k-ε, the k-ε Realizable, the k-ε RNG, the k-ω SST, the Reynolds Stress Model (RSM) and the Scale-Adaptive Simulation Model (SAS), to propose the best numerical approach with increasing the accuracy of the solutions while reducing the computation time. Numerical results show remarkable improvements with respect to previous numerical studies and good agreement with the available experimental data. The best turbulence model for this application is proposed considering both the computation time and the result accuracy. In addition, different flow control techniques are still under investigation to eliminate Dean vortices and to reduce turbulence levels in U-bends.


Author(s):  
Avril V. Somlyo ◽  
H. Shuman ◽  
A.P. Somlyo

This is a preliminary report of electron probe analysis of rabbit portal-anterior mesenteric vein (PAMV) smooth muscle cryosectioned without fixation or cryoprotection. The instrumentation and method of electron probe quantitation used (1) and our initial results with cardiac (2) and skeletal (3) muscle have been presented elsewhere.In preparations depolarized with high K (K2SO4) solution, significant calcium peaks were detected over the sarcoplasmic reticulum (Fig 1 and 2) and the continuous perinuclear space. In some of the fibers there were also significant (up to 200 mM/kg dry wt) calcium peaks over the mitochondria. However, in smooth muscle that was not depolarized, high mitochondrial Ca was found in fibers that also contained elevated Na and low K (Fig 3). Therefore, the possibility that these Ca-loaded mitochondria are indicative of cell damage remains to be ruled out.


2001 ◽  
Vol 120 (5) ◽  
pp. A226-A226 ◽  
Author(s):  
W LAMMERS ◽  
S DHANASEKARAN ◽  
J SLACK ◽  
B STEPHEN

Sign in / Sign up

Export Citation Format

Share Document