Stochastic Gradient Method
Recently Published Documents


TOTAL DOCUMENTS: 47 (five years: 17)

H-INDEX: 6 (five years: 1)

2021 · Vol 12 (5) · pp. 1-26
Author(s): Congliang Chen, Li Shen, Haozhi Huang, Wei Liu

In this article, we present a distributed variant of an adaptive stochastic gradient method for training deep neural networks in the parameter-server model. To reduce the communication cost between the workers and the server, we incorporate two types of quantization schemes, i.e., gradient quantization and weight quantization, into the proposed distributed Adam. In addition, to reduce the bias introduced by the quantization operations, we propose an error-feedback technique to compensate for the quantized gradient. Theoretically, in the stochastic nonconvex setting, we show that the distributed adaptive gradient method with gradient quantization and error feedback converges to a first-order stationary point, and that the distributed adaptive gradient method with weight quantization and error feedback converges to a point determined by the quantization level, in both single-worker and multi-worker modes. Finally, we apply the proposed distributed adaptive gradient methods to train deep neural networks. Experimental results demonstrate the efficacy of our methods.
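As a rough illustration of the ideas named in this abstract (not the authors' actual algorithm), the sketch below combines a stochastic uniform quantizer, a worker that applies error feedback by carrying the quantization residual into the next gradient, and an Adam-style server update. All names (quantize, ErrorFeedbackWorker, adam_step) and the toy quadratic objective are illustrative assumptions.

```python
# Sketch of gradient quantization + error feedback feeding an Adam-style update.
# Illustrative only; not the paper's exact distributed Adam.
import numpy as np

def quantize(v, num_levels=16):
    """Uniform stochastic-rounding quantizer: keeps the vector norm and
    rounds each normalized coordinate to one of `num_levels` levels."""
    norm = np.linalg.norm(v)
    if norm == 0:
        return v
    scaled = np.abs(v) / norm * (num_levels - 1)
    lower = np.floor(scaled)
    rounded = lower + (np.random.rand(*v.shape) < (scaled - lower))
    return np.sign(v) * rounded * norm / (num_levels - 1)

class ErrorFeedbackWorker:
    """Sends quantized gradients and keeps the quantization error locally,
    adding it back to the next gradient (error feedback)."""
    def __init__(self, dim):
        self.residual = np.zeros(dim)

    def compress(self, grad):
        corrected = grad + self.residual   # add back the previous error
        q = quantize(corrected)            # low-precision message to the server
        self.residual = corrected - q      # store the new quantization error
        return q

def adam_step(w, g, m, v, t, lr=1e-3, b1=0.9, b2=0.999, eps=1e-8):
    """One Adam update on the server using the received gradient g."""
    m = b1 * m + (1 - b1) * g
    v = b2 * v + (1 - b2) * g * g
    m_hat = m / (1 - b1 ** t)
    v_hat = v / (1 - b2 ** t)
    return w - lr * m_hat / (np.sqrt(v_hat) + eps), m, v

# Toy usage: minimize 0.5 * ||w - w_star||^2 with one worker and noisy gradients.
rng = np.random.default_rng(0)
dim = 10
w_star = rng.normal(size=dim)
w, m, v = np.zeros(dim), np.zeros(dim), np.zeros(dim)
worker = ErrorFeedbackWorker(dim)
for t in range(1, 501):
    grad = w - w_star + 0.01 * rng.normal(size=dim)
    w, m, v = adam_step(w, worker.compress(grad), m, v, t)
print("final error:", np.linalg.norm(w - w_star))
```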


2021 · pp. 108201
Author(s): Zhijian Luo, Siyu Chen, Yuntao Qian, Yueen Hou

2020 · Vol 34 (04) · pp. 5636-5643
Author(s): Ali Shafahi, Mahyar Najibi, Zheng Xu, John Dickerson, Larry S. Davis, ...

Standard adversarial attacks change the predicted class label of a selected image by adding specially tailored small perturbations to its pixels. In contrast, a universal perturbation is an update that can be added to any image in a broad class of images while still changing the predicted class label. We study the efficient generation of universal adversarial perturbations, as well as efficient methods for hardening networks against these attacks. We propose a simple optimization-based universal attack that reduces the top-1 accuracy of various network architectures on ImageNet to less than 20%, while learning the universal perturbation 13× faster than the standard method. To defend against these perturbations, we propose universal adversarial training, which models the problem of robust classifier generation as a two-player min-max game and produces robust models at only 2× the cost of natural training. We also propose a simultaneous stochastic gradient method that is almost free of extra computation, which allows us to perform universal adversarial training on ImageNet.
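The following is a minimal sketch of how such a simultaneous update could look, assuming a toy logistic-regression "classifier" in place of a deep network: on each minibatch, one stochastic gradient evaluation updates the model weights w by descent and the shared universal perturbation delta by projected sign ascent. The loss, step sizes, and the epsilon bound are illustrative assumptions, not the authors' settings.

```python
# Sketch of simultaneous min-max updates for universal adversarial training.
# Illustrative only; uses a toy logistic-regression model, not a deep network.
import numpy as np

def loss_and_grads(w, delta, X, y):
    """Logistic loss on perturbed inputs X + delta, with gradients w.r.t.
    both the weights w and the (shared) universal perturbation delta."""
    Z = X + delta                        # every example gets the same delta
    logits = Z @ w
    p = 1.0 / (1.0 + np.exp(-y * logits))
    loss = -np.mean(np.log(p + 1e-12))
    coef = -y * (1.0 - p) / len(y)       # d loss / d logits
    grad_w = Z.T @ coef
    grad_delta = np.outer(coef, w).sum(axis=0)  # chain rule through X + delta
    return loss, grad_w, grad_delta

rng = np.random.default_rng(0)
n, d = 512, 20
X = rng.normal(size=(n, d))
w_true = rng.normal(size=d)
y = np.sign(X @ w_true + 0.1 * rng.normal(size=n))

w = np.zeros(d)
delta = np.zeros(d)
eps, lr_w, lr_delta = 0.5, 0.1, 0.5      # eps bounds the l_inf norm of delta
for step in range(200):
    idx = rng.choice(n, size=64, replace=False)
    loss, gw, gd = loss_and_grads(w, delta, X[idx], y[idx])
    w -= lr_w * gw                                    # descent on the weights
    delta = np.clip(delta + lr_delta * np.sign(gd),   # ascent on the perturbation,
                    -eps, eps)                        # projected onto the eps-ball
print("robust loss:", loss)
```

Because both players are updated from the same minibatch gradient, the extra cost over ordinary training is essentially the single additional vector update for delta, which is what makes the "almost free" claim plausible.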

