Minimizing Average of Loss Functions Using Gradient Descent and Stochastic Gradient Descent

Dhaka Univ. J. Sci., Vol. 64(2), pp. 141-145, 2016 (July)
Author(s): Md Rajib Arefin, M Asadujjaman

This paper deals with minimizing the average of loss functions using Gradient Descent (GD) and Stochastic Gradient Descent (SGD). We present these two algorithms for minimizing the average of a large number of smooth convex functions, discuss their complexity analysis, and illustrate the algorithms geometrically. At the end, we compare their performance through numerical experiments.
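As a rough illustration of the setting described in the abstract (not the authors' implementation), the Python sketch below minimizes an average of smooth convex least-squares losses with both GD and SGD; the data, step sizes, and iteration counts are arbitrary choices for the example.

```python
import numpy as np

# Minimal sketch: minimize f(x) = (1/n) * sum_i (a_i^T x - b_i)^2,
# an average of n smooth convex loss functions, by GD and by SGD.
rng = np.random.default_rng(0)
n, d = 1000, 10
A = rng.standard_normal((n, d))
x_true = rng.standard_normal(d)
b = A @ x_true + 0.1 * rng.standard_normal(n)

def full_gradient(x):
    # Gradient of the average loss: (2/n) * A^T (A x - b)
    return 2.0 / n * A.T @ (A @ x - b)

def gd(steps=500, lr=0.01):
    x = np.zeros(d)
    for _ in range(steps):
        x -= lr * full_gradient(x)          # one pass over all n terms per step
    return x

def sgd(steps=5000, lr=0.01):
    x = np.zeros(d)
    for _ in range(steps):
        i = rng.integers(n)                 # sample one loss term uniformly at random
        g = 2.0 * A[i] * (A[i] @ x - b[i])  # unbiased estimate of the full gradient
        x -= lr * g
    return x

print("GD  error:", np.linalg.norm(gd() - x_true))
print("SGD error:", np.linalg.norm(sgd() - x_true))
```

One GD step touches all n loss terms, whereas each SGD step uses a single randomly sampled term; this per-iteration cost difference is what makes SGD attractive when n is large.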

2020, pp. 1-41
Author(s): Benny Avelin, Kaj Nyström

In this paper, we prove that, in the deep limit, stochastic gradient descent on a ResNet-type deep neural network, where each layer shares the same weight matrix, converges to stochastic gradient descent for a Neural ODE, and that the corresponding value/loss functions converge. Our result gives, in the context of minimization by stochastic gradient descent, a theoretical foundation for considering Neural ODEs as the deep limit of ResNets. Our proof is based on certain decay estimates for associated Fokker–Planck equations.
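As an informal illustration of the deep-limit idea (not the paper's construction or proof), the sketch below compares the forward pass of a ResNet whose N layers share one weight matrix with a fine Euler discretization of the corresponding Neural ODE; the tanh vector field, dimensions, and step counts are assumptions made only for this example.

```python
import numpy as np

# Illustrative sketch: a ResNet whose N layers share one weight matrix W,
#   x_{k+1} = x_k + (1/N) * tanh(W x_k),
# is the forward-Euler discretization, with step 1/N, of the Neural ODE
#   dx/dt = tanh(W x) on [0, 1].  As N grows the two forward passes agree.
rng = np.random.default_rng(1)
d = 4
W = 0.5 * rng.standard_normal((d, d))   # shared weight matrix (assumed tanh activation)
x0 = rng.standard_normal(d)

def resnet_forward(x, W, N):
    # N residual blocks, each reusing the same W, with layer step size 1/N
    for _ in range(N):
        x = x + (1.0 / N) * np.tanh(W @ x)
    return x

def ode_forward(x, W, steps=100_000):
    # Very fine Euler discretization, used here as a stand-in for the exact ODE flow
    h = 1.0 / steps
    for _ in range(steps):
        x = x + h * np.tanh(W @ x)
    return x

x_ode = ode_forward(x0.copy(), W)
for N in (4, 16, 64, 256):
    gap = np.linalg.norm(resnet_forward(x0.copy(), W, N) - x_ode)
    print(f"N = {N:4d}  |ResNet - ODE| = {gap:.2e}")
```

The printed gap shrinks roughly like 1/N, which is the elementary, deterministic analogue of viewing the Neural ODE as the deep limit of such weight-tied ResNets; the paper's contribution concerns the much stronger statement about the stochastic gradient descent dynamics and loss functions.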

