Parameter calibration with stochastic gradient descent for interacting particle systems driven by neural networks

Mathematics of Control Signals and Systems ◽

10.1007/s00498-021-00309-8 ◽

2021 ◽

Author(s):

Simone Göttlich ◽

Claudia Totzeck

Keyword(s):

Gradient Descent ◽

Interacting Particle Systems ◽

Stochastic Gradient ◽

Stochastic Gradient Descent ◽

Force Model ◽

Data Sets ◽

Optimal Controls ◽

Parameter Calibration ◽

Descent Algorithm ◽

Gradient Descent Algorithm

AbstractWe propose a neural network approach to model general interaction dynamics and an adjoint-based stochastic gradient descent algorithm to calibrate its parameters. The parameter calibration problem is considered as optimal control problem that is investigated from a theoretical and numerical point of view. We prove the existence of optimal controls, derive the corresponding first-order optimality system and formulate a stochastic gradient descent algorithm to identify parameters for given data sets. To validate the approach, we use real data sets from traffic and crowd dynamics to fit the parameters. The results are compared to forces corresponding to well-known interaction models such as the Lighthill–Whitham–Richards model for traffic and the social force model for crowd motion.

Download Full-text

Soft-Sign Stochastic Gradient Descent Algorithm for Wireless Federated Learning

10.1109/spawc51858.2021.9593212 ◽

2021 ◽

Author(s):

Seunghoon Lee ◽

Chanho Park ◽

Songnam Hong ◽

Yonina C. Eldar ◽

Namyoon Lee

Keyword(s):

Gradient Descent ◽

Stochastic Gradient ◽

Stochastic Gradient Descent ◽

Descent Algorithm ◽

Gradient Descent Algorithm

Download Full-text

A Stochastic Gradient Descent Algorithm for Structural Risk Minimisation

Lecture Notes in Computer Science - Algorithmic Learning Theory ◽

10.1007/978-3-540-39624-6_17 ◽

2003 ◽

pp. 205-220 ◽

Cited By ~ 1

Author(s):

Joel Ratsaby

Keyword(s):

Gradient Descent ◽

Stochastic Gradient ◽

Stochastic Gradient Descent ◽

Risk Minimisation ◽

Descent Algorithm ◽

Gradient Descent Algorithm ◽

Structural Risk

Download Full-text

A Novel Stochastic Gradient Descent Algorithm Based on Grouping over Heterogeneous Cluster Systems for Distributed Deep Learning

2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID) ◽

10.1109/ccgrid.2019.00053 ◽

2019 ◽

Author(s):

Wenbin Jiang ◽

Geyan Ye ◽

Laurence T. Yang ◽

Jian Zhu ◽

Yang Ma ◽

...

Keyword(s):

Deep Learning ◽

Gradient Descent ◽

Stochastic Gradient ◽

Stochastic Gradient Descent ◽

Heterogeneous Cluster ◽

Cluster Systems ◽

Descent Algorithm ◽

Gradient Descent Algorithm

Download Full-text

An Efficient, Distributed Stochastic Gradient Descent Algorithm for Deep-Learning Applications

2017 46th International Conference on Parallel Processing (ICPP) ◽

10.1109/icpp.2017.10 ◽

2017 ◽

Cited By ~ 2

Author(s):

Guojing Cong ◽

Onkar Bhardwaj ◽

Minwei Feng

Keyword(s):

Deep Learning ◽

Gradient Descent ◽

Stochastic Gradient ◽

Stochastic Gradient Descent ◽

Descent Algorithm ◽

Gradient Descent Algorithm

Download Full-text

Identifying lateral boundary conditions for the M2 tide in a coastal model using a stochastic gradient descent algorithm

Ocean Modelling ◽

10.1016/j.ocemod.2020.101709 ◽

2020 ◽

Vol 156 ◽

pp. 101709

Author(s):

Guillaume Koenig ◽

Clement Aldebert ◽

Cristele Chevalier ◽

Jean-Luc Devenon

Keyword(s):

Boundary Conditions ◽

Gradient Descent ◽

Stochastic Gradient ◽

Stochastic Gradient Descent ◽

Lateral Boundary ◽

Descent Algorithm ◽

Lateral Boundary Conditions ◽

Gradient Descent Algorithm

Download Full-text

Blind multiuser detector for DS/CDMA channels based on the modified stochastic gradient descent algorithm

ICC 2001. IEEE International Conference on Communications. Conference Record (Cat. No.01CH37240) ◽

10.1109/icc.2001.937157 ◽

2002 ◽

Author(s):

A. Mukherjee ◽

K.C. Teh ◽

E. Gunawan

Keyword(s):

Gradient Descent ◽

Stochastic Gradient ◽

Stochastic Gradient Descent ◽

Descent Algorithm ◽

Multiuser Detector ◽

Gradient Descent Algorithm

Download Full-text

On Projected Stochastic Gradient Descent Algorithm with Weighted Averaging for Least Squares Regression

IEEE Transactions on Automatic Control ◽

10.1109/tac.2017.2705559 ◽

2017 ◽

Vol 62 (11) ◽

pp. 5974-5981 ◽

Cited By ~ 9

Author(s):

Kobi Cohen ◽

Angelia Nedic ◽

R. Srikant

Keyword(s):

Least Squares ◽

Gradient Descent ◽

Stochastic Gradient ◽

Weighted Averaging ◽

Stochastic Gradient Descent ◽

Least Squares Regression ◽

Descent Algorithm ◽

Gradient Descent Algorithm

Download Full-text

SW-SGD: The Sliding Window Stochastic Gradient Descent Algorithm

Procedia Computer Science ◽

10.1016/j.procs.2017.05.082 ◽

2017 ◽

Vol 108 ◽

pp. 2318-2322 ◽

Cited By ~ 9

Author(s):

Imen Chakroun ◽

Tom Haber ◽

Thomas J. Ashby

Keyword(s):

Gradient Descent ◽

Sliding Window ◽

Stochastic Gradient ◽

Stochastic Gradient Descent ◽

Descent Algorithm ◽

Gradient Descent Algorithm

Download Full-text

On the Convergence Properties of a K-step Averaging Stochastic Gradient Descent Algorithm for Nonconvex Optimization

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/447 ◽

2018 ◽

Cited By ~ 8

Author(s):

Fan Zhou ◽

Guojing Cong

Keyword(s):

Gradient Descent ◽

Large Scale ◽

Stochastic Gradient ◽

Learning Problems ◽

Stochastic Gradient Descent ◽

Convergence Properties ◽

Descent Algorithm ◽

Convergence Results ◽

Gradient Descent Algorithm ◽

Parallel Stochastic Gradient Descent

We adopt and analyze a synchronous K-step averaging stochastic gradient descent algorithm which we call K-AVG for solving large scale machine learning problems. We establish the convergence results of K-AVG for nonconvex objectives. Our analysis of K-AVG applies to many existing variants of synchronous SGD. We explain why the K-step delay is necessary and leads to better performance than traditional parallel stochastic gradient descent which is equivalent to K-AVG with $K=1$. We also show that K-AVG scales better with the number of learners than asynchronous stochastic gradient descent (ASGD). Another advantage of K-AVG over ASGD is that it allows larger stepsizes and facilitates faster convergence. On a cluster of $128$ GPUs, K-AVG is faster than ASGD implementations and achieves better accuracies and faster convergence for training with the CIFAR-10 dataset.

Download Full-text

SSGD: Sparsity-Promoting Stochastic Gradient Descent Algorithm for Unbiased Dnn Pruning

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp40776.2020.9054436 ◽

2020 ◽

Cited By ~ 1

Author(s):

Ching-Hua Lee ◽

Igor Fedorov ◽

Bhaskar D. Rao ◽

Harinath Garudadri

Keyword(s):

Gradient Descent ◽

Stochastic Gradient ◽

Stochastic Gradient Descent ◽

Descent Algorithm ◽

Gradient Descent Algorithm

Download Full-text