Efficient Adaptive Online Learning via Frequent Directions

Author(s): Yuanyu Wan, Nan Wei, Lijun Zhang

By employing time-varying proximal functions, adaptive subgradient methods (ADAGRAD) improve the regret bound and have been widely used in online learning and optimization. However, ADAGRAD with full-matrix proximal functions (ADA-FULL) cannot handle large-scale problems due to its impractical time and space complexities, even though it performs better when gradients are correlated. In this paper, we propose ADA-FD, an efficient variant of ADA-FULL based on a deterministic matrix sketching technique called frequent directions. Following ADA-FULL, we incorporate ADA-FD into both the primal-dual subgradient method and the composite mirror descent method to develop two efficient algorithms. By maintaining and manipulating low-rank matrices, the space complexity at each iteration is reduced from $O(d^2)$ to $O(\tau d)$ and the time complexity from $O(d^3)$ to $O(\tau^2 d)$, where $d$ is the dimensionality of the data and $\tau \ll d$ is the sketching size. Theoretical analysis reveals that the regret of our methods is close to that of ADA-FULL as long as the outer product matrix of the gradients is approximately low-rank. Experimental results show that ADA-FD is comparable to ADA-FULL and outperforms other state-of-the-art algorithms in online convex optimization as well as in training convolutional neural networks (CNNs).
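The frequent directions technique at the core of ADA-FD can be illustrated in a few lines. The following is a minimal NumPy sketch of the standard frequent-directions algorithm (Liberty's deterministic sketch), not the authors' ADA-FD code; the function name and the per-step-SVD formulation are illustrative choices. It maintains a $\tau \times d$ sketch $B$ whose Gram matrix $B^\top B$ approximates the outer-product matrix of the gradient stream, which is what gives the $O(\tau d)$ space and $O(\tau^2 d)$ per-step time mentioned above.

```python
import numpy as np

def frequent_directions(gradients, sketch_size):
    """Deterministic frequent-directions sketch of a row stream.

    Maintains a small matrix B (sketch_size x d, assuming sketch_size <= d)
    such that B^T B approximates G^T G for the stream of gradient rows G,
    using only O(sketch_size * d) space.
    """
    d = gradients.shape[1]
    B = np.zeros((sketch_size, d))
    for g in gradients:
        # Insert the incoming gradient into the (zeroed) last row.
        B[-1] = g
        # Shrink: SVD of the small sketch costs O(sketch_size^2 * d);
        # subtracting the smallest squared singular value from all of
        # them re-zeroes the last row for the next insertion.
        U, s, Vt = np.linalg.svd(B, full_matrices=False)
        s_shrunk = np.sqrt(np.maximum(s**2 - s[-1]**2, 0.0))
        B = s_shrunk[:, None] * Vt
    return B
```

The shrink step only ever decreases the sketch's Gram matrix, so $B^\top B \preceq G^\top G$ always holds, and the spectral error is bounded by $\|G\|_F^2/\tau$; when the gradient outer-product matrix is approximately low-rank, this error is small, which matches the regret condition stated in the abstract.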

2018, Vol 58 (11), pp. 1728-1736
Author(s): A. S. Bayandina, A. V. Gasnikov, E. V. Gasnikova, S. V. Matsievskii

2016, Vol 177, pp. 643-650
Author(s): Jueyou Li, Guo Chen, Zhaoyang Dong, Zhiyou Wu

2016, Vol 12 (6), pp. 1179-1197
Author(s): Jueyou Li, Guoquan Li, Zhiyou Wu, Changzhi Wu

2020, Vol 34 (04), pp. 5093-5100
Author(s): Wenye Ma

This paper considers online convex optimization (OCO) problems, the paramount framework for online learning algorithm design. In the OCO setting, the loss function of a learning task is defined over streaming data, which makes OCO a powerful tool for modeling large-scale applications such as online recommender systems. Meanwhile, real-world data are usually extremely high-dimensional due to modern feature engineering techniques, so quadratic regression is impractical. Factorization Machines and their variants are efficient models for capturing feature interactions via a low-rank matrix model, but they cannot fulfill the OCO setting due to their non-convexity. In this paper, we propose a projective quadratic regression (PQR) model. First, it can capture the important second-order feature information. Second, it is a convex model, so the requirements of OCO are fulfilled and the globally optimal solution can be achieved. Moreover, existing online optimization methods such as Online Gradient Descent (OGD) and Follow-The-Regularized-Leader (FTRL) can be applied directly. In addition, by choosing a proper hyper-parameter, we show that it has the same order of space and time complexity as the linear model and thus can handle high-dimensional data. Experimental results demonstrate the accuracy and efficiency of the proposed PQR model in comparison with state-of-the-art methods.
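Since convexity is what lets standard OCO solvers such as OGD apply directly to the PQR model, a minimal sketch of projected online gradient descent may help fix ideas. This is a generic textbook OGD loop, not the paper's implementation; the function, its signature, the $\eta_0/\sqrt{t+1}$ step-size schedule, and the L2-ball feasible set are illustrative assumptions.

```python
import numpy as np

def online_gradient_descent(grad_fn, d, T, eta0=1.0, radius=1.0):
    """Projected online gradient descent for OCO.

    grad_fn(t, w) returns a subgradient of the round-t convex loss at w.
    Uses step size eta0 / sqrt(t + 1) and projects each iterate onto an
    L2 ball of the given radius, which yields O(sqrt(T)) regret for
    convex losses with bounded gradients.
    """
    w = np.zeros(d)
    iterates = []
    for t in range(T):
        g = grad_fn(t, w)
        w = w - eta0 / np.sqrt(t + 1) * g
        # Euclidean projection onto {w : ||w||_2 <= radius}.
        n = np.linalg.norm(w)
        if n > radius:
            w *= radius / n
        iterates.append(w.copy())
    return np.array(iterates)
```

For the convex PQR loss, `grad_fn` would evaluate the gradient of the round-$t$ loss on the current streaming example; because the model is convex, the same loop (or FTRL in its place) inherits the usual sublinear-regret guarantees.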


Complexity, 2016, Vol 21 (S2), pp. 178-190
Author(s): Jueyou Li, Guo Chen, Zhaoyang Dong, Zhiyou Wu, Minghai Yao

2019, Vol 80 (9), pp. 1607-1627
Author(s): A. V. Nazin, A. S. Nemirovsky, A. B. Tsybakov, A. B. Juditsky

2020, Vol 4 (3), pp. 548-553
Author(s): Yue Yu, Behcet Acikmese
