Parallel Version of the Mirror Descent Algorithm for the Two-Armed Bandit Problem

Author(s):  
Alexander Kolnogorov ◽  
Alexander Nazin ◽  
Dmitry Shiyan
2021 ◽  
Vol 13 (2) ◽  
pp. 9-39

We consider the minimax setup for the two-armed bandit problem as applied to data processing when there are two alternative processing methods with different, a priori unknown efficiencies. One should determine the most efficient method and provide its predominant application. To this end, we use the mirror descent algorithm (MDA). It is well known that the corresponding minimax risk has the order of $N^{1/2}$, where $N$ is the number of processed data, and that this bound is unimprovable in order. We propose a batch version of the MDA which allows processing data in batches; this is especially important if parallel data processing can be provided. In this case, the processing time is determined by the number of batches rather than by the total number of data. Unexpectedly, it turned out that the batch version behaves unlike the ordinary one even if the number of batches is large. Moreover, the batch version provides a significantly smaller value of the minimax risk, i.e., it considerably improves the control performance. We explain this result by considering another batch modification of the MDA whose behavior, and whose minimax risk, are close to those of the ordinary version. Our estimates use invariant descriptions of the algorithms based on Gaussian approximations of incomes in batches of data in the domain of ``close'' distributions and are obtained by Monte Carlo simulations.
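To make the batch scheme concrete, the following is a minimal sketch, not the authors' exact algorithm, of a batch mirror descent rule for a two-armed Bernoulli bandit. It uses the entropic mirror map (exponentiated gradient) on the probability simplex, splits each batch between the two arms according to the current distribution, and updates once per batch; the step size gamma = 1/sqrt(N), the Bernoulli reward model, and all names are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def batch_mda(p, N, M, gamma=None):
    """Batch mirror descent for a two-armed Bernoulli bandit (illustrative sketch).

    p     : array of the two unknown success probabilities
    N     : total number of data to process
    M     : number of batches (N must be divisible by M)
    gamma : step size; a classical choice has the order 1/sqrt(N)
    """
    assert N % M == 0, "N must split evenly into M batches"
    batch = N // M
    gamma = gamma if gamma is not None else 1.0 / np.sqrt(N)
    probs = np.full(2, 0.5)                  # start from the uniform distribution
    income = 0.0
    for _ in range(M):
        # allocate the whole batch between the arms by the current distribution
        counts = rng.multinomial(batch, probs)
        rewards = rng.binomial(counts, p)    # total income of each arm in the batch
        income += rewards.sum()
        # importance-weighted estimate of each arm's mean income
        grad = rewards / (batch * probs)     # probs stays strictly positive here
        # entropic mirror descent update = multiplicative weights on the simplex
        probs = probs * np.exp(gamma * grad)
        probs /= probs.sum()
    return N * p.max() - income              # regret against the best arm

# Example: 10^4 observations processed in 50 batches of 200
print(batch_mda(np.array([0.55, 0.45]), N=10_000, M=50))
```

Comparing the regret of this parallel version for moderate M against the one-at-a-time case (M = N) is the kind of Monte Carlo comparison the abstract describes.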


2005 ◽  
Vol 41 (4) ◽  
pp. 368-384 ◽  
Author(s):  
A. B. Juditsky ◽  
A. V. Nazin ◽  
A. B. Tsybakov ◽  
N. Vayatis

2014 ◽  
Vol 75 (6) ◽  
pp. 1010-1016
Author(s):  
A. V. Nazin ◽  
S. V. Anulova ◽  
A. A. Tremba

2017 ◽  
Vol 29 (3) ◽  
pp. 825-860 ◽  
Author(s):  
Yunwen Lei ◽  
Ding-Xuan Zhou

We study the convergence of the online composite mirror descent algorithm, which involves a mirror map to reflect the geometry of the data and a convex objective function consisting of a loss and a regularizer possibly inducing sparsity. Our error analysis provides convergence rates in terms of properties of the strongly convex differentiable mirror map and the objective function. For a class of objective functions with Hölder continuous gradients, the convergence rates of the excess (regularized) risk under polynomially decaying step sizes have the order [Formula: see text] after [Formula: see text] iterates. Our results improve the existing error analysis for the online composite mirror descent algorithm by avoiding averaging and removing boundedness assumptions, and they sharpen the existing convergence rates of the last iterate for online gradient descent without any boundedness assumptions. Our methodology mainly depends on a novel error decomposition in terms of an excess Bregman distance, refined analysis of self-bounding properties of the objective function, and the resulting one-step progress bounds.
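As a hedged illustration of the composite update, the sketch below instantiates the algorithm with the Euclidean mirror map, the squared loss, and an l1 regularizer, for which the Bregman proximal step reduces to soft-thresholding; the step-size exponent theta and all names are illustrative assumptions rather than the paper's exact setting.

```python
import numpy as np

def soft_threshold(v, tau):
    """Proximal map of tau * ||.||_1; this is where the sparsity comes from."""
    return np.sign(v) * np.maximum(np.abs(v) - tau, 0.0)

def online_composite_md(stream, dim, lam=0.01, theta=0.75):
    """Online composite mirror descent, Euclidean mirror map (illustrative sketch).

    stream : iterable of (x, y) examples presented one at a time
    lam    : weight of the l1 regularizer
    theta  : exponent of the polynomially decaying step size eta_t = t**(-theta)
    """
    w = np.zeros(dim)
    for t, (x, y) in enumerate(stream, start=1):
        eta = t ** (-theta)                  # polynomially decaying step size
        grad = (w @ x - y) * x               # gradient of the squared loss at w
        # gradient step on the loss, then proximal step on the regularizer
        w = soft_threshold(w - eta * grad, eta * lam)
    return w                                 # last iterate: no averaging is used

# Example on synthetic sparse data
rng = np.random.default_rng(0)
w_true = np.zeros(20); w_true[:3] = 1.0
X = rng.normal(size=(1000, 20))
stream = ((x, x @ w_true + 0.1 * rng.normal()) for x in X)
print(online_composite_md(stream, dim=20))
```

Returning the last iterate without averaging mirrors the point emphasized in the abstract: the analysis covers the final iterate directly rather than a running average.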

