scholarly journals Control in a weakly inhomogeneous two-alternative random environment using the mirror descent algorithm

2019 ◽  
Vol 1352 ◽  
pp. 012048 ◽  
Author(s):  
D N Shiyan ◽  
A V Kolnogorov
2021 ◽  
Vol 2052 (1) ◽  
pp. 012039
Author(s):  
D N Shiyan ◽  
A V Kolnogorov

Abstract We consider the problem of optimal control in a random environment in a minimax setting as applied to data processing. It is assumed that the random environment provides two methods of data processing, the effectiveness of which is not known in advance. The goal of the control in this case is to find the optimal strategy for the application of processing methods and to minimize losses. To solve this problem, the mirror descent algorithm is used, including its modifications for batch processing. The use of algorithms for batch processing allows us to get a significant gain in speed due to the parallel processing of batches. In the classical statement, the search for the optimal strategy is considered on a fixed control horizon but this article considers an indefinite control horizon. With an indefinite horizon, the control algorithm cannot use information about the value of the horizon when searching for an optimal strategy. Using numerical modeling, the operation of the mirror descent algorithm and its modifications on an indefinite control horizon is studied and obtained results are presented.


2005 ◽  
Vol 41 (4) ◽  
pp. 368-384 ◽  
Author(s):  
A. B. Juditsky ◽  
A. V. Nazin ◽  
A. B. Tsybakov ◽  
N. Vayatis

2014 ◽  
Vol 75 (6) ◽  
pp. 1010-1016
Author(s):  
A. V. Nazin ◽  
S. V. Anulova ◽  
A. A. Tremba

2017 ◽  
Vol 29 (3) ◽  
pp. 825-860 ◽  
Author(s):  
Yunwen Lei ◽  
Ding-Xuan Zhou

We study the convergence of the online composite mirror descent algorithm, which involves a mirror map to reflect the geometry of the data and a convex objective function consisting of a loss and a regularizer possibly inducing sparsity. Our error analysis provides convergence rates in terms of properties of the strongly convex differentiable mirror map and the objective function. For a class of objective functions with Hölder continuous gradients, the convergence rates of the excess (regularized) risk under polynomially decaying step sizes have the order [Formula: see text] after [Formula: see text] iterates. Our results improve the existing error analysis for the online composite mirror descent algorithm by avoiding averaging and removing boundedness assumptions, and they sharpen the existing convergence rates of the last iterate for online gradient descent without any boundedness assumptions. Our methodology mainly depends on a novel error decomposition in terms of an excess Bregman distance, refined analysis of self-bounding properties of the objective function, and the resulting one-step progress bounds.


Sign in / Sign up

Export Citation Format

Share Document