Mirror descent algorithm on the indefinite control horizon

Abstract: We consider the problem of optimal control in a random environment in a minimax setting as applied to data processing. The random environment is assumed to provide two methods of data processing whose effectiveness is not known in advance. The goal of control is to find the optimal strategy for applying the processing methods and to minimize losses. To solve this problem, we use the mirror descent algorithm, including its modifications for batch processing. Batch-processing algorithms give a significant gain in speed because batches can be processed in parallel. In the classical statement, the optimal strategy is sought on a fixed control horizon, whereas this article considers an indefinite control horizon. With an indefinite horizon, the control algorithm cannot use the value of the horizon when searching for an optimal strategy. The operation of the mirror descent algorithm and its modifications on an indefinite control horizon is studied by numerical modeling, and the results obtained are presented.
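
The difference between a fixed and an indefinite horizon can be illustrated by how a step size might be chosen. The sketch below is an assumption, not the schedule used in the article: it takes the textbook exponential-weights step size for a known horizon and replaces the horizon by the elapsed time, so that the algorithm never needs to know when processing stops. The function names and the constant under the square root are hypothetical.

```python
import math


def fixed_horizon_step(n_horizon, k=2):
    """Step size that requires knowing the horizon n_horizon in advance.

    Assumed textbook form for k alternatives; not taken from the article.
    """
    return math.sqrt(math.log(k) / (k * n_horizon))


def anytime_step(t, k=2):
    """Step size that depends only on the number t of items processed so far.

    It can be used when the horizon is indefinite, because no future
    information enters the formula.
    """
    return math.sqrt(math.log(k) / (k * t))


if __name__ == "__main__":
    # The anytime schedule starts with larger steps and shrinks over time.
    for t in (1, 10, 100, 1000):
        print(t, anytime_step(t))
```

At the moment when the elapsed time equals a given horizon, the two formulas coincide; before that moment the horizon-free schedule takes larger steps.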

Control in a weakly inhomogeneous two-alternative random environment using the mirror descent algorithm

Journal of Physics Conference Series ◽

10.1088/1742-6596/1352/1/012048 ◽

2019 ◽

Vol 1352 ◽

pp. 012048 ◽

Cited By ~ 1

Author(s):

D N Shiyan ◽

A V Kolnogorov

Keyword(s):

Random Environment ◽

Descent Algorithm ◽

Mirror Descent

Two-armed bandit problem and batch version of the mirror descent algorithm

Mathematical Game Theory and Applications ◽

10.17076/mgta_2021_2_34 ◽

2021 ◽

Vol 13 (2) ◽

pp. 9-39

Author(s):

Александр Валерианович Колногоров ◽

Alexander Kolnogorov ◽

Александр Викторович Назин ◽

Alexander Nazin ◽

Дмитрий Николаевич Шиян ◽

...

Keyword(s):

Data Processing ◽

A Priori ◽

Control Performance ◽

Bandit Problem ◽

Minimax Risk ◽

Descent Algorithm ◽

Alternative Processing ◽

Mirror Descent ◽

Processing Data ◽

Parallel Data

We consider the minimax setup for the two-armed bandit problem as applied to data processing when there are two alternative processing methods with different, a priori unknown efficiencies. One should determine the more efficient method and ensure its predominant application. To this end, we use the mirror descent algorithm (MDA). It is well known that the corresponding minimax risk has the order of $N^{1/2}$, with $N$ being the number of processed data, and that this bound is unimprovable in order. We propose a batch version of the MDA that processes data in packets, which is especially important if parallel data processing can be provided. In this case, the processing time is determined by the number of batches rather than by the total number of data. Unexpectedly, the batch version behaves unlike the ordinary one even if the number of packets is large. Moreover, the batch version provides a significantly smaller value of the minimax risk, i.e., it considerably improves control performance. We explain this result by considering another batch modification of the MDA whose behavior is close to that of the ordinary version and whose minimax risk is close as well. Our estimates use invariant descriptions of the algorithms based on Gaussian approximations of incomes in batches of data in the domain of ``close'' distributions and are obtained by Monte-Carlo simulations.
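
As a rough illustration of the idea, the following is a minimal sketch of mirror descent with an entropy mirror map on a two-armed bandit, which in this form reduces to exponential weights over importance-weighted loss estimates. Everything specific here is an assumption rather than the authors' method: Bernoulli incomes, the step size, the function name `mirror_descent_bandit`, and the simple batch rule that keeps the strategy fixed within a batch and updates it once per batch.

```python
import numpy as np


def mirror_descent_bandit(means, n_steps, batch_size=1, seed=0):
    """Entropic mirror descent on a Bernoulli bandit (illustrative sketch).

    means      -- expected incomes of the arms (unknown to the algorithm)
    batch_size -- number of items processed with a fixed strategy
    Returns (realized loss relative to always using the best arm,
             final probability vector over arms).
    """
    rng = np.random.default_rng(seed)
    means = np.asarray(means, dtype=float)
    k = len(means)
    loss_sum = np.zeros(k)          # accumulated importance-weighted losses
    probs = np.full(k, 1.0 / k)     # start from the uniform strategy
    total_income = 0.0
    t = 0
    while t < n_steps:
        m = min(batch_size, n_steps - t)
        # The strategy is fixed inside a batch, so the m items
        # could be processed in parallel.
        arms = rng.choice(k, size=m, p=probs)
        incomes = (rng.random(m) < means[arms]).astype(float)
        total_income += incomes.sum()
        for a in range(k):
            chosen = arms == a
            if chosen.any():
                # Unbiased loss estimate: observed loss / selection probability.
                loss_sum[a] += (1.0 - incomes[chosen]).sum() / probs[a]
        t += m
        # Assumed step size; it uses only the elapsed time t, not n_steps.
        eta = np.sqrt(np.log(k) / (k * t))
        # Mirror step for the entropy map: a Gibbs distribution.
        z = -eta * loss_sum
        z -= z.max()                # shift for numerical stability
        probs = np.exp(z)
        probs /= probs.sum()
    regret = n_steps * means.max() - total_income
    return regret, probs


if __name__ == "__main__":
    for b in (1, 50):
        regret, probs = mirror_descent_bandit([0.6, 0.4], 5000, batch_size=b)
        print(f"batch_size={b}: regret={regret:.1f}, probs={probs}")
```

With `batch_size=1` the strategy is updated after every item; with a larger `batch_size` the number of updates equals the number of batches, which is the quantity that determines processing time when batches are handled in parallel.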

Coordinated Multi-Microgrids Optimal Control Algorithm for Smart Distribution Management System

IEEE Transactions on Smart Grid ◽

10.1109/tsg.2013.2269481 ◽

2013 ◽

Vol 4 (4) ◽

pp. 2174-2181 ◽

Cited By ~ 123

Author(s):

Jiang Wu ◽

Xiaohong Guan

Keyword(s):

Optimal Control ◽

Management System ◽

Control Algorithm ◽

Distribution Management ◽

Distribution Management System

An Economic Model Predictive Control Approach for Wind Power Smoothing and Tower Load Mitigation

Volume 2: Control and Optimization of Connected and Automated Ground Vehicles; Dynamic Systems and Control Education; Dynamics and Control of Renewable Energy Systems; Energy Harvesting; Energy Systems; Estimation and Identification; Intelligent Transportation and Vehicles; Manufacturing; Mechatronics; Modeling and Control of IC Engines and Aftertreatment Systems; Modeling and Control of IC Engines and Powertrain Systems; Modeling and Management of Power Systems ◽

10.1115/dscc2018-9032 ◽

2018 ◽

Author(s):

Mohamed M. Alhneaish ◽

Mohamed L. Shaltout ◽

Sayed M. Metwalli

Keyword(s):

Optimal Control ◽

Optimal Control Problem ◽

Model Predictive Control ◽

Wind Power ◽

Predictive Control ◽

Economic Model ◽

Control Algorithm ◽

Fatigue Load ◽

Economic Model Predictive Control ◽

Control Framework

An economic model predictive control framework is presented in this study for an integrated wind turbine and flywheel energy storage system. The control objective is to smooth wind power output and mitigate tower fatigue load. The optimal control problem within the model predictive control framework has been formulated as a convex optimal control problem with linear dynamics and convex constraints that can be solved globally. The performance of the proposed control algorithm is compared to that of a standard wind turbine controller. The effect of the proposed control actions on the fatigue loads acting on the tower and blades is studied. The simulation results, with various wind scenarios, showed the ability of the proposed control algorithm to achieve the aforementioned objectives in terms of smoothing output power and mitigating tower fatigue load at the cost of a minimal reduction of the wind energy harvested.

Approximate-optimal control algorithm for constrained zero-sum differential games through event-triggering mechanism

Nonlinear Dynamics ◽

10.1007/s11071-018-4713-0 ◽

2018 ◽

Vol 95 (4) ◽

pp. 2639-2657 ◽

Cited By ~ 6

Author(s):

Chaoxu Mu ◽

Ke Wang

Keyword(s):

Optimal Control ◽

Differential Games ◽

Control Algorithm ◽

Triggering Mechanism ◽

Event Triggering ◽

Zero Sum

Bayes Performance of Batch Data Mining Based on Functional Dependencies

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001419590110 ◽

2019 ◽

Vol 33 (03) ◽

pp. 1959011

Author(s):

Haixu Xi ◽

Feiyue Ye ◽

Sheng He ◽

Yijun Liu ◽

Hongfen Jiang

Keyword(s):

Data Mining ◽

Data Processing ◽

Video Processing ◽

Mining Area ◽

Batch Processing ◽

Video Data ◽

Batch Processes ◽

Functional Dependencies ◽

Workflow System ◽

Traffic Video

Batch processes are common in traffic video data processing, for example in traffic video image processing and intelligent transportation. The application of batch processing can increase the efficiency of resource conservation. However, owing to limited research on traffic video data processing conditions, batch processing activities in this area remain minimally examined. In this study, we developed a workflow system by employing database functional dependency mining. The Bayesian network is a focus area of data mining and provides an intuitive means for users to comply with causality expression approaches. Graph theory is also used in the data mining area. The proposed approach depends on relational database functions to remove redundant attributes, reduce interference, and select a property order. The restoration of selective hidden naive Bayesian (SHNB) affects this property order when it is used only once. With consideration of the hidden naive Bayes (HNB) influence, HNB is introduced twice rather than using one pair of HNB. We additionally designed and implemented algorithms for mining dependencies from a batch traffic video processing log for data execution.
