One-armed bandit problem for parallel data processing systems

2015 ◽  
Vol 51 (2) ◽  
pp. 177-191
Author(s):  
A. V. Kolnogorov
2021 ◽  
Vol 13 (2) ◽  
pp. 9-39
Author(s):  
Александр Валерианович Колногоров ◽  
Alexander Kolnogorov ◽  
Александр Викторович Назин ◽  
Alexander Nazin ◽  
Дмитрий Николаевич Шиян ◽  
...  

We consider the minimax setup for the two-armed bandit problem as applied to data processing if there are two alternative processing methods with different a priori unknown efficiencies. One should determine the most efficient method and provide its predominant application. To this end, we use the mirror descent algorithm (MDA). It is well-known that corresponding minimax risk has the order of $N^{1/2$ with $N$ being the number of processed data and this bound is unimprovable in order. We propose a batch version of the MDA which allows processing data by packets that is especially important if parallel data processing can be provided. In this case, the processing time is determined by the number of  batches rather than by the total number of data. Unexpectedly, it turned out that the batch version behaves unlike the ordinary one even if the number of packets is large. Moreover, the batch version provides significantly smaller value of the minimax risk, i.e., it considerably improves a control performance. We explain this result by considering another batch modification of the MDA which behavior is close to behavior of the ordinary version and minimax risk is close as well. Our estimates use invariant descriptions of the algorithms based on Gaussian approximations of incomes in batches of data in the domain of ``close'' distributions and are obtained by Monte-Carlo simulations.


Cybernetics ◽  
1990 ◽  
Vol 25 (4) ◽  
pp. 421-430 ◽  
Author(s):  
F. I. Andon ◽  
B. E. Polyachenko ◽  
O. L. Gun'ko

Author(s):  
Daniel Warneke

In recent years, so-called Infrastructure as a Service (IaaS) clouds have become increasingly popular as a flexible and inexpensive platform for ad-hoc parallel data processing. Major players in the cloud computing space like Amazon EC2 have already recognized this trend and started to create special offers which bundle their compute platform with existing software frameworks for these kinds of applications. However, the data processing frameworks which are currently used in these offers have been designed for static, homogeneous cluster systems and do not support the new features which distinguish the cloud platform. This chapter examines the characteristics of IaaS clouds with special regard to massively-parallel data processing. The author highlights use cases which are currently poorly supported by existing parallel data processing frameworks and explains how a tighter integration between the processing framework and the underlying cloud system can help to lower the monetary processing cost for the cloud customer. As a proof of concept, the author presents the parallel data processing framework Nephele, and compares its cost efficiency against the one of the well-known software Hadoop.


Sign in / Sign up

Export Citation Format

Share Document