One-armed bandit problem for parallel data processing systems

We consider the minimax setup for the two-armed bandit problem as applied to data processing if there are two alternative processing methods with different a priori unknown efficiencies. One should determine the most efficient method and provide its predominant application. To this end, we use the mirror descent algorithm (MDA). It is well-known that corresponding minimax risk has the order of $N^{1/2$ with $N$ being the number of processed data and this bound is unimprovable in order. We propose a batch version of the MDA which allows processing data by packets that is especially important if parallel data processing can be provided. In this case, the processing time is determined by the number of batches rather than by the total number of data. Unexpectedly, it turned out that the batch version behaves unlike the ordinary one even if the number of packets is large. Moreover, the batch version provides significantly smaller value of the minimax risk, i.e., it considerably improves a control performance. We explain this result by considering another batch modification of the MDA which behavior is close to behavior of the ordinary version and minimax risk is close as well. Our estimates use invariant descriptions of the algorithms based on Gaussian approximations of incomes in batches of data in the domain of ``close'' distributions and are obtained by Monte-Carlo simulations.

Download Full-text

MAPREDUCE: INSIGHT ANALYSIS OF BIG DATA VIA PARALLEL DATA PROCESSING USING JAVA PROGRAMMING, HIVE AND APACHE PIG

International Journal of Advanced Research in Computer Science ◽

10.26483/ijarcs.v9i1.5414 ◽

2018 ◽

Vol 9 (1) ◽

pp. 536-540 ◽

Cited By ~ 1

Author(s):

Dr. Ujjwal Agarwal ◽

Keyword(s):

Big Data ◽

Data Processing ◽

Java Programming ◽

Parallel Data ◽

Apache Pig

Download Full-text

Calculation of focal positions in an optical head for parallel data processing with a monolithic four-beam laser diode

Applied Optics ◽

10.1364/ao.40.001065 ◽

2001 ◽

Vol 40 (7) ◽

pp. 1065 ◽

Cited By ~ 3

Author(s):

Masahisa Shinoda

Keyword(s):

Data Processing ◽

Laser Diode ◽

Optical Head ◽

Beam Laser ◽

Parallel Data

Download Full-text

Low-power Parallel Data Processing Using Computation Reuse

Proceedings of the 7th International Conference on Information Communication and Management - ICICM 2017 ◽

10.1145/3134383.3134410 ◽

2017 ◽

Author(s):

Bita Dabiri ◽

Seyyed Hossein SeyyedAghaei Rezaei ◽

Mehdi Modarressi

Keyword(s):

Data Processing ◽

Low Power ◽

Parallel Data

Download Full-text

Parallel Data Mining and Applications in Hospital Big Data Processing

Big Data Management and Processing ◽

10.1201/9781315154008-20 ◽

2017 ◽

pp. 403-424

Author(s):

Jianguo Chen ◽

Zhuo Tang ◽

Kenli Li ◽

Keqin Li

Keyword(s):

Data Mining ◽

Big Data ◽

Data Processing ◽

Big Data Processing ◽

Parallel Data ◽

Parallel Data Mining

Download Full-text

Parallel data processing architectures for identification of structural modal properties using dense wireless sensor networks

World Forum on Smart Materials and Smart Structures Technology ◽

10.1201/9781439828441.ch132 ◽

2008 ◽

Author(s):

J Lynch ◽

D Saftner ◽

M Shiraishi ◽

R Swartz ◽

M Setareh ◽

...

Keyword(s):

Wireless Sensor Networks ◽

Sensor Networks ◽

Data Processing ◽

Wireless Sensor ◽

Modal Properties ◽

Parallel Data ◽

Processing Architectures

Download Full-text

Models of parallel data processing in multiprocessor computing systems

Cybernetics ◽

10.1007/bf01070363 ◽

1990 ◽

Vol 25 (4) ◽

pp. 421-430 ◽

Cited By ~ 1

Author(s):

F. I. Andon ◽

B. E. Polyachenko ◽

O. L. Gun'ko

Keyword(s):

Data Processing ◽

Computing Systems ◽

Parallel Data

Download Full-text

Ad-Hoc Parallel Data Processing on Pay-As-You-Go Clouds with Nephele

Advances in Systems Analysis, Software Engineering, and High Performance Computing - Distributed Computing Innovations for Business, Engineering, and Science ◽

10.4018/978-1-4666-2533-4.ch010 ◽

2013 ◽

pp. 191-218

Author(s):

Daniel Warneke

Keyword(s):

Cloud Computing ◽

Data Processing ◽

Ad Hoc ◽

Cluster Systems ◽

Software Frameworks ◽

Homogeneous Cluster ◽

Processing Cost ◽

Parallel Data ◽

The One ◽

Processing Framework

In recent years, so-called Infrastructure as a Service (IaaS) clouds have become increasingly popular as a flexible and inexpensive platform for ad-hoc parallel data processing. Major players in the cloud computing space like Amazon EC2 have already recognized this trend and started to create special offers which bundle their compute platform with existing software frameworks for these kinds of applications. However, the data processing frameworks which are currently used in these offers have been designed for static, homogeneous cluster systems and do not support the new features which distinguish the cloud platform. This chapter examines the characteristics of IaaS clouds with special regard to massively-parallel data processing. The author highlights use cases which are currently poorly supported by existing parallel data processing frameworks and explains how a tighter integration between the processing framework and the underlying cloud system can help to lower the monetary processing cost for the cloud customer. As a proof of concept, the author presents the parallel data processing framework Nephele, and compares its cost efficiency against the one of the well-known software Hadoop.

Download Full-text

Parallel Data Processing in Dynamic Hybrid Computing Environment Using MapReduce

Algorithms and Architectures for Parallel Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-319-11194-0_1 ◽

2014 ◽

pp. 1-14 ◽

Cited By ~ 1

Author(s):

Bing Tang ◽

Haiwu He ◽

Gilles Fedak

Keyword(s):

Data Processing ◽

Computing Environment ◽

Hybrid Computing ◽

Parallel Data ◽

Dynamic Hybrid

Download Full-text