Apache Nemo: A Framework for Optimizing Distributed Data Processing

2020 ◽  
Vol 38 (3-4) ◽  
pp. 1-31
Author(s):  
Won Wook Song ◽  
Youngseok Yang ◽  
Jeongyoon Eo ◽  
Jangho Seo ◽  
Joo Yeon Kim ◽  
...  

Optimizing scheduling and communication of distributed data processing for resource and data characteristics is crucial for achieving high performance. Existing approaches to such optimizations largely fall into two categories. First, distributed runtimes provide low-level policy interfaces to apply the optimizations, but do not ensure the maintenance of correct application semantics and thus often require significant effort to use. Second, policy interfaces that extend a high-level application programming model ensure correctness, but do not provide sufficient fine control. We describe Apache Nemo, an optimization framework for distributed dataflow processing that provides fine control for high performance and also ensures correctness for ease of use. We combine several techniques to achieve this, including an intermediate representation of dataflow, compiler optimization passes, and runtime extensions. Our evaluation results show that Nemo enables composable and reusable optimizations that bring performance improvements on par with existing specialized runtimes tailored for a specific deployment scenario. Apache Nemo is open-sourced at https://nemo.apache.org as an Apache incubator project.
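The abstract's central ideas, a dataflow intermediate representation plus compile-time optimization passes that annotate rather than rewrite the user's logic, can be illustrated with a minimal Java sketch. The types below (IRVertex, IRDAG, OptimizationPass, LargeShufflePass) are hypothetical stand-ins, not Nemo's actual API; Nemo's real IR, vertex, and execution-property classes differ in detail.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical IR types, only mirroring the paper's idea of a dataflow IR
// whose vertices carry tunable execution properties.
final class IRVertex {
    final String id;
    final Map<String, Object> executionProperties = new HashMap<>();
    IRVertex(String id) { this.id = id; }
}

final class IRDAG {
    final List<IRVertex> topologicalOrder = new ArrayList<>();
}

// A compile-time pass: it adjusts scheduling knobs (here, parallelism)
// but never rewrites the user's dataflow logic, which is how correct
// application semantics are preserved by construction.
interface OptimizationPass {
    IRDAG apply(IRDAG dag);
}

final class ParallelismAnnotatingPass implements OptimizationPass {
    private final int defaultParallelism;
    ParallelismAnnotatingPass(int defaultParallelism) {
        this.defaultParallelism = defaultParallelism;
    }

    @Override
    public IRDAG apply(IRDAG dag) {
        for (IRVertex v : dag.topologicalOrder) {
            // Only annotate vertices that lack an explicit setting,
            // so user-specified properties are never overridden.
            v.executionProperties.putIfAbsent("parallelism", defaultParallelism);
        }
        return dag;
    }
}
```

A runtime would then read the "parallelism" annotation when scheduling tasks; because the pass touches only execution properties, the dataflow's semantics are unchanged no matter how passes are composed or reused.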

2013 ◽  
Vol 765-767 ◽  
pp. 941-944
Author(s):  
Peng Wang ◽  
Jia Nan Wang ◽  
Ji Ci Ba ◽  
Yu Tan

This paper presents an optimization of the SPRINT algorithm within the Hadoop core framework. Combining it with the data mining process, we study cloud computing under the MapReduce programming model, improve and optimize the SPRINT algorithm to fit that model, and port the optimized algorithm to the Hadoop platform for distributed data processing.
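As a hedged illustration of what such a port can look like (not the paper's actual code), the Hadoop job below computes the per-attribute class histograms that SPRINT, a decision-tree classifier, needs in order to evaluate candidate splits via the Gini index. The CSV record layout and the assumption that the class label is the last column are ours.

```java
import java.io.IOException;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;

// Mapper: for each record "f1,f2,...,class", emit one pair per attribute:
// key = "attrIndex=value,class", value = 1. The reducer then holds the
// class histogram needed for Gini-index split evaluation.
class SprintHistogramMapper
        extends Mapper<LongWritable, Text, Text, IntWritable> {
    private static final IntWritable ONE = new IntWritable(1);
    private final Text outKey = new Text();

    @Override
    protected void map(LongWritable offset, Text line, Context ctx)
            throws IOException, InterruptedException {
        String[] fields = line.toString().split(",");  // assumed CSV layout
        String label = fields[fields.length - 1];      // assumed: class is last column
        for (int i = 0; i < fields.length - 1; i++) {
            outKey.set(i + "=" + fields[i] + "," + label);
            ctx.write(outKey, ONE);
        }
    }
}

// Reducer: sum the counts for each (attribute=value, class) key.
class SprintHistogramReducer
        extends Reducer<Text, IntWritable, Text, IntWritable> {
    @Override
    protected void reduce(Text key, Iterable<IntWritable> counts, Context ctx)
            throws IOException, InterruptedException {
        int sum = 0;
        for (IntWritable c : counts) sum += c.get();
        ctx.write(key, new IntWritable(sum));
    }
}
```

The resulting counts feed the sequential split-selection step: for each candidate split, the class proportions p_c give Gini(S) = 1 - sum(p_c^2), and the split minimizing the weighted Gini of its partitions is chosen.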


Author(s):  
Breno A. de Melo Menezes ◽  
Nina Herrmann ◽  
Herbert Kuchen ◽  
Fernando Buarque de Lima Neto

Parallel implementations of swarm intelligence algorithms such as ant colony optimization (ACO) have been widely used to shorten the execution time when solving complex optimization problems. When targeting a GPU environment, developing efficient parallel versions of such algorithms with CUDA can be a difficult and error-prone task even for experienced programmers. To overcome this issue, the parallel programming model of algorithmic skeletons simplifies parallel programs by abstracting from low-level features. This is realized by defining common programming patterns (e.g. map, fold, and zip) that are later converted into efficient parallel code. In this paper, we show how algorithmic skeletons formulated in the domain-specific language Musket cope with the development of a parallel implementation of ACO, and how the result compares to a low-level implementation. Our experimental results show that Musket suits the development of ACO. Besides making it easier for the programmer to deal with the parallelization aspects, Musket generates high-performance code with execution times similar to those of low-level implementations.
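As a rough illustration of the skeleton idea (Musket itself is a DSL that generates low-level parallel code, so this plain-Java sketch is only an analogy), the snippet below expresses one ACO decision step with a map skeleton for the per-edge desirability tau_j^alpha * eta_j^beta and a fold skeleton for normalization. All class names and numeric values here are illustrative, not Musket's syntax.

```java
import java.util.function.BinaryOperator;
import java.util.function.IntToDoubleFunction;
import java.util.stream.IntStream;

// Minimal "algorithmic skeleton" sketch: the skeleton hides *how* work is
// distributed, and the user supplies only the per-element function,
// which is the pattern Musket's map and fold provide.
final class Skeletons {
    // map: apply f to every index, potentially in parallel.
    static double[] map(int n, IntToDoubleFunction f) {
        return IntStream.range(0, n).parallel().mapToDouble(f).toArray();
    }

    // fold: combine all elements with an associative operator.
    static double fold(double[] xs, double identity, BinaryOperator<Double> op) {
        double acc = identity;
        for (double x : xs) acc = op.apply(acc, x);
        return acc;
    }
}

final class AcoStep {
    // One ACO decision step: weight_j = tau_j^alpha * eta_j^beta,
    // normalized into selection probabilities.
    public static void main(String[] args) {
        double[] tau = {0.5, 1.2, 0.8};  // pheromone levels (example data)
        double[] eta = {1.0, 0.4, 0.9};  // heuristic values (example data)
        double alpha = 1.0, beta = 2.0;

        double[] weights = Skeletons.map(tau.length,
                j -> Math.pow(tau[j], alpha) * Math.pow(eta[j], beta));
        double total = Skeletons.fold(weights, 0.0, Double::sum);

        for (int j = 0; j < weights.length; j++) {
            System.out.printf("P(edge %d) = %.3f%n", j, weights[j] / total);
        }
    }
}
```

The point of the pattern is that the two skeleton calls are the entire parallel surface of the program: a generating compiler such as Musket can lower exactly these calls to CUDA kernels while the user-written lambdas stay unchanged.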


BMC Genomics ◽  
2020 ◽  
Vol 21 (1) ◽  
Author(s):  
Onur Yukselen ◽  
Osman Turkyilmaz ◽  
Ahmet Rasit Ozturk ◽  
Manuel Garber ◽  
Alper Kucukural
