PARALLEL IMPLEMENTATION OF THE TOPAZ OPACITY CODE: ISSUES IN LOAD-BALANCING

2008 ◽ Author(s): V Sonnad ◽ C Iglesias

2021 ◽ Vol 14 (2) ◽ pp. 843-857 ◽ Author(s): Pavel Perezhogin ◽ Ilya Chernov ◽ Nikolay Iakovlev

Abstract. In this paper, we present a parallel version of the finite-element model of the Arctic Ocean (FEMAO) configured for the White Sea and based on MPI technology. The model consists of two main parts: an ocean dynamics model and a surface ice dynamics model. These parts differ greatly in computational cost, because the complexity of the ocean part depends on the bottom depth, while that of the sea-ice component does not. As a first step, we place both submodels on the same CPU cores with a common horizontal partition of the computational domain. The model domain is divided into small blocks, which are distributed over the CPU cores using Hilbert-curve balancing. The partition of the model domain is static (i.e., computed during the initialization stage). There are three baseline options: a single block per core, balancing of 2D computations, and balancing of 3D computations. After demonstrating parallel acceleration for individual ocean and ice procedures, we construct a common partition that minimizes the joint imbalance of both submodels. The novelty of our approach is the use of arrays shared by all blocks that belong to a CPU core, instead of allocating separate arrays for each block, as is usually done. Computations on a CPU core are restricted by the masks of non-land grid nodes and by the block–core correspondence. This approach keeps the parallel implementation as simple as with the usual decomposition into squares, while providing better load balancing. We obtain parallel acceleration on up to 996 cores for a configuration with a resolution of 500×500×39 in the ocean component and 43 sea-ice scalars, and we analyze in detail how different partitions affect the model runtime.
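As a rough illustration of the static Hilbert-curve balancing described above, the sketch below (Python with illustrative names; FEMAO itself is not written this way, and the block weights, grid size, and helper functions are assumptions) orders the non-land blocks along a Hilbert curve and cuts the ordered list into contiguous segments of approximately equal total weight, one segment per core. For 2D balancing the weight of a block would be its number of wet surface points; for 3D balancing, the number of wet points summed over depth.

# Sketch of static Hilbert-curve load balancing (illustrative, not FEMAO code).

def hilbert_d2xy(side, d):
    """Map distance d along a Hilbert curve filling a side x side grid to (x, y)."""
    x = y = 0
    t = d
    s = 1
    while s < side:
        rx = 1 & (t // 2)
        ry = 1 & (t ^ rx)
        if ry == 0:            # rotate/flip the quadrant
            if rx == 1:
                x, y = s - 1 - x, s - 1 - y
            x, y = y, x
        x += s * rx
        y += s * ry
        t //= 4
        s *= 2
    return x, y

def partition_blocks(weights, n_cores, side):
    """Greedy split of Hilbert-ordered blocks into segments of similar weight.

    weights -- dict {(x, y): work estimate}; land-only blocks are omitted
    n_cores -- number of MPI processes
    side    -- number of blocks along one grid side (a power of two)
    Returns a dict {(x, y): core rank}.
    """
    ordered = [xy for xy in (hilbert_d2xy(side, d) for d in range(side * side))
               if xy in weights]
    target = sum(weights[xy] for xy in ordered) / n_cores

    assignment, core, acc = {}, 0, 0.0
    for xy in ordered:
        if acc >= target and core < n_cores - 1:
            core, acc = core + 1, 0.0
        assignment[xy] = core
        acc += weights[xy]
    return assignment

# Example: 8x8 blocks with made-up depth-dependent (3D) weights over 4 cores.
if __name__ == "__main__":
    demo_weights = {(i, j): 1 + (i + j) % 5 for i in range(8) for j in range(8)}
    print(partition_blocks(demo_weights, n_cores=4, side=8))

Because consecutive points on a Hilbert curve are spatially close, each core receives a compact cluster of blocks, which keeps halo exchanges between neighbouring cores local.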


2020 ◽ Vol 21 (1) ◽ Author(s): Marouen Ben Guebila

Abstract. Background: Genome-scale metabolic models are increasingly employed to predict the phenotypes of various biological systems in healthcare and bioengineering. To characterize the full metabolic spectrum of such systems, Fast Flux Variability Analysis (FFVA) is commonly run in parallel with static load balancing, which assigns each core an equal number of biochemical reactions without regard to their solution complexity. Results: Here, we present Very Fast Flux Variability Analysis (VFFVA), a parallel implementation that dynamically balances the computational load among the cores at runtime, equalizing their completion times. VFFVA achieves a threefold speedup on coupled models and up to a 100-fold speedup on ill-conditioned models, together with a 14-fold decrease in memory usage. Conclusions: VFFVA exploits the parallel capabilities of modern machines to enable biological insights through more efficient systems biology modeling. VFFVA is available in C, MATLAB, and Python at https://github.com/marouenbg/VFFVA.
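The contrast between static and dynamic balancing can be illustrated with a short sketch. The Python fragment below is not the VFFVA code (which is linked above); it only mimics the scheduling idea: each reaction requires two LP solves whose runtimes vary widely, so letting idle workers pull reactions one at a time avoids the stragglers produced by a static equal split. The flux_range body is a stand-in for the real LP solves.

# Sketch of dynamic load balancing for flux variability analysis
# (illustrative only; the actual VFFVA implementation is linked above).
import random
import time
from multiprocessing import Pool

def flux_range(reaction_id):
    """Stand-in for the two LP solves (min and max flux) of one reaction."""
    # A real implementation would call an LP solver twice on the genome-scale
    # model; a variable sleep mimics the uneven per-reaction solution times
    # that make a static equal split of reactions inefficient.
    time.sleep(random.uniform(0.0, 0.01))
    return reaction_id, 0.0, 0.0          # (reaction, v_min, v_max) placeholders

def fva_dynamic(reaction_ids, n_workers=4):
    """Dynamic scheduling: an idle worker pulls the next unprocessed reaction."""
    with Pool(processes=n_workers) as pool:
        # chunksize=1 hands out one reaction at a time, so a few expensive
        # (e.g., ill-conditioned) reactions cannot stall a core that was
        # statically assigned a large contiguous block of them.
        return list(pool.imap_unordered(flux_range, reaction_ids, chunksize=1))

if __name__ == "__main__":
    results = fva_dynamic(list(range(200)), n_workers=4)
    print(len(results), "reactions processed")

Per-item dispatch trades a little scheduling overhead for much better core utilization whenever per-reaction solve times are highly skewed.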


Author(s): Kirill Nikolaevich Efimkin ◽ Mikhail Aleksandrovich Solovev ◽ Alexander Borisovich Bugerya ◽ Ekaterina Nikolaevna Gladkova ◽ ...

2001 ◽ Vol 11 (04) ◽ pp. 487-501 ◽ Author(s): ZINEB HABBAS ◽ MICHAËL KRAJECKI ◽ DANIEL SINGER

Many problems in Computer Science, especially in Artificial Intelligence, can be formulated as Constraint Satisfaction Problems (CSPs). This paper presents a parallel implementation of the Forward-Checking algorithm for solving binary CSPs over finite domains. Its main contribution is a simple decomposition strategy that dynamically distributes the search tree among the processors. The feasibility and benefit of this approach are studied for a shared-memory model. An implementation is drafted using the emerging OpenMP standard library for shared memory, which also handles load balancing. We highlight satisfactory efficiencies obtained without any elaborate load-balancing policy. All experiments were carried out on a Silicon Graphics Origin 2000 parallel machine.
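A minimal sketch of the decomposition idea, in Python rather than the paper's OpenMP/C setting: the search tree is split at the root variable, each root value becomes an independent subproblem, and subproblems are handed out dynamically to workers that each run a sequential forward-checking search. The n-queens constraint, the names, and the process pool are illustrative assumptions, not the paper's code.

# Sketch: root-level decomposition of a forward-checking search
# (illustrative Python, not the paper's OpenMP implementation).
from multiprocessing import Pool

N = 8  # n-queens used as a stand-in binary CSP: variable = row, value = column

def conflict_free(var1, val1, var2, val2):
    """Binary n-queens constraint: distinct columns and distinct diagonals."""
    return val1 != val2 and abs(val1 - val2) != abs(var1 - var2)

def forward_check(assignment, domains, variables):
    """Sequential forward-checking search; returns a solution dict or None."""
    if len(assignment) == len(variables):
        return dict(assignment)
    var = next(v for v in variables if v not in assignment)
    for value in domains[var]:
        # Filter the domains of future variables against the choice var = value.
        pruned, wiped_out = {}, False
        for other in variables:
            if other in assignment or other == var:
                continue
            allowed = [w for w in domains[other] if conflict_free(var, value, other, w)]
            if not allowed:
                wiped_out = True       # some future variable has no value left
                break
            pruned[other] = allowed
        if not wiped_out:
            assignment[var] = value
            new_domains = dict(domains)
            new_domains.update(pruned)
            result = forward_check(assignment, new_domains, variables)
            if result is not None:
                return result
            del assignment[var]
    return None

def solve_branch(root_value):
    """One subproblem: the root variable fixed to a single value."""
    variables = list(range(N))
    domains = {v: [w for w in range(N)
                   if v == 0 or conflict_free(0, root_value, v, w)]
               for v in variables}
    if any(not domains[v] for v in variables):
        return None
    return forward_check({0: root_value}, domains, variables)

if __name__ == "__main__":
    # Root-level decomposition: each value of variable 0 spans an independent
    # subtree; imap_unordered dispatches subtrees to idle workers dynamically.
    with Pool(processes=4) as pool:
        for solution in pool.imap_unordered(solve_branch, range(N)):
            if solution is not None:
                print(solution)
                break

In the shared-memory OpenMP setting described in the abstract, the same effect is obtained by letting the runtime schedule the root-level subproblems over threads instead of processes.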

