A Parallel Algorithm of String Matching Based on Message Passing Interface for Multicore Processors

We introduce a parallel chaos-based encryption algorithm for taking advantage of multicore processors. The chaotic cryptosystem is generated by the piecewise linear chaotic map (PWLCM). The parallel algorithm is designed with a master/slave communication model with the Message Passing Interface (MPI). The algorithm is suitable not only for multicore processors but also for the single-processor architecture. The experimental results show that the chaos-based cryptosystem possesses good statistical properties. The parallel algorithm provides much better performance than the serial ones and would be useful to apply in encryption/decryption file with large size or multimedia.

Download Full-text

An Optimized Parallel Algorithm for Exact String Matching Using Message Passing Interface

Applications and Techniques in Information Security - Communications in Computer and Information Science ◽

10.1007/978-981-13-2907-4_11 ◽

2018 ◽

pp. 129-135

Author(s):

Sucharitha Shetty ◽

B. Dinesh Rao ◽

Srikanth Prabhu

Keyword(s):

Parallel Algorithm ◽

Message Passing ◽

Message Passing Interface ◽

String Matching

Download Full-text

The MPI and OpenMP Implementation of Parallel Algorithm for Generating Mandelbrot Set

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.571-572.26 ◽

2014 ◽

Vol 571-572 ◽

pp. 26-29

Author(s):

Xiang Wei Duan ◽

Wei Chang Shen ◽

Jun Guo

Keyword(s):

Parallel Algorithm ◽

Shared Memory ◽

Message Passing ◽

Message Passing Interface ◽

Algorithm Design ◽

Performance Testing ◽

Mandelbrot Set ◽

The Difference ◽

And Performance

The paper introduce the Mandelbrot Set and the message passing interface (MPI) and shared-memory (OpenMP), analyses the characteristic of algorithm design in the MPI and OpenMP environment, describes the implementation of parallel algorithm about Mandelbrot Set in the MPI environment and the OpenMP environment, conducted a series of evaluation and performance testing during the process of running, then the difference between the two system implementations is compared.

Download Full-text

A Hybrid MPI–OpenMP Parallel Algorithm and Performance Analysis for an Ensemble Square Root Filter Designed for Multiscale Observations

Journal of Atmospheric and Oceanic Technology ◽

10.1175/jtech-d-12-00165.1 ◽

2013 ◽

Vol 30 (7) ◽

pp. 1382-1397 ◽

Cited By ~ 14

Author(s):

Yunheng Wang ◽

Youngsun Jung ◽

Timothy A. Supinie ◽

Ming Xue

Keyword(s):

Data Assimilation ◽

Domain Decomposition ◽

Parallel Algorithm ◽

Shared Memory ◽

Message Passing ◽

Message Passing Interface ◽

High Volume ◽

Square Root ◽

Fixed Amount ◽

Square Root Filter

Abstract A hybrid parallel scheme for the ensemble square root filter (EnSRF) suitable for parallel assimilation of multiscale observations, including those from dense observational networks such as those of radar, is developed based on the domain decomposition strategy. The scheme handles internode communication through a message passing interface (MPI) and the communication within shared-memory nodes via Open Multiprocessing (OpenMP) threads. It also supports pure MPI and pure OpenMP modes. The parallel framework can accommodate high-volume remote-sensed radar (or satellite) observations as well as conventional observations that usually have larger covariance localization radii. The performance of the parallel algorithm has been tested with simulated and real radar data. The parallel program shows good scalability in pure MPI and hybrid MPI–OpenMP modes, while pure OpenMP runs exhibit limited scalability on a symmetric shared-memory system. It is found that in MPI mode, better parallel performance is achieved with domain decomposition configurations in which the leading dimension of the state variable arrays is larger, because this configuration allows for more efficient memory access. Given a fixed amount of computing resources, the hybrid parallel mode is preferred to pure MPI mode on supercomputers with nodes containing shared-memory cores. The overall performance is also affected by factors such as the cache size, memory bandwidth, and the networking topology. Tests with a real data case with a large number of radars confirm that the parallel data assimilation can be done on a multicore supercomputer with a significant speedup compared to the serial data assimilation algorithm.

Download Full-text

A framework for genomic sequencing on clusters of multicore and manycore processors

The International Journal of High Performance Computing Applications ◽

10.1177/1094342016653243 ◽

2016 ◽

Vol 32 (3) ◽

pp. 393-406 ◽

Cited By ~ 1

Author(s):

Héctor Martínez ◽

Sergio Barrachina ◽

Maribel Castillo ◽

Joaquín Tárraga ◽

Ignacio Medina ◽

...

Keyword(s):

Message Passing ◽

Graphics Processing Units ◽

High Performance ◽

Message Passing Interface ◽

Multicore Processors ◽

Global Alignment ◽

Genomic Sequencing ◽

Single Node ◽

Strongly Coupled ◽

Two Stages

The advances in genomic sequencing during the past few years have motivated the development of fast and reliable software for DNA/RNA sequencing on current high performance architectures. Most of these efforts target multicore processors, only a few can also exploit graphics processing units, and a much smaller set will run in clusters equipped with any of these multi-threaded architecture technologies. Furthermore, the examples that can be used on clusters today are all strongly coupled with a particular aligner. In this paper we introduce an alignment framework that can be leveraged to coordinately run any “single-node” aligner, taking advantage of the resources of a cluster without having to modify any portion of the original software. The key to our transparent migration lies in hiding the complexity associated with the multi-node execution (such as coordinating the processes running in the cluster nodes) inside the generic-aligner framework. Moreover, following the design and operation in our Message Passing Interface (MPI) version of HPG Aligner RNA BWT, we organize the framework into two stages in order to be able to execute different aligners in each one of them. With this configuration, for example, the first stage can ideally apply a fast aligner to accelerate the process, while the second one can be tuned to act as a refinement stage that further improves the global alignment process with little cost.

Download Full-text

A permutation parallel algorithm under exchange restriction with message passing interface

10.1063/1.4882458 ◽

2014 ◽

Cited By ~ 1

Author(s):

Sharmila Karim ◽

Zurni Omar ◽

Haslinda Ibrahim

Keyword(s):

Parallel Algorithm ◽

Message Passing ◽

Message Passing Interface

Download Full-text

Two-stage distributed parallel algorithm with message passing interface for maximum flow problem

The Journal of Supercomputing ◽

10.1007/s11227-014-1314-7 ◽

2014 ◽

Vol 71 (2) ◽

pp. 629-647 ◽

Cited By ~ 3

Author(s):

Jincheng Jiang ◽

Lixin Wu

Keyword(s):

Parallel Algorithm ◽

Message Passing ◽

Message Passing Interface ◽

Flow Problem ◽

Maximum Flow ◽

Two Stage ◽

Maximum Flow Problem

Download Full-text

Parallel Dijkstra's Algorithm Based on Multi-Core and MPI

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.441.750 ◽

2013 ◽

Vol 441 ◽

pp. 750-753

Author(s):

Xiao Gang Han ◽

Qin Lei Sun ◽

Jiang Wei Fan

Keyword(s):

Parallel Algorithm ◽

Shortest Path ◽

Message Passing ◽

Message Passing Interface ◽

Dijkstra’S Algorithm ◽

Software Platform ◽

Shortest Path Algorithm ◽

Dijkstra's Algorithm ◽

Low Efficiency

Dijkstra’s algorithm is a typical but low efficiency shortest path algorithm. The parallel Dijkstra’s algorithm based on message passing interface (MPI) is efficient and easy to implement, but it’s not very suitable for PC platform. This paper describes a parallel Dijkstra’s algorithm. We designed the parallel algorithm and realized it based on multi-core PC and MPI software platform. The implementation is convenient, and the performance experiment shows that the algorithm has satisfied speedup and efficiency.

Download Full-text

Efficient parallel algorithm for listing permutation with Message Passing Interface (MPI)

2015 International Symposium on Mathematical Sciences and Computing Research (iSMSC) ◽

10.1109/ismsc.2015.7594077 ◽

2015 ◽

Author(s):

Sharmila Karim ◽

Zurni Omar ◽

Haslinda Ibrahim

Keyword(s):

Parallel Algorithm ◽

Message Passing ◽

Message Passing Interface

Download Full-text

Multi-level Parallelization of Genotype Imputation on Supercomputers

Current Bioinformatics ◽

10.2174/1574893615999200420071307 ◽

2020 ◽

Vol 15 ◽

Author(s):

Weiwen Zhang ◽

Long Wang ◽

Theint Theint Aye ◽

Juniarto Samsudin ◽

Yongqing Zhu

Keyword(s):

Association Study ◽

Message Passing ◽

High Performance ◽

Message Passing Interface ◽

Genome Wide Association Study ◽

Job Scheduling ◽

Genotype Imputation ◽

Job Level ◽

Multi Level ◽

High Performance Requirement

Background: Genotype imputation as a service is developed to enable researchers to estimate genotypes on haplotyped data without performing whole genome sequencing. However, genotype imputation is computation intensive and thus it remains a challenge to satisfy the high performance requirement of genome wide association study (GWAS). Objective: In this paper, we propose a high performance computing solution for genotype imputation on supercomputers to enhance its execution performance. Method: We design and implement a multi-level parallelization that includes job level, process level and thread level parallelization, enabled by job scheduling management, message passing interface (MPI) and OpenMP, respectively. It involves job distribution, chunk partition and execution, parallelized iteration for imputation and data concatenation. Due to the design of multi-level parallelization, we can exploit the multi-machine/multi-core architecture to improve the performance of genotype imputation. Results: Experiment results show that our proposed method can outperform the Hadoop-based implementation of genotype imputation. Moreover, we conduct the experiments on supercomputers to evaluate the performance of the proposed method. The evaluation shows that it can significantly shorten the execution time, thus improving the performance for genotype imputation. Conclusion: The proposed multi-level parallelization, when deployed as an imputation as a service, will facilitate bioinformatics researchers in Singapore to conduct genotype imputation and enhance the association study.

Download Full-text