parallel speedup
Recently Published Documents


TOTAL DOCUMENTS: 26 (five years: 4)

H-INDEX: 5 (five years: 0)

2021
Author(s): Jakob P. Pettersen, Eivind Almaas

Abstract

Background: Differential co-expression network analysis has become an important tool for understanding biological phenotypes and diseases. The CSD algorithm generates differential co-expression networks by comparing gene co-expression between two conditions. Each gene pair is assigned conserved (C), specific (S), and differentiated (D) scores based on its co-expression in the two conditions. The result of the procedure is a network whose nodes are genes and whose links are the gene pairs with the highest C-, S-, and D-scores. However, the existing CSD implementations suffer from poor computational performance, cumbersome user procedures, and a lack of documentation.

Results: We created the R package csdR, aiming for good performance together with ease of use, sufficient documentation, and the ability to work well with other data-analysis tools. csdR was benchmarked on a realistic dataset with 20,645 genes. After verifying that the chosen number of iterations gave sufficient robustness, we tested its performance against the two existing CSD implementations. csdR was superior in performance to one of them, whereas the other did not run. Our implementation can utilize multiple processing cores; however, we were unable to achieve more than a ∼2.7× parallel speedup, with saturation reached at about 10 cores.

Conclusions: The results suggest that csdR is a useful tool for differential co-expression analysis, able to generate robust results within a workday on datasets of realistic size when run on a workstation or compute server.
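The reported saturation is consistent with Amdahl's law. As a rough, hypothetical check (the abstract does not state a serial fraction), one can solve Amdahl's model for the parallel fraction implied by a ∼2.7× speedup on 10 cores:

```python
# Hypothetical Amdahl's-law check (not from the paper): infer the parallel
# fraction p from an observed speedup S on n cores, where
#   S(n) = 1 / ((1 - p) + p / n)
def parallel_fraction(speedup: float, cores: int) -> float:
    # Rearranging S = 1 / ((1 - p) + p/n) gives p = (1 - 1/S) / (1 - 1/n).
    return (1.0 - 1.0 / speedup) / (1.0 - 1.0 / cores)

p = parallel_fraction(speedup=2.7, cores=10)
print(f"implied parallel fraction: {p:.2f}")                 # ~0.70
print(f"asymptotic speedup limit:  {1.0 / (1.0 - p):.2f}x")  # ~3.33x
```

Under this assumed model, roughly 70% of the work parallelizes, capping the attainable speedup near 3.3× on any number of cores, which matches the observed saturation.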


Electronics · 2021 · Vol 10 (11) · pp. 1330
Author(s): Junjie Zhang, Lukas Razik, Sigurd Hofsmo Jakobsen, Salvatore D’Arco, Andrea Benigni

In this paper we introduce an approach to accelerate many-scenario (i.e., hundreds to thousands) power system simulations, based on a highly scalable and flexible open-source software environment. In this approach, the parallel execution of simulations follows the single program, multiple data (SPMD) paradigm: the dynamic simulation program is executed in parallel and takes different inputs to generate different scenarios. The power system is modeled using an existing Modelica library and compiled to a simulation executable with the OpenModelica Compiler. The parallel simulation is performed using the Message Passing Interface (MPI), and the approach includes dynamic workload balancing. Finally, the simulation environment is benchmarked on high-performance computing (HPC) clusters with four test cases. The results show that the proposed approach achieves high scalability and considerable parallel speedup in the simulation of all scenarios.
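A minimal sketch of this SPMD pattern with pull-based dynamic workload balancing, using mpi4py. The scenario list and `run_scenario` are hypothetical placeholders; in the paper, each task invokes an OpenModelica-compiled simulation executable with scenario-specific inputs.

```python
# SPMD many-scenario sketch with dynamic load balancing (mpi4py).
from mpi4py import MPI

def run_scenario(scenario_id: int) -> float:
    # Hypothetical stand-in for running the compiled simulation executable.
    return float(scenario_id)

comm = MPI.COMM_WORLD
rank, size = comm.Get_rank(), comm.Get_size()
TAG_WORK, TAG_DONE, TAG_STOP = 0, 1, 2

if rank == 0:
    # Coordinator: hands out a new scenario whenever a worker becomes free.
    scenarios = list(range(1000))
    status = MPI.Status()
    active = size - 1
    for w in range(1, size):                  # prime each worker once
        comm.send(scenarios.pop(), dest=w, tag=TAG_WORK)
    while active > 0:
        comm.recv(source=MPI.ANY_SOURCE, tag=TAG_DONE, status=status)
        worker = status.Get_source()
        if scenarios:
            comm.send(scenarios.pop(), dest=worker, tag=TAG_WORK)
        else:
            comm.send(None, dest=worker, tag=TAG_STOP)
            active -= 1
else:
    # Worker: same program, different data (SPMD).
    status = MPI.Status()
    while True:
        task = comm.recv(source=0, status=status)
        if status.Get_tag() == TAG_STOP:
            break
        comm.send(run_scenario(task), dest=0, tag=TAG_DONE)
```

Launched as, e.g., `mpiexec -n 16 python scenarios.py`, the pull-based dispatch keeps cores busy even when scenario runtimes differ widely.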


2021 · Vol 5 (2) · pp. 62-77
Author(s): Sesha Kalyur, Nagaraja G.S.

Although several automated parallel-conversion solutions are available, very few have attempted to provide proper estimates of the available inherent parallelism and the expected parallel speedup. CALIPER, the outcome of this research work, is a parallel performance estimation technology that can fill this void. High-level language structures such as functions, loops, and conditionals, which ease program development, can hinder effective performance analysis. We refer to these program structures as the Program Shape. As a preparatory step, CALIPER removes these shape-related hindrances, an activity we refer to as Program Shape Flattening. Programs are also characterized by dependences between instructions, which impose an upper limit on the parallel conversion gains. For parallel estimation, we first group instructions that share dependences into a class we refer to as a Dependence Class or Parallel Class. While the instructions within a class run sequentially, the classes themselves run in parallel; the parallel runtime is therefore the runtime of the longest-running class. We report performance estimates of parallel conversion as two metrics: the inherent parallelism in the program, reported as Maximum Available Parallelism (MAP), and the speedup after conversion, reported as Speedup After Parallelization (SAP).
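A hypothetical toy illustration of this grouping (not CALIPER's actual implementation): instructions linked by dependences are merged into classes with a union-find structure, and the estimated parallel runtime is the cost of the heaviest class.

```python
# Toy dependence-class grouping (hypothetical, not CALIPER itself):
# instructions that share a dependence end up in one class; classes are
# assumed to run in parallel, so parallel time = runtime of the longest class.
from collections import defaultdict

def dependence_classes(n_instr, deps):
    parent = list(range(n_instr))          # union-find over instructions
    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x
    for a, b in deps:                      # each dependence merges two classes
        parent[find(a)] = find(b)
    classes = defaultdict(list)
    for i in range(n_instr):
        classes[find(i)].append(i)
    return list(classes.values())

cost = [3, 1, 4, 1, 5, 9, 2, 6]            # per-instruction runtimes
deps = [(0, 1), (1, 2), (4, 5)]            # (producer, consumer) pairs
classes = dependence_classes(len(cost), deps)
seq_time = sum(cost)
par_time = max(sum(cost[i] for i in c) for c in classes)
print(f"MAP (max available parallelism) estimate: {seq_time / par_time:.2f}")
```

In this sketch MAP is the total sequential cost divided by the heaviest class; a real estimator such as CALIPER would additionally model conversion overheads to arrive at SAP.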


2017 · Vol 59 · pp. 351-435
Author(s): Lars Otten, Rina Dechter

We present a parallel AND/OR Branch-and-Bound scheme that uses the power of a computational grid to push the boundaries of feasibility for combinatorial optimization. Two variants of the scheme are described, one of which aims to use machine learning techniques for parallel load balancing. In-depth analysis identifies two inherent sources of parallel search-space redundancy that, together with general parallel execution overhead, can impede parallelization and render the problem far from embarrassingly parallel. We conduct extensive empirical evaluation on hundreds of CPUs, the first of its kind, with overall positive results. In a significant number of cases, parallel speedup is close to the theoretical maximum, and we are able to solve many very complex problem instances orders of magnitude faster than before; yet analysis of certain results also demonstrates the inherent limitations of the approach due to the aforementioned redundancies.
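A hypothetical toy sketch of one such redundancy source (far simpler than the paper's AND/OR scheme over a grid): when subtrees are searched independently in parallel, workers cannot see each other's improved incumbent bounds and therefore expand nodes that a search with one shared bound would have pruned.

```python
# Toy comparison (hypothetical, not the paper's scheme): node expansions in
# branch-and-bound with one shared incumbent vs. independent "parallel"
# workers that each keep a private incumbent.
import random

random.seed(0)
DEPTH, BRANCH = 12, 2
COSTS = {}                                  # lazily generated edge costs

def edge_cost(path, child):
    key = (path, child)
    if key not in COSTS:
        COSTS[key] = random.uniform(0.0, 1.0)
    return COSTS[key]

def bnb(path, cost_so_far, incumbent, stats):
    # Depth-first branch-and-bound minimizing total path cost.
    stats[0] += 1                           # count this node expansion
    if cost_so_far >= incumbent:            # prune: cost only grows with depth
        return incumbent
    if len(path) == DEPTH:
        return cost_so_far                  # leaf improves the incumbent
    for c in range(BRANCH):
        incumbent = bnb(path + (c,), cost_so_far + edge_cost(path, c),
                        incumbent, stats)
    return incumbent

shared = [0]                                # one search, one shared incumbent
bnb((), 0.0, float("inf"), shared)

private = [0]                               # root subtrees searched "in parallel",
for c in range(BRANCH):                     # each with its own private incumbent
    bnb((c,), edge_cost((), c), float("inf"), private)

print(f"expansions with shared bound:   {shared[0]}")
print(f"expansions with private bounds: {private[0] + 1}")  # +1 for the root
```

The private-bound total is never smaller than the shared-bound one; the gap is the redundant work introduced when improved bounds are not propagated between workers.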


2017 · Vol 21 (4) · pp. 1039-1064
Author(s): Tony W. H. Sheu, S. Z. Wang, J. H. Li, Matthew R. Smith

Abstract

In this study, an explicit Finite Difference Method (FDM) based scheme is developed to solve Maxwell's equations in the time domain for a lossless medium. This manuscript focuses on two unique aspects: the three-dimensional, time-accurate discretization of the hyperbolic system of Maxwell's equations on a three-point non-staggered grid stencil, and its application to parallel computing through the use of Graphics Processing Units (GPUs). The proposed temporal scheme is symplectic, thus permitting conservation of all Hamiltonians of the Maxwell equations. Moreover, to enable accurate predictions over large time frames, a phase-velocity-preserving scheme is developed for the treatment of the spatial derivative terms. As a result, the chosen time increment and grid spacing can be optimally coupled. An additional theoretical investigation into this pairing is also shown. Finally, the application of the proposed scheme to parallel computing using one Nvidia Tesla K20 GPU card is demonstrated. For the benchmarks performed, the parallel speedup compared to a single core of an Intel i7-4820K CPU is approximately 190×.
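For illustration only, here is a minimal explicit leapfrog update for the 1D lossless Maxwell equations on a collocated (non-staggered) grid with three-point central differences, in normalized units (ε = μ = 1). This is a generic sketch, not the authors' symplectic, phase-velocity-preserving scheme; the two vectorized update lines are the kind of kernel one would offload to a GPU.

```python
# Generic 1D lossless Maxwell leapfrog sketch on a collocated grid
# (illustrative only; NOT the paper's symplectic phase-preserving scheme).
# Normalized units: dE/dt = -dH/dx, dH/dt = -dE/dx.
import numpy as np

nx, nt = 400, 500
r = 0.5                                   # r = dt/dx; leapfrog needs r <= 1
x = np.arange(nx)
E = np.exp(-((x - nx // 2) ** 2) / 50.0)  # initial Gaussian pulse in E
H = np.zeros(nx)
E_prev, H_prev = E.copy(), H.copy()       # leapfrog keeps two time levels

for n in range(nt):
    # Three-point central differences in space, leapfrog in time:
    # E^{n+1}_i = E^{n-1}_i - r * (H^n_{i+1} - H^n_{i-1}), likewise for H.
    E_next, H_next = E_prev.copy(), H_prev.copy()
    E_next[1:-1] = E_prev[1:-1] - r * (H[2:] - H[:-2])
    H_next[1:-1] = H_prev[1:-1] - r * (E[2:] - E[:-2])
    E_prev, E = E, E_next
    H_prev, H = H, H_next

print(f"energy proxy after {nt} steps: {np.sum(E**2 + H**2):.4f}")
```

Within a time step, every interior grid point updates independently, so the two update lines map directly onto one GPU thread per point; only the time loop remains sequential.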

