Parallelization Analysis on Clusters of Multicore Nodes Using Shared and Distributed Memory Parallel Computing Models

Because many universities lack the funds to purchase expensive parallel computers, cost effective alternatives are needed to teach students about parallel processing. Free software is available to support the three major paradigms of parallel computing. Parallaxis is a sophisticated SIMD simulator which runs on a variety of platforms.jBACI shared memory simulator supports the MIMD model of computing with a common shared memory. PVM and MPI allow students to treat a network of workstations as a message passing MIMD multicomputer with distributed memory. Each of this software tools can be used in a variety of courses to give students experience with parallel algorithms.

Download Full-text

An O(log2N) Fully-Balanced Resampling Algorithm for Particle Filters on Distributed Memory Architectures

Algorithms ◽

10.3390/a14120342 ◽

2021 ◽

Vol 14 (12) ◽

pp. 342

Author(s):

Alessandro Varsi ◽

Simon Maskell ◽

Paul G. Spirakis

Keyword(s):

Parallel Computing ◽

Shared Memory ◽

Time Complexity ◽

Distributed Memory ◽

Particle Filters ◽

Dynamic Models ◽

State Of The Art ◽

Novel Approach ◽

Non Gaussian ◽

Memory Architectures

Resampling is a well-known statistical algorithm that is commonly applied in the context of Particle Filters (PFs) in order to perform state estimation for non-linear non-Gaussian dynamic models. As the models become more complex and accurate, the run-time of PF applications becomes increasingly slow. Parallel computing can help to address this. However, resampling (and, hence, PFs as well) necessarily involves a bottleneck, the redistribution step, which is notoriously challenging to parallelize if using textbook parallel computing techniques. A state-of-the-art redistribution takes O((log2N)2) computations on Distributed Memory (DM) architectures, which most supercomputers adopt, whereas redistribution can be performed in O(log2N) on Shared Memory (SM) architectures, such as GPU or mainstream CPUs. In this paper, we propose a novel parallel redistribution for DM that achieves an O(log2N) time complexity. We also present empirical results that indicate that our novel approach outperforms the O((log2N)2) approach.

Download Full-text

Boosting Performance in Parallel Computing Models with a New Experimental Architecture

Parallel Architectures, Algorithms and Programming - Communications in Computer and Information Science ◽

10.1007/978-981-16-0010-4_36 ◽

2021 ◽

pp. 418-428

Author(s):

Alberto Arteta Albert ◽

Akshay Harshakumar ◽

Luis Fernando de Mingo López ◽

Nuria Gómez Blas

Keyword(s):

Parallel Computing ◽

Computing Models ◽

Experimental Architecture

Download Full-text

A method for using object-oriented frameworks to support various high-level parallel computing models

Proceedings. Technology of Object-Oriented Languages. TOOLS 24 (Cat. No.97TB100240) ◽

10.1109/tools.1997.713538 ◽

2002 ◽

Author(s):

Lu Pei ◽

Yu Dachuan ◽

Lu Jian ◽

D.L. Shang

Keyword(s):

Parallel Computing ◽

Object Oriented ◽

High Level ◽

Computing Models

Download Full-text

503 A Study of Distributed Memory Parallel Computing Using Unstructured Mesh

The Proceedings of Conference of Kansai Branch ◽

10.1299/jsmekansai.2012.87._5-3_ ◽

2012 ◽

Vol 2012.87 (0) ◽

pp. _5-3_

Author(s):

Chikashi KAWATANI ◽

Masashi YAMAKAWA ◽

Kenichi MATSUNO

Keyword(s):

Parallel Computing ◽

Distributed Memory ◽

Unstructured Mesh

Download Full-text

Towards Structured Parallel Computing on Architecture-Independent Parallel Algorithm Design for Distributed-Memory Architectures

Journal of Computer and System Sciences ◽

10.1006/jcss.1996.0053 ◽

1996 ◽

Vol 53 (1) ◽

pp. 112-128

Author(s):

Feng Gao

Keyword(s):

Parallel Computing ◽

Parallel Algorithm ◽

Distributed Memory ◽

Algorithm Design ◽

Parallel Algorithm Design ◽

Memory Architectures

Download Full-text

A Comparative Analysis of Distributed and Parallel Computing

VFAST Transactions on Software Engineering ◽

10.21015/vtse.v13i2.507 ◽

2018 ◽

pp. 60-67

Keyword(s):

Parallel Computing ◽

Comparative Analysis ◽

Distributed Systems ◽

Data Centers ◽

Parallel Systems ◽

The Other ◽

Pros And Cons ◽

Distributed And Parallel Computing ◽

Better Than ◽

Computing Models

In the age of emerging technologies, the amount of data is increasing very rapidly. Due to massive increase of data the level of computations are increasing. Computer executes instructions sequentially. But the time has now changed and innovation has been advanced. We are currently managing gigantic data centers that perform billions of executions on consistent schedule. Truth be- hold, if we dive deep into the processor engineering and mechanism, even a successive machine works parallel. Parallel computing is growing faster as a substitute of distributing computing. The performance to functionality ratio of parallel systems is high. Also, the I/O usage of parallel systems is lower because of ability to perform all operations simultaneously. On the other hand, the performance to functionality ratio of distributed systems is low. The I/O usage of distributed systems is higher because of incapability to perform all operations simultaneously. In this paper, an overview of distributed and parallel computing is described. The basic concept of these two computing is discussed. In addition to this, pros and cons of distributed and parallel computing models are described. Through many aspects, we can conclude that parallel systems are better than distributed systems.

Download Full-text

Parallel Computing: Models

Encyclopedia of Optimization ◽

10.1007/0-306-48332-7_380 ◽

2006 ◽

pp. 1934-1939

Author(s):

Afonso Ferreira

Keyword(s):

Parallel Computing ◽

Computing Models

Download Full-text

Spiking Neural P Systems with Extended Channel Rules

International Journal of Neural Systems ◽

10.1142/s0129065720500495 ◽

2020 ◽

Vol 31 (01) ◽

pp. 2050049 ◽

Cited By ~ 1

Author(s):

Zeqiong Lv ◽

Tingting Bao ◽

Nan Zhou ◽

Hong Peng ◽

Xiangnian Huang ◽

...

Keyword(s):

Parallel Computing ◽

Control Mechanism ◽

P Systems ◽

Multiple Channels ◽

Spiking Neural P Systems ◽

Distributed Parallel Computing ◽

New Variant ◽

New Type ◽

Turing Universality ◽

Computing Models

This paper discusses a new variant of spiking neural P systems (in short, SNP systems), spiking neural P systems with extended channel rules (in short, SNP–ECR systems). SNP–ECR systems are a class of distributed parallel computing models. In SNP–ECR systems, a new type of spiking rule is introduced, called ECR. With an ECR, a neuron can send the different numbers of spikes to its subsequent neurons. Therefore, SNP–ECR systems can provide a stronger firing control mechanism compared with SNP systems and the variant with multiple channels. We discuss the Turing universality of SNP–ECR systems. It is proven that SNP–ECR systems as number generating/accepting devices are Turing universal. Moreover, we provide a small universal SNP–ECR system as function computing devices.

Download Full-text