Compilation for Distributed Memory Architectures

Author(s):  
Alok Choudhary ◽  
Mahmut Kandemir

1991 ◽  
Vol 2 (2) ◽  
pp. 45-49 ◽  
Author(s):  
Michele Di Santo ◽  
Giulio Iannello

Algorithms ◽  
2021 ◽  
Vol 14 (12) ◽  
pp. 342
Author(s):  
Alessandro Varsi ◽  
Simon Maskell ◽  
Paul G. Spirakis

Resampling is a well-known statistical algorithm that is commonly applied in the context of Particle Filters (PFs) in order to perform state estimation for non-linear, non-Gaussian dynamic models. As the models become more complex and accurate, the run-time of PF applications increases accordingly. Parallel computing can help to address this. However, resampling (and, hence, PFs as well) necessarily involves a bottleneck, the redistribution step, which is notoriously challenging to parallelize using textbook parallel computing techniques. A state-of-the-art redistribution takes O((log₂ N)²) computations on Distributed Memory (DM) architectures, which most supercomputers adopt, whereas redistribution can be performed in O(log₂ N) on Shared Memory (SM) architectures, such as GPUs or mainstream CPUs. In this paper, we propose a novel parallel redistribution for DM that achieves O(log₂ N) time complexity. We also present empirical results indicating that our novel approach outperforms the O((log₂ N)²) approach.
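
For readers unfamiliar with the step the abstract refers to, the following minimal Python/NumPy sketch illustrates the sequential baseline: systematic resampling assigns each particle an offspring count, and redistribution then copies each particle according to its count so the population is again equally weighted. This is only the O(N) single-processor version, not the authors' O(log₂ N) distributed-memory algorithm; the function names and example values are illustrative.

import numpy as np

def systematic_resample_counts(weights, rng=None):
    # Systematic resampling: one stratified draw decides how many
    # offspring (copies) each particle receives. O(N) sequential.
    rng = np.random.default_rng() if rng is None else rng
    n = len(weights)
    positions = (rng.random() + np.arange(n)) / n
    cumulative = np.cumsum(weights)
    cumulative[-1] = 1.0  # guard against floating-point round-off
    indices = np.searchsorted(cumulative, positions, side='right')
    return np.bincount(indices, minlength=n)

def redistribute(particles, counts):
    # Redistribution: copy particle i exactly counts[i] times so the new
    # population again has N members. This O(N) copy is the step that
    # becomes the bottleneck when parallelized on distributed memory.
    return np.repeat(particles, counts, axis=0)

# Illustrative usage with made-up values
particles = np.array([[0.0], [1.0], [2.0], [3.0]])
weights = np.array([0.1, 0.2, 0.3, 0.4])
counts = systematic_resample_counts(weights)
new_particles = redistribute(particles, counts)
assert new_particles.shape == particles.shape

On a single machine this redistribution is a trivial contiguous copy; the difficulty the paper addresses arises when the particles are spread across the separate address spaces of a distributed-memory machine and the copies require communication.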


2013 ◽  
Vol 29 (2) ◽  
pp. 571-579 ◽  
Author(s):  
Didier Devaurs ◽  
Thierry Simeon ◽  
Juan Cortes
