Data access reorganizations in compiling out-of-core data parallel programs on distributed memory machines

Proceedings 11th International Parallel Processing Symposium ◽

10.1109/ipps.1997.580956 ◽

2002 ◽

Author(s):

M. Kandemir ◽

R. Bordawekar ◽

A. Choudhary

Keyword(s):

Distributed Memory ◽

Data Access ◽

Parallel Programs ◽

Core Data ◽

Data Parallel ◽

Distributed Memory Machines

Download Full-text

Compilation of out-of-core data parallel programs for distributed memory machines

ACM SIGARCH Computer Architecture News ◽

10.1145/190787.190793 ◽

1994 ◽

Vol 22 (4) ◽

pp. 23-28 ◽

Author(s):

Rajeev Thakur ◽

Rajesh Bordawekar ◽

Alok Choudhary

Keyword(s):

Distributed Memory ◽

Parallel Programs ◽

Core Data ◽

Data Parallel ◽

Distributed Memory Machines

Download Full-text

A model and compilation strategy for out-of-core data parallel programs

ACM SIGPLAN Notices ◽

10.1145/209937.209938 ◽

1995 ◽

Vol 30 (8) ◽

pp. 1-10 ◽

Author(s):

Rajesh Bordawekar ◽

Alok Choudhary ◽

Ken Kennedy ◽

Charles Koelbel ◽

Michael Paleczny

Keyword(s):

Parallel Programs ◽

Core Data ◽

Download Full-text

A model and compilation strategy for out-of-core data parallel programs

Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming - PPOPP '95 ◽

10.1145/209936.209938 ◽

1995 ◽

Author(s):

Rajesh Bordawekar ◽

Alok Choudhary ◽

Ken Kennedy ◽

Charles Koelbel ◽

Michael Paleczny

Keyword(s):

Parallel Programs ◽

Core Data ◽

Download Full-text

MIMD, Multiple Instruction, Multiple Data

Introduction to Parallel Computing ◽

10.1093/oso/9780198515760.003.0010 ◽

2004 ◽

Author(s):

Wesley Petersen ◽

Peter Arbenz

Keyword(s):

Shared Memory ◽

Message Passing ◽

Distributed Memory ◽

Programming Model ◽

Data Access ◽

File Server ◽

Distributed Memory Machines ◽

Shared Data ◽

Multiple Data ◽

The Multiple instruction, multiple data (MIMD) programming model usually refers to computing on distributed memory machines with multiple independent processors. Although processors may run independent instruction streams, we are interested in streams that are always portions of a single program. Between processors which share a coherent memory view (within a node), data access is immediate, whereas between nodes data access is effected by message passing. In this book, we use MPI for such message passing. MPI has emerged as a more/less standard message passing system used on both shared memory and distributed memory machines. It is often the case that although the system consists of multiple independent instruction streams, the programming model is not too different from SIMD. Namely, the totality of a program is logically split into many independent tasks each processed by a group (see Appendix D) of processes—but the overall program is effectively single threaded at the beginning, and likewise at the end. The MIMD model, however, is extremely flexible in that no one process is always master and the other processes slaves. A communicator group of processes performs certain tasks, usually with an arbitrary master/slave relationship. One process may be assigned to be master (or root) and coordinates the tasks of others in the group. We emphasize that the assignments of which is root is arbitrary—any processor may be chosen. Frequently, however, this choice is one of convenience—a file server node, for example. Processors and memory are connected by a network, for example, Figure 5.1. In this form, each processor has its own local memory. This is not always the case: The Cray X1, and NEC SX-6 through SX-8 series machines, have common memory within nodes. Within a node, memory coherency is maintained within local caches. Between nodes, it remains the programmer’s responsibility to assure a proper read–update relationship in the shared data. Data updated by one set of processes should not be clobbered by another set until the data are properly used.

Download Full-text

Load balancing data parallel programs on distributed memory computers

Parallel Computing ◽

10.1016/0167-8191(93)90027-i ◽

1993 ◽

Vol 19 (11) ◽

pp. 1199-1219 ◽

Author(s):

J De Keyser ◽

D Roose

Keyword(s):

Load Balancing ◽

Distributed Memory ◽

Parallel Programs ◽

Download Full-text

An improved two-step algorithm for task and data parallel scheduling in distributed memory machines

Parallel Computing ◽

10.1016/j.parco.2006.08.004 ◽

2006 ◽

Vol 32 (10) ◽

pp. 759-774 ◽

Author(s):

Savina Bansal ◽

Padam Kumar ◽

Kuldip Singh

Keyword(s):

Distributed Memory ◽

Parallel Scheduling ◽

Data Parallel ◽

Distributed Memory Machines ◽

Download Full-text

Nonblocking Data Structures for Distributed-Memory Machines: Stacks as an Example

2021 29th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP) ◽

10.1109/pdp52278.2021.00012 ◽

2021 ◽

Author(s):

Thanh-Dang Diep ◽

Karl Furlinger

Keyword(s):

Data Structures ◽

Distributed Memory ◽

Distributed Memory Machines

Download Full-text

DIAGONAL-IMPLICITLY ITERATED RUNGE–KUTTA METHODS ON DISTRIBUTED MEMORY MACHINES

International Journal of High Speed Computing ◽

10.1142/s0129053399000090 ◽

1999 ◽

Vol 10 (02) ◽

pp. 185-207 ◽

Author(s):

THOMAS RAUBER ◽

GUDULA RÜNGER

Keyword(s):

Distributed Memory ◽

Distributed Memory Machines ◽

Download Full-text

Parallelizing molecular dynamics programs for distributed-memory machines

IEEE Computational Science and Engineering ◽

10.1109/99.388949 ◽

1995 ◽

Vol 2 (2) ◽

pp. 18-29 ◽

Author(s):

Ynan-Shin Hwang ◽

R. Das ◽

J.H. Saltz ◽

M. Hodoscek ◽

B.R. Brooks

Keyword(s):

Molecular Dynamics ◽

Distributed Memory ◽

Distributed Memory Machines

Download Full-text

Static analysis to reduce synchronization costs in data-parallel programs

Proceedings of the 23rd ACM SIGPLAN-SIGACT symposium on Principles of programming languages - POPL '96 ◽

10.1145/237721.237799 ◽

1996 ◽

Author(s):

Manish Gupta ◽

Edith Schonberg

Keyword(s):

Static Analysis ◽

Parallel Programs ◽

Download Full-text