General Purpose Parallel Architectures

1990 ◽  
pp. 943-971 ◽  
Author(s):  
L.G. VALIANT
1996 ◽  
Vol 06 (03) ◽  
pp. 309-320
Author(s):  
GIULIO DESTRI ◽  
PAOLO MARENZONI

The numerical analysis and solution of many physics and engineering problems is based on lattice-oriented algorithms. The Cellular Neural Network (CNN) computational paradigm embodies a wide set of grid problems characterized by locality of information exchanges among lattice points. Performance analysis tests using CNN-based algorithms may provide insights into the performance achievable by a given parallel architecture, with respect to a wide class of lattice problems. In this paper a message passing version of a general CNN-based algorithm is implemented and optimized for three general purpose parallel architectures: Connection Machine CM-5, Cray T3D, and IBM SP2. Separate measurements on computations and communications of the algorithm allow us to evaluate processing node and network communication performance of the machines. Moreover. the overall performance of the full application is analyzed, in order to understand the scalability and the range of applicability of this prototype of lattice problem.


2011 ◽  
Author(s):  
Richard Beare ◽  
Daniel Micevski ◽  
Chris Share ◽  
Luke Parkinson ◽  
Phillip Ward ◽  
...  

There is great interest in the use of graphics processing units (GPU)for general purpose applications because the highly parallel architectures used in GPUs offer the potential for huge performance increases. The use of GPUs in image analysis applications has been under investigation for a number of years. This article describes modifications to the InsightToolkit (ITK) that provide a simple architecture for transparent use of GPU enabled filters and examples of how to write GPU enabled filters using the NVIDIA CUDA tools.This work was performed between late 2009 and early 2010 and is being published as modifications to ITK 3.20. It is hoped that publication will help inform development of more general GPU support in ITK 4.0 and facilitate experimentation by users requiring functionality of 3.20 or wishing to pursue CUDA based developments.


1994 ◽  
Vol 19 (3) ◽  
pp. 283-293 ◽  
Author(s):  
A. H. J. Koning ◽  
K. J. Zuiderveld ◽  
M. A. Viergever

Sign in / Sign up

Export Citation Format

Share Document