Performance Models for Matrix Computations on Multicore Processors Using OpenMP

2010 International Conference on Parallel and Distributed Computing, Applications and Technologies ◽

10.1109/pdcat.2010.52 ◽

2010 ◽

Author(s):

Panagiotis D. Michailidis ◽

Konstantinos G. Margaritis

Keyword(s):

Multicore Processors ◽

Performance Models ◽

Matrix Computations

Download Full-text

LEVERAGING SHARED CACHES FOR PARALLEL TEMPORAL BLOCKING OF STENCIL CODES ON MULTICORE PROCESSORS AND CLUSTERS

Parallel Processing Letters ◽

10.1142/s0129626410000296 ◽

2010 ◽

Vol 20 (04) ◽

pp. 359-376 ◽

Author(s):

MARKUS WITTMANN ◽

GEORG HAGER ◽

JAN TREIBIG ◽

GERHARD WELLEIN

Keyword(s):

Distributed Memory ◽

Multicore Processors ◽

Memory Bandwidth ◽

Performance Models ◽

Shared Caches ◽

Strong Scaling ◽

Stencil Codes ◽

Multicore Chips ◽

Synthetic Modeling

Bandwidth-starved multicore chips have become ubiquitous. It is well known that the performance of stencil codes can be improved by temporal blocking, lessening the pressure on the memory interface. We introduce a new pipelined approach that makes explicit use of shared caches in multicore environments and minimizes synchronization and boundary overhead. Benchmark results are presented for three current x86-based microprocessors, showing clearly that our optimization works best on designs with high-speed shared caches and low memory bandwidth per core. We furthermore demonstrate that simple bandwidth-based performance models are inaccurate for this kind of algorithm and employ a more elaborate, synthetic modeling procedure. Finally we show that temporal blocking can be employed successfully in a hybrid shared/distributed-memory environment, albeit with limited benefit at strong scaling.

Download Full-text

From models to methods to models: Tools and techniques for using, developing, and analyzing cognitive human performance models

PsycEXTRA Dataset ◽

10.1037/e577362012-008 ◽

2005 ◽

Author(s):

Wayne D. Gray ◽

Christopher W. Myers

Keyword(s):

Human Performance ◽

Performance Models ◽

Tools And Techniques

Download Full-text

Early generation of performance models for object-oriented systems

IEE Proceedings - Software ◽

10.1049/ip-sen:20000755 ◽

2000 ◽

Vol 147 (3) ◽

pp. 61 ◽

Author(s):

V. Cortellessa ◽

G. Iazeolla ◽

R. Mirandola

Keyword(s):

Object Oriented ◽

Performance Models ◽

Early Generation ◽

Object Oriented Systems

Download Full-text

Process Scheduling Challenges in the Era of Multicore Processors

Intel Technology Journal ◽

10.1535/itj.1104.09 ◽

2007 ◽

Vol 11 (04) ◽

Author(s):

Suresh Siddha

Keyword(s):

Multicore Processors ◽

Process Scheduling

Download Full-text

Parallel Matrix Computations.

10.21236/ada170699 ◽

1986 ◽

Author(s):

G. W. Stewart ◽

Dianne P. O'Leary

Keyword(s):

Matrix Computations

Download Full-text

Sequential and Parallel Matrix Computations.

10.21236/ada166062 ◽

1985 ◽

Author(s):

Biswa N. Datta

Keyword(s):

Matrix Computations

Download Full-text

Large Sparse Stable Matrix Computations

10.21236/ada229837 ◽

1990 ◽

Author(s):

Alex Pothen ◽

Jesse L. Barlow

Keyword(s):

Matrix Computations ◽

Download Full-text

Wavelets, Signal Processing and Matrix Computations

10.21236/ada283832 ◽

1994 ◽

Author(s):

Bruce W. Suter

Keyword(s):

Signal Processing ◽

Matrix Computations

Download Full-text

Physical Ability-Task Performance Models: Assessing the Risk of Omitted Variable Bias

10.21236/ada515128 ◽

2008 ◽

Author(s):

Jr. Vickers ◽

Hodgdon Ross R. ◽

Beckett James A. ◽

Marcie B.

Keyword(s):

Task Performance ◽

Performance Models ◽

Physical Ability ◽

Omitted Variable Bias ◽

Download Full-text

Integrating CMMI and TSP/PSP: Using TSP Data to Create Process Performance Models

10.21236/ada512409 ◽

2009 ◽

Author(s):

Shurei Tamura

Keyword(s):

Process Performance ◽

Performance Models

Download Full-text