Analysis of RAPL Energy Prediction Accuracy in a Matrix Multiplication Application on Shared Memory

Author(s):  
Juan Manuel Paniego ◽  
Silvana Gallo ◽  
Martín Pi Puig ◽  
Franco Chichizola ◽  
Laura De Giusti ◽  
...  
1995 ◽  
Vol 4 (4) ◽  
pp. 275-289 ◽  
Author(s):  
B. Kumar ◽  
C.-H. Huang ◽  
P. Sadayappan ◽  
R.W. Johnson

In this article, we present a program generation strategy of Strassen's matrix multiplication algorithm using a programming methodology based on tensor product formulas. In this methodology, block recursive programs such as the fast Fourier Transforms and Strassen's matrix multiplication algorithm are expressed as algebraic formulas involving tensor products and other matrix operations. Such formulas can be systematically translated to high-performance parallel/vector codes for various architectures. In this article, we present a nonrecursive implementation of Strassen's algorithm for shared memory vector processors such as the Cray Y-MP. A previous implementation of Strassen's algorithm synthesized from tensor product formulas required working storage of size O(7n) for multiplying 2n× 2nmatrices. We present a modified formulation in which the working storage requirement is reduced to O(4n). The modified formulation exhibits sufficient parallelism for efficient implementation on a shared memory multiprocessor. Performance results on a Cray Y-MP8/64 are presented.


2005 ◽  
Vol 15 (04) ◽  
pp. 367-378 ◽  
Author(s):  
RAIMI RUFAI ◽  
MUSLIM BOZYIGIT ◽  
JARALLA ALGHAMDI ◽  
MOATAZ AHMED

While multithreaded programming is an effective way to exploit concurrency, multithreaded programs are notoriously hard to program, debug and tune for performance. In this paper, we present OpenMP shared memory programming as a viable alternative and a much simpler way to write multithreaded programs. We show through empirical results obtained by running, on a single processor machine, a simple matrix multiplication program written in OpenMP C that the drop in performance compared with the single threaded version even on a uniprocessor machine may be negligible. However, this is well compensated for by the increased programmer productivity resulting from the ease of programming, debugging, tuning and the relative ease of OpenMP skill acquisition.


2018 ◽  
pp. 111-118
Author(s):  
Tigran Galstyan

The combination of OpenMP and MPI in programming is called hybrid programming. Hybrid programming (through messages and shared memory) has gained an important role since the appearance of cluster architectures. A hybrid programming method combines the MPI and OpenMP libraries to use this hierarchical multi-core architecture. The purpose of this work is to carry out the performance analysis of matrix multiplication algorithms in a cluster system. Each node in the cluster consists of multiple-core CPUs, in which memory is distributed among the nodes and shared memory. Algorithms use MPI as a message-passing mechanism and OpenMP as shared memory.


2017 ◽  
Vol 2017 ◽  
pp. 1-14 ◽  
Author(s):  
Muhammad ◽  
Hassaan Khaliq Qureshi ◽  
Umber Saleem ◽  
Muhammad Saleem ◽  
Andreas Pitsillides ◽  
...  

We review harvested energy prediction schemes to be used in wireless sensor networks and explore the relative merits of landmark solutions. We propose enhancements to the well-known Profile-Energy (Pro-Energy) model, the so-called Improved Profile-Energy (IPro-Energy), and compare its performance with Accurate Solar Irradiance Prediction Model (ASIM), Pro-Energy, and Weather Conditioned Moving Average (WCMA). The performance metrics considered are the prediction accuracy and the execution time which measure the implementation complexity. In addition, the effectiveness of the considered models, when integrated in an energy management scheme, is also investigated in terms of the achieved throughput and the energy consumption. Both solar irradiance and wind power datasets are used for the evaluation study. Our results indicate that the proposed IPro-Energy scheme outperforms the other candidate models in terms of the prediction accuracy achieved by up to 78% for short term predictions and 50% for medium term prediction horizons. For long term predictions, its prediction accuracy is comparable to the Pro-Energy model but outperforms the other models by up to 64%. In addition, the IPro scheme is able to achieve the highest throughput when integrated in the developed energy management scheme. Finally, the ASIM scheme reports the smallest implementation complexity.


Sign in / Sign up

Export Citation Format

Share Document