Optimizing thread throughput for multithreaded workloads on memory constrained CMPs

Proceedings of the 2008 conference on Computing frontiers - CF '08 ◽

10.1145/1366230.1366256 ◽

2008 ◽

Author(s):

Major Bhadauria ◽

Sally A. McKee

Keyword(s):

Multithreaded Workloads

Download Full-text

Pre-execution power consumption prediction of computational multithreaded workloads

Cluster Computing ◽

10.1007/s10586-014-0401-0 ◽

2014 ◽

Vol 17 (4) ◽

pp. 1323-1333 ◽

Author(s):

Hamid Fadishei ◽

Hossein Deldari ◽

Mahmoud Naghibzadeh

Keyword(s):

Power Consumption ◽

Multithreaded Workloads ◽

Power Consumption Prediction ◽

Consumption Prediction

Download Full-text

RPPM: Rapid Performance Prediction of Multithreaded Workloads on Multicore Processors

2019 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) ◽

10.1109/ispass.2019.00038 ◽

2019 ◽

Author(s):

Sander De Pestel ◽

Sam Van den Steen ◽

Shoaib Akram ◽

Lieven Eeckhout

Keyword(s):

Performance Prediction ◽

Multicore Processors ◽

Multithreaded Workloads

Download Full-text

Toward Model Checking-Driven Fair Comparison of Dynamic Thermal Management Techniques Under Multithreaded Workloads

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems ◽

10.1109/tcad.2019.2921313 ◽

2020 ◽

Vol 39 (8) ◽

pp. 1725-1738

Author(s):

Syed Ali Asadullah Bukhari ◽

Faiq Khalid ◽

Osman Hasan ◽

Muhammad Shafique ◽

Jorg Henkel

Keyword(s):

Model Checking ◽

Thermal Management ◽

Dynamic Thermal Management ◽

Fair Comparison ◽

Management Techniques ◽

Multithreaded Workloads

Download Full-text

Collecting whole-system reference traces of multiprogrammed and multithreaded workloads

ACM SIGSOFT Software Engineering Notes ◽

10.1145/974043.974080 ◽

2004 ◽

Vol 29 (1) ◽

pp. 228-237

Author(s):

Scott F. Kaplan

Keyword(s):

Multithreaded Workloads

Download Full-text

Adaptive Power Capping for Servers with Multithreaded Workloads

10.1109/mm.2012.59 ◽

2012 ◽

Vol 32 (5) ◽

pp. 64-75 ◽

Author(s):

Sherief Reda ◽

Ryan Cochran ◽

Ayse K. Coskun

Keyword(s):

Power Capping ◽

Adaptive Power ◽

Multithreaded Workloads

Download Full-text

An efficient cache flat storage organization for multithreaded workloads for low power processors

Future Generation Computer Systems ◽

10.1016/j.future.2019.11.024 ◽

2020 ◽

Vol 110 ◽

pp. 1037-1054

Author(s):

José Puche ◽

Salvador Petit ◽

María E. Gómez ◽

Julio Sahuquillo

Keyword(s):

Low Power ◽

Multithreaded Workloads

Download Full-text

(Mis)understanding the NUMA memory system performance of multithreaded workloads

2013 IEEE International Symposium on Workload Characterization (IISWC) ◽

10.1109/iiswc.2013.6704666 ◽

2013 ◽

Author(s):

Zoltan Majo ◽

Thomas R. Gross

Keyword(s):

System Performance ◽

Memory System ◽

Multithreaded Workloads

Download Full-text

Power Token Balancing: Adapting CMPs to Power Constraints for Parallel Multithreaded Workloads

2011 IEEE International Parallel & Distributed Processing Symposium ◽

10.1109/ipdps.2011.49 ◽

2011 ◽

Author(s):

Juan M. Cebri´n ◽

Juan L. Aragón ◽

Stefanos Kaxiras

Keyword(s):

Power Constraints ◽

Multithreaded Workloads

Download Full-text

Collecting whole-system reference traces of multiprogrammed and multithreaded workloads

10.1145/974044.974080 ◽

2004 ◽

Author(s):

Scott F. Kaplan

Keyword(s):

Multithreaded Workloads

Download Full-text

A Workload-Adaptive and Reconfigurable Bus Architecture for Multicore Processors

International Journal of Reconfigurable Computing ◽

10.1155/2010/205852 ◽

2010 ◽

Vol 2010 ◽

pp. 1-22 ◽

Author(s):

Shoaib Akram ◽

Alexandros Papakonstantinou ◽

Rakesh Kumar ◽

Deming Chen

Keyword(s):

Shared Memory ◽

Interconnection Networks ◽

Interconnection Network ◽

Multicore Processors ◽

Scale Up ◽

Cost Effective ◽

Reconfigurable Logic ◽

Multithreaded Workloads ◽

Adaptive Policies ◽

Bus Architecture

Interconnection networks for multicore processors are traditionally designed to serve a diversity of workloads. However, different workloads or even different execution phases of the same workload may benefit from different interconnect configurations. In this paper, we first motivate the need for workload-adaptive interconnection networks. Subsequently, we describe an interconnection network framework based on reconfigurable switches for use in medium-scale (up to 32 cores) shared memory multicore processors. Our cost-effective reconfigurable interconnection network is implemented on a traditional shared bus interconnect with snoopy-based coherence, and it enables improved multicore performance. The proposed interconnect architecture distributes the cores of the processor into clusters with reconfigurable logic between clusters to support workload-adaptive policies for inter-cluster communication. Our interconnection scheme is complemented by interconnect-aware scheduling and additional interconnect optimizations which help boost the performance of multiprogramming and multithreaded workloads. We provide experimental results that show that the overall throughput of multiprogramming workloads (consisting of two and four programs) can be improved by up to 60% with our configurable bus architecture. Similar gains can be achieved also for multithreaded applications as shown by further experiments. Finally, we present the performance sensitivity of the proposed interconnect architecture on shared memory bandwidth availability.

Download Full-text