simultaneous multithreading Latest Research Papers

ITSLF: Inter-Thread Store-to-Load Forwarding in Simultaneous Multithreading

10.1145/3466752.3480086 ◽

2021 ◽

Author(s):

Josué Feliu ◽

Alberto Ros ◽

Manuel E. Acacio ◽

Stefanos Kaxiras

Keyword(s):

Simultaneous Multithreading

Reinforcement learning-based register renaming policy for simultaneous multithreading CPUs

Expert Systems with Applications ◽

10.1016/j.eswa.2021.115717 ◽

2021 ◽

pp. 115717

Author(s):

Huixin Zhan ◽

Victor S. Sheng ◽

Wei-Ming Lin

Keyword(s):

Reinforcement Learning ◽

Simultaneous Multithreading ◽

Register Renaming

Simultaneous Multithreading in Mixed-Criticality Real-Time Systems

2021 IEEE 27th Real-Time and Embedded Technology and Applications Symposium (RTAS) ◽

10.1109/rtas52030.2021.00030 ◽

2021 ◽

Author(s):

Joshua Bakita ◽

Shareef Ahmed ◽

Sims Hill Osborne ◽

Stephen Tang ◽

Jingyuan Chen ◽

...

Keyword(s):

Real Time ◽

Simultaneous Multithreading ◽

Real Time Systems ◽

Time Systems ◽

Mixed Criticality

Three Strategies for Improving Shortest Vector Enumeration Using GPUs

Scientific Programming ◽

10.1155/2021/8852497 ◽

2021 ◽

Vol 2021 ◽

pp. 1-13

Author(s):

Mohamed S. Esseissah ◽

Ashraf Bhery ◽

Sameh S. Daoud ◽

Hatem M. Bahig

Keyword(s):

Quantum Computing ◽

Parallel Algorithms ◽

Parallel Implementation ◽

Simultaneous Multithreading ◽

The Third ◽

Vector Problem ◽

Shortest Vector Problem ◽

Speed Up ◽

Multithreading Technology

Hard lattice problems are regarded as among the most promising foundations for cryptosystems that remain secure against quantum computing. The shortest vector problem (SVP) is one of the best-known lattice problems. In this paper, we present three improvements to GPU-based parallel algorithms for solving SVP using classical enumeration and pruned enumeration. Two of the improvements concern preprocessing: we combine randomization with the Gaussian heuristic to predict a better basis that leads rapidly to a shortest vector, and we predict the level at which the data exchange between CPU and GPU is optimized. The third improvement refines the GPU-based implementation by generating some points on the GPU rather than on the CPU. We used NVIDIA GeForce GTX 1060 6G GPUs. We achieved a significant improvement upon Hermans’s improvement. The improvements speed up the pruned enumeration by a factor of almost 2.5 using a single GPU. Additionally, we provide a multi-GPU implementation using two GPUs. The results show that our enumeration algorithm is scalable, since with two GPUs it is faster than Hermans’s improvement by a factor of almost 5. The improvements also yield a high speedup for the classical enumeration. On a challenge of dimension 60, our improvements with two GPUs are faster by a factor of almost 2 than Correia’s parallel implementation on a dual-socket machine with 16 physical cores and simultaneous multithreading technology.
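
For context, the Gaussian heuristic mentioned in the abstract above estimates the length of a shortest nonzero vector of an n-dimensional lattice as the radius of the n-ball whose volume equals the lattice volume. The following is a minimal sketch of that standard estimate only; the function and parameter names are chosen here for illustration, and it is not the paper's preprocessing code.

```python
import math


def gaussian_heuristic(dimension, log_volume):
    """Estimate the shortest-vector length of a full-rank lattice.

    dimension:  lattice dimension n (illustrative parameter name)
    log_volume: natural log of the lattice volume, i.e. log |det(basis)|
    """
    # Radius r of the n-ball with volume equal to the lattice volume:
    #   r = Gamma(n/2 + 1)^(1/n) / sqrt(pi) * vol^(1/n)
    # Computed in log space so large dimensions do not overflow.
    log_radius = (math.lgamma(dimension / 2 + 1) + log_volume) / dimension
    return math.exp(log_radius) / math.sqrt(math.pi)


# The integer lattice Z^2 has volume 1, so log_volume is 0.
print(gaussian_heuristic(2, 0.0))
```

Passing the logarithm of the volume rather than the volume itself is a design choice of this sketch: determinants of high-dimensional bases are often too large to hold in a float.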

A Learning-based Fetch Thread Gating Mechanism for A Simultaneous Multithreading Processor

2020 Eighth International Symposium on Computing and Networking (CANDAR) ◽

10.1109/candar51075.2020.00011 ◽

2020 ◽

Author(s):

Yosuke Ide ◽

Nobuyuki Yamasaki

Keyword(s):

Simultaneous Multithreading ◽

Gating Mechanism

Mitigating Contention in Execution Units with Instruction-Aware Mapping (Atenuando a Contenção nas Unidades de Execução com Mapeamento Instruction-Aware)

10.5753/wscad.2020.14073 ◽

2020 ◽

Author(s):

Matheus Serpa ◽

Eduardo Cruz ◽

Matthias Diener ◽

Antonio Carlos Beck ◽

Philippe Navaux

Keyword(s):

Round Robin ◽

Simultaneous Multithreading

Parallel applications running on SMT (Simultaneous Multithreading) processors compete for execution units. The problem gets even worse when the threads execute similar instructions, such as floating-point, integer, load, and store instructions. In these cases, the same type of instruction is dispatched for execution, which leads to performance losses due to contention in these units. This work aims to provide a mechanism for mapping multiple parallel applications onto SMT processors. The mechanism focuses on improving performance by mitigating contention in the execution units when running parallel applications. To that end, threads that stress the same execution units are mapped to different cores. The results show average performance gains of 29.1% and 17.4% compared with the Linux operating system scheduler and with a round-robin mapping, respectively.
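
The core idea in the abstract above, placing threads that stress the same execution units on different cores, can be sketched roughly as follows. This is an illustrative greedy placement under simplifying assumptions, not the authors' mechanism: each thread is assumed to have a single dominant instruction type, and the labels (`"fp"`, `"int"`), function names, and the two-way SMT default are hypothetical.

```python
from collections import defaultdict


def instruction_aware_mapping(thread_types, num_cores, smt_ways=2):
    """Assign threads to SMT cores, spreading same-type threads apart.

    thread_types: dict of thread id -> dominant instruction type
                  (hypothetical labels such as "fp", "int", "load", "store")
    Returns a dict of core index -> list of thread ids.
    """
    if len(thread_types) > num_cores * smt_ways:
        raise ValueError("more threads than hardware contexts")

    # Sort so threads of the same type are adjacent, then deal them out
    # across the cores in turn: neighbours in the sorted order (same type)
    # land on different cores whenever there are enough cores.
    ordered = sorted(thread_types, key=lambda t: (thread_types[t], t))
    mapping = defaultdict(list)
    for i, thread in enumerate(ordered):
        mapping[i % num_cores].append(thread)
    return dict(mapping)


def contention_pairs(mapping, thread_types):
    """Count pairs of same-type threads that share a core."""
    conflicts = 0
    for threads in mapping.values():
        for i in range(len(threads)):
            for j in range(i + 1, len(threads)):
                if thread_types[threads[i]] == thread_types[threads[j]]:
                    conflicts += 1
    return conflicts


types = {0: "fp", 1: "int", 2: "fp", 3: "int"}
aware = instruction_aware_mapping(types, num_cores=2)
round_robin = {0: [0, 2], 1: [1, 3]}  # thread i placed on core i % 2
print(contention_pairs(aware, types), contention_pairs(round_robin, types))
```

In this toy case a round-robin placement by thread id pairs the two floating-point threads on one core and the two integer threads on the other, while the type-aware placement gives every core one thread of each kind.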

Non-Blocking Simultaneous Multithreading: Embracing the Resiliency of Deep Neural Networks

2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) ◽

10.1109/micro50266.2020.00032 ◽

2020 ◽

Author(s):

Gil Shomron ◽

Uri Weiser

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Simultaneous Multithreading

Exploiting Simultaneous Multithreading in Priority-Driven Hard Real-Time Systems

2020 IEEE 26th International Conference on Embedded and Real-Time Computing Systems and Applications (RTCSA) ◽

10.1109/rtcsa50079.2020.9203575 ◽

2020 ◽

Author(s):

Sims Hill Osborne ◽

Shareef Ahmed ◽

Saujas Nandi ◽

James H. Anderson

Keyword(s):

Real Time ◽

Simultaneous Multithreading ◽

Real Time Systems ◽

Hard Real Time ◽

Time Systems

A simultaneous multithreading processor architecture with predictable timing behavior

Design Automation for Embedded Systems ◽

10.1007/s10617-019-09224-3 ◽

2019 ◽

Vol 24 (1) ◽

pp. 45-62

Author(s):

Hadley Magno Siqueira ◽

Marcio Eduardo Kreutz

Keyword(s):

Simultaneous Multithreading ◽

Processor Architecture ◽

Timing Behavior

SMT-SA: Simultaneous Multithreading in Systolic Arrays

IEEE Computer Architecture Letters ◽

10.1109/lca.2019.2924007 ◽

2019 ◽

Vol 18 (2) ◽

pp. 99-102 ◽

Cited By ~ 4

Author(s):

Gil Shomron ◽

Tal Horowitz ◽

Uri Weiser

Keyword(s):

Systolic Arrays ◽

Simultaneous Multithreading
