simultaneous multithreading
Recently Published Documents


TOTAL DOCUMENTS

101
(FIVE YEARS 10)

H-INDEX

19
(FIVE YEARS 1)

2021 ◽  
Author(s):  
Josué Feliu ◽  
Alberto Ros ◽  
Manuel E. Acacio ◽  
Stefanos Kaxiras

2021 ◽  
Vol 2021 ◽  
pp. 1-13
Author(s):  
Mohamed S. Esseissah ◽  
Ashraf Bhery ◽  
Sameh S. Daoud ◽  
Hatem M. Bahig

Hard Lattice problems are assumed to be one of the most promising problems for generating cryptosystems that are secure in quantum computing. The shortest vector problem (SVP) is one of the most famous lattice problems. In this paper, we present three improvements on GPU-based parallel algorithms for solving SVP using the classical enumeration and pruned enumeration. There are two improvements for preprocessing: we use a combination of randomization and the Gaussian heuristic to expect a better basis that leads rapidly to a shortest vector and we expect the level on which the exchanging data between CPU and GPU is optimized. In the third improvement, we improve GPU-based implementation by generating some points in GPU rather than in CPU. We used NVIDIA GeForce GPUs of type GTX 1060 6G. We achieved a significant improvement upon Hermans’s improvement. The improvements speed up the pruned enumeration by a factor of almost 2.5 using a single GPU. Additionally, we provided an implementation for multi-GPUs by using two GPUs. The results showed that our algorithm of enumeration is scalable since the speedups achieved using two GPUs are almost faster than Hermans’s improvement by a factor of almost 5. The improvements also provided a high speedup for the classical enumeration. The speedup achieved using our improvements and two GPUs on a challenge of dimension 60 is almost faster by factor 2 than Correia’s parallel implementation using a dual-socket machine with 16 physical cores and simultaneous multithreading technology.


2020 ◽  
Author(s):  
Matheus Serpa ◽  
Eduardo Cruz ◽  
Matthias Diener ◽  
Antonio Carlos Beck ◽  
Philippe Navaux

Aplicações paralelas executadas em processadores SMT (Simultaneous Multithreading) competem por unidades de execução. O problema fica ainda pior, quando as threads executam instruções semelhantes, como por exemplo de ponto flutuante, inteiro, load e store. Nesses casos, o mesmo tipo de instrução é despachado para execução, o que leva a perdas de desempenho devido a contenção nessas unidades. Este trabalho tem como objetivo fornecer um mecanismo para mapeamento de múltiplas aplicações paralelas em processadores SMT. O mecanismo foca em melhorar o desempenho, mitigando a contenção nas unidades de execução ao executar aplicações paralelas. Para tanto, threads que estressam as mesmas unidades de execução são mapeadas em núcleos diferentes. Os resultados mostram ganhos de desempenho de 29,1% e 17,4%, em média, quando comparado com o escalonador do sistema operacional Linux e com um mapeamento Round-robin.


2019 ◽  
Vol 18 (2) ◽  
pp. 99-102 ◽  
Author(s):  
Gil Shomron ◽  
Tal Horowitz ◽  
Uri Weiser

Sign in / Sign up

Export Citation Format

Share Document