MCMA: A modular processing elements array based low-power coarse-grained reconfigurable accelerator

Author(s):  
Remi Chaintreuil ◽  
Rie Uno ◽  
Hideharu Amano
Author(s):  
Dennis Wolf ◽  
Andreas Engel ◽  
Tajas Ruschke ◽  
Andreas Koch ◽  
Christian Hochberger

AbstractCoarse Grained Reconfigurable Arrays (CGRAs) or Architectures are a concept for hardware accelerators based on the idea of distributing workload over Processing Elements. These processors exploit instruction level parallelism, while being energy efficient due to their simplistic internal structure. However, the incorporation into a complete computing system raises severe challenges at the hardware and software level. This article evaluates a CGRA integrated into a control engineering environment targeting a Xilinx Zynq System on Chip (SoC) in detail. Besides the actual application execution performance, the practicability of the configuration toolchain is validated. Challenges of the real-world integration are discussed and practical insights are highlighted.


Author(s):  
Yoonjin Kim ◽  
Rabi N. Mahapatra ◽  
Ilhyun Park ◽  
Kiyoung Choi

2013 ◽  
Vol 22 (03) ◽  
pp. 1350001 ◽  
Author(s):  
YOONJIN KIM

Coarse-grained reconfigurable architectures (CGRA) require many processing elements (PEs) and a configuration memory unit (configuration cache) for reconfiguration of its PE array. Although this structure is meant for high performance and flexibility, it consumes significant power. Specially, power consumption by configuration cache is explicit overhead compared to other types of IP cores. Reducing power in configuration cache is very crucial for CGRA to be more competitive and reliable processing core in embedded systems. In this paper, I propose a power-efficient configuration cache structure based on two design schemes — one is a reusable context pipelining (RCP) architecture to reduce power-overhead caused by reconfiguration and another is a dynamic context management strategy for power saving in configuration cache. This power-efficient approach works without degrading the performance and flexibility of CGRA. Experimental results show that the proposed approach saves 56.50%/86.84% of the average power in write/read-operation of configuration cache compared to the previous design.


Integration ◽  
2018 ◽  
Vol 62 ◽  
pp. 170-181 ◽  
Author(s):  
Mohamed Asan Basiri M ◽  
Sandeep K. Shukla

Sign in / Sign up

Export Citation Format

Share Document