compilation framework
Recently Published Documents


TOTAL DOCUMENTS

39
(FIVE YEARS 6)

H-INDEX

9
(FIVE YEARS 2)

2022 ◽  
Vol 18 (1) ◽  
pp. 1-23
Author(s):  
Jianhui Han ◽  
Xiang Fei ◽  
Zhaolin Li ◽  
Youhui Zhang

Memristor-based processing-in-memory architecture is a promising solution to the memory bottleneck in the neural network ( NN ) processing. A major challenge for the programmability of such architectures is the automatic compilation of high-level NN workloads, from various operators to the memristor-based hardware that may provide programming interfaces with different granularities. This article proposes a source-to-source compilation framework for such memristor-based NN accelerators, which can conduct automatic detection and mapping of multiple NN operators based on the flexible and rich representation capability of the polyhedral model. In contrast to previous studies, it implements support for pipeline generation to exploit the parallelism in the NN loads to leverage hardware resources for higher efficiency. The evaluation based on synthetic kernels and NN benchmarks demonstrates that the proposed framework can reliably detect and map the target operators. Case studies on typical memristor-based architectures also show its generality over various architectural designs. The evaluation further demonstrates that compared with existing polyhedral-based compilation frameworks that do not support the pipelined execution, the performance can upgrade by an order of magnitude with the pipelined execution, which emphasizes the necessity of our improvement.


2021 ◽  
Vol 20 (5s) ◽  
pp. 1-24
Author(s):  
Daniele Parravicini ◽  
Davide Conficconi ◽  
Emanuele Del Sozzo ◽  
Christian Pilato ◽  
Marco D. Santambrogio

Regular Expression (RE) matching is a computational kernel used in several applications. Since RE complexity and data volumes are steadily increasing, hardware acceleration is gaining attention also for this problem. Existing approaches have limited flexibility as they require a different implementation for each RE. On the other hand, it is complex to map efficient RE representations like non-deterministic finite-state automata onto software-programmable engines or parallel architectures. In this work, we present CICERO , an end-to-end framework composed of a domain-specific architecture and a companion compilation framework for RE matching. Our solution is suitable for many applications, such as genomics/proteomics and natural language processing. CICERO aims at exploiting the intrinsic parallelism of non-deterministic representations of the REs. CICERO can trade-off accelerators’ efficiency and processors’ flexibility thanks to its programmable architecture and the compilation framework. We implemented CICERO prototypes on embedded FPGA achieving up to 28.6× and 20.8× more energy efficiency than embedded and mainstream processors, respectively. Since it is a programmable architecture, it can be implemented as a custom ASIC that is orders of magnitude more energy-efficient than mainstream processors.


2020 ◽  
Vol 10 (20) ◽  
pp. 174-202
Author(s):  
Mehr Ali Baran Cheshmeh ◽  
Ali Mohammad Safania ◽  
Mohsen Bagherian Farahabadi ◽  
Salahdin Naqshbandi ◽  
◽  
...  

2018 ◽  
Vol 111 ◽  
pp. 260-272 ◽  
Author(s):  
Raphael Polig ◽  
Kubilay Atasu ◽  
Heiner Giefers ◽  
Christoph Hagleitner ◽  
Laura Chiticariu ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document