Dependency Graph-based High-level Synthesis for Maximum Instruction Parallelism

Zhenghua Gu; Wenqing Wan; Jundong Xie; Chang Wu

doi:10.1145/3468875

Dependency Graph-based High-level Synthesis for Maximum Instruction Parallelism

ACM Transactions on Reconfigurable Technology and Systems ◽

10.1145/3468875 ◽

2021 ◽

Vol 14 (4) ◽

pp. 1-15

Author(s):

Zhenghua Gu ◽

Wenqing Wan ◽

Jundong Xie ◽

Chang Wu

Keyword(s):

Performance Optimization ◽

Directed Acyclic Graph ◽

Scheduling Algorithm ◽

Dependency Graph ◽

High Level Synthesis ◽

Limiting Factor ◽

Circuit Performance ◽

State Transition Graph ◽

High Level ◽

Basic Blocks

Performance optimization is an important goal for High-level Synthesis (HLS). Existing HLS scheduling algorithms are all based on Control and Data Flow Graph (CDFG) and will schedule basic blocks in sequential order. Our study shows that the sequential scheduling order of basic blocks is a big limiting factor for achievable circuit performance. In this article, we propose a Dependency Graph (DG) with two important properties for scheduling. First, DG is a directed acyclic graph. Thus, no loop breaking heuristic is needed for scheduling. Second, DG can be used to identify the exact instruction parallelism. Our experiment shows that DG can lead to 76% instruction parallelism increase over CDFG. Based on DG, we propose a bottom-up scheduling algorithm to achieve much higher instruction parallelism than existing algorithms. Hierarchical state transition graph with guard conditions is proposed for efficient implementation of such high parallelism scheduling. Our experimental results show that our DG-based HLS algorithm can outperform the CDFG-based LegUp and the state-of-the-art industrial tool Vivado HLS by 2.88× and 1.29× on circuit latency, respectively.

Download Full-text

Buffer Placement and Sizing for High-Performance Dataflow Circuits

ACM Transactions on Reconfigurable Technology and Systems ◽

10.1145/3477053 ◽

2022 ◽

Vol 15 (1) ◽

pp. 1-32

Author(s):

Lana Josipović ◽

Shabnam Sheikhha ◽

Andrea Guerrieri ◽

Paolo Ienne ◽

Jordi Cortadella

Keyword(s):

Performance Optimization ◽

Optimization Model ◽

High Performance ◽

Control Flow ◽

High Level Synthesis ◽

Software Applications ◽

Marked Graphs ◽

Variable Latency ◽

High Level ◽

Strong Contrast

Commercial high-level synthesis tools typically produce statically scheduled circuits. Yet, effective C-to-circuit conversion of arbitrary software applications calls for dataflow circuits, as they can handle efficiently variable latencies (e.g., caches), unpredictable memory dependencies, and irregular control flow. Dataflow circuits exhibit an unconventional property: registers (usually referred to as “buffers”) can be placed anywhere in the circuit without changing its semantics, in strong contrast to what happens in traditional datapaths. Yet, although functionally irrelevant, this placement has a significant impact on the circuit’s timing and throughput. In this work, we show how to strategically place buffers into a dataflow circuit to optimize its performance. Our approach extracts a set of choice-free critical loops from arbitrary dataflow circuits and relies on the theory of marked graphs to optimize the buffer placement and sizing. Our performance optimization model supports important high-level synthesis features such as pipelined computational units, units with variable latency and throughput, and if-conversion. We demonstrate the performance benefits of our approach on a set of dataflow circuits obtained from imperative code.

Download Full-text

Multi-objective genetic scheduling algorithm with respect to allocation in high-level synthesis

Proceedings of the 26th Euromicro Conference. EUROMICRO 2000. Informatics: Inventing the Future ◽

10.1109/eurmic.2000.874651 ◽

2002 ◽

Cited By ~ 1

Author(s):

G. Papa ◽

J. Silc

Keyword(s):

Scheduling Algorithm ◽

High Level Synthesis ◽

Multi Objective ◽

High Level

Download Full-text

Optimal Design of a VLSI Processor with Spatially and Temporally Parallel Structure

Journal of Robotics and Mechatronics ◽

10.20965/jrm.1996.p0516 ◽

1996 ◽

Vol 8 (6) ◽

pp. 516-523

Author(s):

Michitaka Kameyama ◽

◽

Masayuki Sasaki

Keyword(s):

Delay Time ◽

Scheduling Algorithm ◽

High Level Synthesis ◽

Parallel Structure ◽

Data Flow Graph ◽

Silicon Area ◽

Suitable Combination ◽

Minimum Delay ◽

High Level ◽

Minimum Delay Time

In intelligent integrated systems such as robotics for autonomous work, it is essential to respond to the change of the environment very quickly. Therefore, the development of special-purpose VLSI processors with minimum delay time becomes a very important subject. A suitable combination of spatially parallel and temporally parallel processing is very important to realize the minimum delay time. In this article, we present a scheduling algorithm for high-level synthesis, where the input to the scheduler is a behavioral description viewed as a data flow graph. The scheduler minimizes the delay time under the constraint of a silicon area and I/O pins.

Download Full-text

CASCH-a scheduling algorithm for 'high level'-synthesis

10.1109/edac.1991.206414 ◽

2002 ◽

Cited By ~ 11

Author(s):

P. Gutberlet ◽

H. Kramer ◽

W. Rosenstiel

Keyword(s):

Scheduling Algorithm ◽

High Level Synthesis ◽

High Level

Download Full-text

Improving circuit performance with multispeculative additive trees in high-level synthesis

Microelectronics Journal ◽

10.1016/j.mejo.2014.06.005 ◽

2014 ◽

Vol 45 (11) ◽

pp. 1470-1479 ◽

Cited By ~ 6

Author(s):

Alberto A. Del Barrio ◽

Román Hermida ◽

Seda Ogrenci Memik ◽

José M. Mendías ◽

María C. Molina

Keyword(s):

High Level Synthesis ◽

Circuit Performance ◽

Additive Trees ◽

High Level

Download Full-text

A fast and effective lookahead and fractional search based scheduling algorithm for high-level synthesis

2018 Design, Automation & Test in Europe Conference & Exhibition (DATE) ◽

10.23919/date.2018.8341975 ◽

2018 ◽

Cited By ~ 3

Author(s):

Shantanu Dutt ◽

Ouwen Shi

Keyword(s):

Scheduling Algorithm ◽

High Level Synthesis ◽

High Level

Download Full-text

DLS: A scheduling algorithm for high-level synthesis in VHDL

1993 European Conference on Design Automation with the European Event in ASIC Design ◽

10.1109/edac.1993.386444 ◽

2002 ◽

Cited By ~ 13

Author(s):

K. O'Brien ◽

M. Rahmouni ◽

A. Jerraya

Keyword(s):

Scheduling Algorithm ◽

High Level Synthesis ◽

High Level

Download Full-text

Performance optimization using template mapping for datapath-intensive high-level synthesis

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems ◽

10.1109/43.511568 ◽

1996 ◽

Vol 15 (8) ◽

pp. 877-888 ◽

Cited By ~ 64

Author(s):

M.R. Corazao ◽

M.A. Khalaf ◽

L.M. Guerra ◽

M. Potkonjak ◽

J.M. Rabaey

Keyword(s):

Performance Optimization ◽

High Level Synthesis ◽

High Level

Download Full-text

An Optimized Scheduling Algorithm for High Level Synthesis (HLS) to Reduce the Control Steps

Asian Journal of Research in Social Sciences and Humanities ◽

10.5958/2249-7315.2016.00870.4 ◽

2016 ◽

Vol 6 (9) ◽

pp. 1289

Author(s):

M. Chinnadurai ◽

S. M. Ramesh

Keyword(s):

Scheduling Algorithm ◽

High Level Synthesis ◽

High Level

Download Full-text

Feedback Driven High Level Synthesis for Performance Optimization

2005 6th International Conference on ASIC ◽

10.1109/icasic.2005.1611468 ◽

2006 ◽

Author(s):

Hao Li ◽

S. Katkoori ◽

Zhipeng Liu

Keyword(s):

Performance Optimization ◽

High Level Synthesis ◽

High Level

Download Full-text