Mapping high level algorithms onto massively parallel reconfigurable hardware

Author(s): I. Damaj, J. Hawkins, A. Abdallah

2012, Vol 2012, pp. 1-10
Author(s): Christoph Starke, Vasco Grossmann, Lars Wienbrandt, Sven Koschnicke, John Carstens, ...

The hardware structure of a processing element for optimizing an investment strategy for financial markets is presented. It is shown how this processing element can be replicated many times on the massively parallel FPGA machine RIVYERA, yielding a speedup factor of about 17,000 over a single high-performance PC while reducing energy consumption by more than 99%. Furthermore, it is shown for a particular security and for different time periods that the optimized investment strategy outperforms a buy-and-hold strategy by between 2 and 14 percent.
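The abstract does not detail the processing element's internals, but the underlying pattern is an embarrassingly parallel parameter sweep: each replicated PE evaluates one candidate parameter set of the strategy against historical prices. Below is a minimal software analogue in C++ under assumed details; all names (`evaluate_strategy`, the moving-average parameters, the synthetic price series) are hypothetical illustrations, not the paper's actual PE design.

```cpp
#include <algorithm>
#include <cmath>
#include <execution>
#include <iostream>
#include <numeric>
#include <vector>

// Hypothetical strategy parameters; in hardware, each PE would hold one set.
struct Params { int fast_window; int slow_window; };

// Toy moving-average-crossover backtest; stands in for the PE's evaluation logic.
double evaluate_strategy(const std::vector<double>& prices, Params p) {
    double cash = 1.0, position = 0.0;
    for (size_t t = static_cast<size_t>(p.slow_window); t < prices.size(); ++t) {
        auto mean = [&](int w) {
            return std::accumulate(prices.begin() + t - w, prices.begin() + t, 0.0) / w;
        };
        bool buy = mean(p.fast_window) > mean(p.slow_window);
        if (buy && position == 0.0) { position = cash / prices[t]; cash = 0.0; }
        if (!buy && position > 0.0) { cash = position * prices[t]; position = 0.0; }
    }
    return cash + position * prices.back();  // final portfolio value
}

int main() {
    std::vector<double> prices(1000);
    for (size_t i = 0; i < prices.size(); ++i)
        prices[i] = 100.0 + 10.0 * std::sin(i * 0.05);  // synthetic price series

    // Enumerate candidate parameter sets -- one per "processing element".
    std::vector<Params> grid;
    for (int f = 2; f < 50; ++f)
        for (int s = f + 1; s < 200; ++s)
            grid.push_back({f, s});

    // Evaluate all candidates in parallel, as the replicated PEs do in hardware.
    std::vector<double> results(grid.size());
    std::transform(std::execution::par, grid.begin(), grid.end(), results.begin(),
                   [&](Params p) { return evaluate_strategy(prices, p); });

    auto best = std::max_element(results.begin(), results.end());
    std::cout << "best final value: " << *best << '\n';
}
```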


Electronics, 2020, Vol 9 (9), pp. 1482
Author(s): Rafał Kiełbik, Kamil Rudnicki, Zbigniew Mudza, Jarosław Jung

ARUZ is a large-scale, massively parallel, FPGA-based reconfigurable computational system dedicated primarily to molecular analysis. This paper presents a methodology for ARUZ firmware development that simplifies the process, offers low-level optimization, and facilitates verification. According to this methodology, an expanded, generic, all-in-one VHDL description of the variable Processing Elements (PEs) is first developed manually. GCC preprocessing is then used to extract only the desired target functionality. Dedicated software instantiates and connects the PEs in the form of a scalable network, divides the network into subsets for individual chips, and generates its HDL description. As a result, an individual HDL-coded specification, optimized for the particular analysis, is provided to the synthesis tool. Code reuse and automated generation of up to 81% of the code reduce the workload. Describing the cores in well-optimized VHDL rather than relying on High-Level Synthesis eliminates unnecessary overhead. The PE network can be scaled inversely with PE complexity in order to use the available resources efficiently. Moreover, downscaling the problem simplifies verification during HDL simulation and testing of the prototype systems.
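The extraction step rests on ordinary C-preprocessor conditional compilation: the all-in-one source carries every PE variant behind `#ifdef` guards, and a preprocessor-only pass (e.g. something like `gcc -E -P`; the abstract does not give the exact invocation) strips everything but the selected variant. The sketch below shows the same pattern in C++ rather than in the paper's VHDL, with hypothetical variant names, purely to illustrate the mechanism.

```cpp
// all_in_one_pe.cpp -- hypothetical "all-in-one" source holding every PE variant.
// Selecting one variant at preprocessing time mirrors the ARUZ extraction step:
//   g++ -E -P -DPE_VARIANT_DIFFUSION all_in_one_pe.cpp > pe_diffusion.cpp
#include <cstdint>
#include <iostream>

struct PEState { std::uint32_t site; std::uint32_t neighbor; };

// Only one of these blocks survives preprocessing; the rest are stripped,
// just as unused functionality is stripped from the VHDL before synthesis.
#ifdef PE_VARIANT_DIFFUSION
// Variant A: toy "diffusion" update rule (hypothetical).
std::uint32_t pe_step(const PEState& s) { return (s.site + s.neighbor) / 2; }
#endif

#ifdef PE_VARIANT_REACTION
// Variant B: toy "reaction" update rule (hypothetical).
std::uint32_t pe_step(const PEState& s) { return s.site ^ s.neighbor; }
#endif

int main() {
    PEState s{8, 2};
    std::cout << "pe_step -> " << pe_step(s) << '\n';
}
```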


2020, Vol 10 (5), pp. 1656
Author(s): Woosuk Shin, Kwan-Hee Yoo, Nakhoon Baek

Today, many big-data applications require massively parallel tasks to compute complicated mathematical operations. To perform such tasks, platforms like CUDA (Compute Unified Device Architecture) and OpenCL (Open Computing Language) are widely used and continually developed to enhance the throughput of massively parallel workloads. There is also a need for high-level abstraction and platform independence over these massively parallel computing platforms. Recently, the Khronos Group announced SYCL (C++ Single-source Heterogeneous Programming for OpenCL), a new cross-platform abstraction layer that provides an efficient way to write single-source heterogeneous programs with C++-template-level abstractions. However, since there is no single official implementation of SYCL, several different implementations from various vendors currently coexist. In this paper, we analyse the characteristics of those SYCL implementations. We also present performance measurements of these implementations on well-known massively parallel tasks, and show that each implementation has its own strengths for different types of mathematical operations and different data sizes. Our analysis provides fundamental measurements for the cost-effective use of massively parallel computation at this abstraction level, especially in big-data applications.
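The "single-source" style the abstract refers to places host and device code in one C++ translation unit. Below is a minimal SYCL 2020 vector-addition sketch of that style; it is our own illustration, not one of the paper's benchmarks, and older implementations from the paper's era may expect the `<CL/sycl.hpp>` header instead of `<sycl/sycl.hpp>`.

```cpp
#include <sycl/sycl.hpp>
#include <iostream>
#include <vector>

int main() {
    constexpr size_t N = 1024;
    std::vector<float> a(N, 1.0f), b(N, 2.0f), c(N, 0.0f);

    // The queue targets whatever device the SYCL runtime selects
    // (GPU, CPU, or accelerator), which is what makes the source portable.
    sycl::queue q;

    {   // Buffers hand the data over to the runtime for the scope below.
        sycl::buffer<float> buf_a(a.data(), sycl::range<1>(N));
        sycl::buffer<float> buf_b(b.data(), sycl::range<1>(N));
        sycl::buffer<float> buf_c(c.data(), sycl::range<1>(N));

        // Host and device code live in the same C++ source: the lambda
        // below is compiled for the device by the SYCL implementation.
        q.submit([&](sycl::handler& h) {
            sycl::accessor A(buf_a, h, sycl::read_only);
            sycl::accessor B(buf_b, h, sycl::read_only);
            sycl::accessor C(buf_c, h, sycl::write_only);
            h.parallel_for(sycl::range<1>(N), [=](sycl::id<1> i) {
                C[i] = A[i] + B[i];
            });
        });
    }   // Buffer destruction synchronizes and copies results back to c.

    std::cout << "c[0] = " << c[0] << '\n';  // expected: 3
}
```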

