Hardware Architectures
Recently Published Documents


TOTAL DOCUMENTS: 359 (FIVE YEARS: 103)

H-INDEX: 19 (FIVE YEARS: 4)

2022 ◽  
Vol 12 ◽  
Author(s):  
Nikolaos Alachiotis ◽  
Tze Meng Low ◽  
Pavlos Pavlidis

Electronics ◽  
2021 ◽  
Vol 10 (24) ◽  
pp. 3118
Author(s):  
Eduardo Alcaín ◽  
Pedro R. Fernández ◽  
Rubén Nieto ◽  
Antonio S. Montemayor ◽  
Jaime Vilas ◽  
...  

Medical imaging is considered one of the most important advances in the history of medicine and has become an essential part of the diagnosis and treatment of patients. The demand for earlier prediction and treatment has driven the acquisition of higher image resolutions as well as the fusion of different modalities, raising the need for sophisticated hardware and software systems for medical image registration, storage, analysis, and processing. In this scenario, given new clinical pipelines and the heavy clinical workload of hospitals, these systems are often required to provide both highly accurate and real-time processing of large amounts of imaging data. Additionally, lowering the cost of each part of the imaging equipment, as well as of its development and implementation, and increasing its lifespan are crucial to minimizing costs and making healthcare more accessible. This paper focuses on the evolution and application of different hardware architectures (namely, CPU, GPU, DSP, FPGA, and ASIC) in medical imaging, illustrated through specific examples, and discusses the options available depending on the application. The main purpose is to provide a general introduction to hardware acceleration techniques for medical imaging researchers and developers who need to accelerate their implementations.
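As a minimal illustration of the kind of acceleration the paper surveys (not an example taken from it), the sketch below applies a simple box blur to a 2D image slice with NumPy; since CuPy mirrors a large subset of the NumPy API, swapping the import is, under that assumption, enough to run the same filter on a CUDA GPU. The image size and filter radius are arbitrary stand-ins.

    import numpy as np
    # import cupy as np  # assumed drop-in replacement to run on a CUDA GPU

    def box_blur(image, radius=2):
        # Average over a (2*radius+1)^2 window; the per-pixel sums are
        # independent, which is what makes this workload GPU-friendly.
        padded = np.pad(image, radius, mode="edge")
        out = np.zeros_like(image)
        k = 2 * radius + 1
        for dy in range(k):
            for dx in range(k):
                out += padded[dy:dy + image.shape[0], dx:dx + image.shape[1]]
        return out / (k * k)

    slice_2d = np.random.rand(512, 512).astype(np.float32)  # stand-in for a CT/MRI slice
    smoothed = box_blur(slice_2d)

The loops run over the small kernel window only; all per-pixel arithmetic stays in vectorized array operations, which is the property that lets the same code target either architecture.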


2021 ◽  
Vol 2131 (3) ◽  
pp. 032025
Author(s):  
Oleg Agibalov ◽  
Nikolay Ventsov

Abstract The problem under consideration consists in choosing a number k of individuals such that the time required by the genetic algorithm (GA) to process k individuals on the CPU architecture is close to the time required to process l individuals on the GPU architecture. The initial information consists of data arrays containing the processing times of a given number of individuals by the genetic algorithm on the available hardware architectures. Based on these arrays, fuzzy numbers are determined that describe the processing time of a given number of individuals on the CPU and GPU architectures, respectively. The peculiarities of the subject area mean that the well-known comparison methods based on equality of membership functions and on the nearest crisp sets cannot be considered adequate. Based on the known formula "close to Y (around Y)", a method for comparing the two fuzzy numbers was developed in order to determine the degree of closeness of the processing times of k and l individuals on the CPU and GPU architectures, respectively.
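The abstract does not reproduce the comparison formula itself, so as a hedged sketch of the general idea only: a standard way to grade how close two fuzzy numbers are is the possibility measure sup_x min(mu_A(x), mu_B(x)). Below it is applied to two triangular fuzzy numbers standing in for the CPU and GPU processing times; the triangular shape and all numeric values are assumptions for illustration.

    import numpy as np

    def triangular(x, a, b, c):
        # Membership function of the triangular fuzzy number (a, b, c).
        return np.clip(np.minimum((x - a) / (b - a), (c - x) / (c - b)), 0.0, 1.0)

    def closeness(params_a, params_b, lo=0.0, hi=200.0, n=20001):
        # Possibility measure: sup over x of min(mu_A(x), mu_B(x)).
        x = np.linspace(lo, hi, n)
        return float(np.max(np.minimum(triangular(x, *params_a),
                                       triangular(x, *params_b))))

    t_cpu = (40.0, 50.0, 65.0)  # assumed fuzzy time (ms) for k individuals on the CPU
    t_gpu = (55.0, 60.0, 70.0)  # assumed fuzzy time (ms) for l individuals on the GPU
    print(closeness(t_cpu, t_gpu))  # ~0.5 here; 1.0 = peaks coincide, 0.0 = disjoint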


Author(s):  
Vladimir Rybalkin ◽  
Chirag Sudarshan ◽  
Christian Weis ◽  
Jan Lappas ◽  
Norbert Wehn ◽  
...  

2021 ◽  
Vol 5 (1) ◽  
Author(s):  
Xiaocong Ai ◽  
Georgiana Mania ◽  
Heather M. Gray ◽  
Michael Kuhn ◽  
Nicholas Styles

Abstract Computing centres, including those used to process High-Energy Physics data and simulations, are increasingly providing significant fractions of their computing resources through hardware architectures other than x86 CPUs, with GPUs being a common alternative. GPUs can provide excellent computational performance at a good price point for tasks that can be suitably parallelized. Charged particle (track) reconstruction is a computationally expensive component of HEP data reconstruction, and thus needs to use the available resources efficiently. In this paper, an implementation of Kalman-filter-based track fitting using CUDA and running on GPUs is presented. It utilizes ACTS (A Common Tracking Software), an open-source and experiment-independent toolkit for track reconstruction. The implementation details and parallelization approach are described, along with the specific challenges of such an implementation. Detailed performance benchmarking results are discussed, showing encouraging performance gains over a CPU-based implementation for representative configurations. Finally, a perspective on the challenges and future directions of these studies is outlined, including more complex and realistic scenarios to be studied, and anticipated developments to software frameworks and standards that may open up possibilities for greater flexibility and improved performance.
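This is not the ACTS/CUDA code itself, but as a reminder of the arithmetic each track repeats at every detector layer, here is a minimal linear Kalman-filter predict/update step with toy matrices; in a GPU fit, many such per-track steps execute in parallel. All matrix values below are illustrative assumptions.

    import numpy as np

    def kalman_step(x, P, F, Q, H, R, z):
        # Predict: propagate state estimate and covariance to the next layer.
        x_pred = F @ x
        P_pred = F @ P @ F.T + Q
        # Update: blend the prediction with the measured hit z.
        S = H @ P_pred @ H.T + R              # innovation covariance
        K = P_pred @ H.T @ np.linalg.inv(S)   # Kalman gain
        x_new = x_pred + K @ (z - H @ x_pred)
        P_new = (np.eye(len(x)) - K @ H) @ P_pred
        return x_new, P_new

    # Toy 1D track model: state = (position, slope); each layer measures position.
    F = np.array([[1.0, 1.0], [0.0, 1.0]])   # straight-line propagation by one unit
    H = np.array([[1.0, 0.0]])
    Q = 1e-4 * np.eye(2)                     # e.g. material effects as process noise
    R = np.array([[0.01]])                   # hit resolution
    x, P = np.zeros(2), np.eye(2)
    for z in ([0.10], [0.22], [0.29]):       # assumed hit positions at three layers
        x, P = kalman_step(x, P, F, Q, H, R, np.array(z))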


Algorithms ◽  
2021 ◽  
Vol 14 (10) ◽  
pp. 285
Author(s):  
Hao-Yi Yang ◽  
Zhi-Rong Lin ◽  
Ko-Chih Wang

The use of distribution-based data representations to handle large-scale scientific datasets is a promising approach. Distribution-based approaches often transform a scientific dataset into many distributions, each of which is calculated from a small number of samples. Most of the proposed parallel algorithms focus on efficiently modeling a single distribution from many input samples, which may not fit the large-scale scientific data processing scenario because they cannot utilize computing resources effectively. Histograms and the Gaussian Mixture Model (GMM) are the most popular distribution representations used to model scientific datasets. Therefore, we propose multi-set histogram and GMM modeling algorithms for the large-scale scientific data processing scenario. Our algorithms are built from data-parallel primitives to achieve portability across different hardware architectures. We evaluate the performance of the proposed algorithms in detail and demonstrate use cases for scientific data processing.
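As one hedged sketch of the "many small distributions" pattern (not the authors' data-parallel-primitive implementation): the function below builds one histogram per data block in a single vectorized scatter-add, the kind of operation that maps naturally onto data-parallel primitives. The block layout, bin count, and value range are assumptions.

    import numpy as np

    def multiset_histograms(blocks, n_bins=32, lo=0.0, hi=1.0):
        # One histogram per row of `blocks` (n_blocks x n_samples),
        # with bin edges shared across all blocks on [lo, hi).
        idx = np.clip(((blocks - lo) / (hi - lo) * n_bins).astype(np.int64),
                      0, n_bins - 1)
        hists = np.zeros((blocks.shape[0], n_bins), dtype=np.int64)
        rows = np.repeat(np.arange(blocks.shape[0]), blocks.shape[1])
        np.add.at(hists, (rows, idx.ravel()), 1)  # scatter-add one count per sample
        return hists

    # e.g. a 64^3 scalar field split into 4^3 blocks of 16^3 samples each
    field = np.random.rand(64, 64, 64)
    blocks = field.reshape(4, 16, 4, 16, 4, 16).transpose(0, 2, 4, 1, 3, 5).reshape(64, -1)
    hists = multiset_histograms(blocks)  # shape: (64, 32)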


Author(s):  
Bianca Silveira ◽  
Guilherme Paim ◽  
Brunno Alves Abreu ◽  
Rafael dos Santos Ferreira ◽  
Cláudio Machado Diniz ◽  
...  

Electronics ◽  
2021 ◽  
Vol 10 (16) ◽  
pp. 2023
Author(s):  
Thanikodi Manoj Kumar ◽  
Kasarla Satish Reddy ◽  
Stefano Rinaldi ◽  
Bidare Divakarachari Parameshachari ◽  
Kavitha Arunachalam

Nowadays, a huge amount of digital data is frequently exchanged among different embedded devices over wireless communication technologies. Data security is considered an important parameter for avoiding information loss and preventing cyber-crimes. This research article details low-power, high-speed hardware architectures for an efficient field programmable gate array (FPGA) implementation of the advanced encryption standard (AES) algorithm to provide data security. This work does not depend on look-up tables (LUTs) for implementing the SubBytes and InvSubBytes transformation stages of AES encryption and decryption; instead, the new architecture uses combinational logic circuits to implement the SubBytes and InvSubBytes transformations. Eliminating the LUTs removes unwanted delays, and a sub-pipelining structure is introduced to improve the speed of the AES algorithm. A modified positive polarity Reed-Muller (MPPRM) architecture is inserted to reduce the total hardware requirements, and comparisons are made with different implementations. With the MPPRM architecture introduced in the SubBytes stages, efficient MixColumns and InvMixColumns architectures suited to sub-pipelined round units are added. The performance of the proposed AES-MPPRM architecture is analyzed in terms of the number of slice registers, flip-flops, slice LUTs, logical elements, slices, bonded IOBs, operating frequency, and delay. It is compared with five different AES architectures: LAES, AES-CTR, AES-CFA, AES-BSRD, and AES-EMCBE. The LUT count of the AES-MPPRM architecture implemented on the Spartan-6 is reduced by up to 15.45% compared to that of AES-BSRD.
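The MPPRM circuit itself is beyond a short sketch, but the key idea of computing SubBytes combinationally instead of reading a 256-entry LUT has a direct software analogy: derive each S-box output on the fly as the GF(2^8) multiplicative inverse followed by the AES affine transform. The Python below illustrates that substitution-by-computation idea only; it is not a model of the paper's hardware.

    def gf_mul(a, b):
        # Multiply in GF(2^8) modulo the AES polynomial x^8 + x^4 + x^3 + x + 1.
        r = 0
        for _ in range(8):
            if b & 1:
                r ^= a
            hi = a & 0x80
            a = (a << 1) & 0xFF
            if hi:
                a ^= 0x1B
            b >>= 1
        return r

    def gf_inv(a):
        # Multiplicative inverse as a^254 via square-and-multiply; 0 maps to 0.
        r, base, e = 1, a, 254
        while e:
            if e & 1:
                r = gf_mul(r, base)
            base = gf_mul(base, base)
            e >>= 1
        return r if a else 0

    def sbox(x):
        # SubBytes computed on the fly: inverse, then the AES affine transform.
        inv = gf_inv(x)
        out = 0
        for i in range(8):
            bit = (inv >> i) ^ (inv >> ((i + 4) % 8)) ^ (inv >> ((i + 5) % 8)) \
                ^ (inv >> ((i + 6) % 8)) ^ (inv >> ((i + 7) % 8))
            out |= (bit & 1) << i
        return out ^ 0x63  # affine constant from FIPS-197

    assert sbox(0x00) == 0x63 and sbox(0x53) == 0xED  # spot checks vs the standard table

In hardware, this trade replaces memory reads with XOR-heavy combinational logic, which is what makes finer sub-pipelining of the round units possible.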

