gpu acceleration Latest Research Papers

AbstractMulti-image super-resolution (MISR) usually outperforms single-image super-resolution (SISR) under a proper inter-image alignment by explicitly exploiting the inter-image correlation. However, the large computational demand encumbers the deployment of MISR in practice. In this work, we propose a distributed optimization framework based on data parallelism for fast large-scale MISR using multi-GPU acceleration named FL-MISR. The scaled conjugate gradient (SCG) algorithm is applied to the distributed subfunctions and the local SCG variables are communicated to synchronize the convergence rate over multi-GPU systems towards a consistent convergence. Furthermore, an inner-outer border exchange scheme is performed to obviate the border effect between neighboring GPUs. The proposed FL-MISR is applied to the computed tomography (CT) system by super-resolving the projections acquired by subpixel detector shift. The SR reconstruction is performed on the fly during the CT acquisition such that no additional computation time is introduced. FL-MISR is extensively evaluated from different aspects and experimental results demonstrate that FL-MISR effectively improves the spatial resolution of CT systems in modulation transfer function (MTF) and visual perception. Comparing to a multi-core CPU implementation, FL-MISR achieves a more than 50$$\times$$ × speedup on an off-the-shelf 4-GPU system.

Download Full-text

GPU acceleration of rank-reduced coupled-cluster singles and doubles

The Journal of Chemical Physics ◽

10.1063/5.0063467 ◽

2021 ◽

Vol 155 (18) ◽

pp. 184110

Author(s):

Edward G. Hohenstein ◽

Todd J. Martínez

Keyword(s):

Gpu Acceleration ◽

Coupled Cluster

Download Full-text

Decision support for GPU acceleration by predicting energy savings and programming effort

Sustainable Computing Informatics and Systems ◽

10.1016/j.suscom.2021.100631 ◽

2021 ◽

pp. 100631

Author(s):

Charalampos Marantos ◽

Lazaros Papadopoulos ◽

Angeliki-Agathi Tsintzira ◽

Apostolos Ampatzoglou ◽

Alexander Chatzigeorgiou ◽

...

Keyword(s):

Decision Support ◽

Energy Savings ◽

Gpu Acceleration ◽

Programming Effort

Download Full-text

Adding GPU Acceleration to an Industrial CPU-Based Simulator, Development Strategy and Results

10.2118/203936-ms ◽

2021 ◽

Author(s):

Hui Cao ◽

Rustem Zaydullin ◽

Terrence Liao ◽

Neil Gohaud ◽

Eguono Obi ◽

...

Keyword(s):

Development Strategy ◽

Future Trend ◽

Gpu Acceleration ◽

Real Field ◽

Development Effort ◽

Processing Unit ◽

Linear Solver ◽

Cell Simulation ◽

Speed Up ◽

Reservoir Simulator

Abstract Running multi-million cell simulation problems in minutes has been a dream for reservoir engineers for decades. Today, with the advancement of Graphic Processing Unit (GPU), we have a real chance to make this dream a reality. Here we present our experience in the step-by-step transformation of a fully developed industrial CPU-based simulator into a fully functional GPU-based simulator. We also demonstrate significant accelerations achieved through the use of GPU technology. To achieve the best performance possible, we choose to use CUDA (NVIDIA GPU’s native language), and offload as much computations to GPU as possible. Our CUDA implementation covers all reservoir computes, which include property calculation, linearization, linear solver, etc. The well and Field Management still reside on CPU and need minor changes for their interaction with GPU-based reservoir. Importantly, there is no change to the nonlinear logic. The GPU and CPU parts are overlapped, fully utilizing the asynchronous nature of GPU operations. Each reservoir computation can be run in three modes, CPU_only (existing one), GPU_only, CPU followed by GPU. The latter is only used for result checking and debugging. In early 2019, we prototyped two reservoir linearization operations (mass accumulation and mass flux) in CUDA; both showed very strong runtime speed-up of several hundred times, 1 P100-GPU (NVIDIA) vs 1 POWER8NVL CPU core rated at 2.8 GHz (IBM). Encouraged by this success, we moved into linear solver development and managed to move the entire linear solver module into GPU. Again, strong speed-up of ~50 times was achieved (1 GPU vs 1 CPU). The focus for 2019 has been on standard Black-Oil cases. Our implementation was tested with multiple "million-cell range" models (SPE10 and other real field cases). In early 2020, we managed to put SPE10 fully on GPU, and finished the entire 2000 day time-stepping in ~35 sec with a single P100 card. After that our effort has switched to compositional AIM (Adaptive Implicit Method), with focus on compositional flash and AIM implementation for reservoir linearization and linear solver, both show early promising results. GPU-based reservoir simulation is a future trend for HPC. The development of a reservoir simulator is complex, multi-discipline and time-consuming work. Our paper demonstrates a clear strategy to add tremendous GPU acceleration into an existing CPU-based simulator. Our approach fully utilizes the strength of the existing CPU simulator and minimizes the GPU development effort. This paper is also the first publication targeting GPU acceleration for compositional AIM models.

Download Full-text

GPU acceleration of finite state machine input execution: Improving scale and performance

Software Testing Verification and Reliability ◽

10.1002/stvr.1796 ◽

2021 ◽

Author(s):

Vanya Yaneva ◽

Ajitha Rajan ◽

Christophe Dubach

Keyword(s):

Finite State Machine ◽

State Machine ◽

Gpu Acceleration ◽

Finite State ◽

And Performance

Download Full-text

Performance of a geometric deep learning pipeline for HL-LHC particle tracking

The European Physical Journal C ◽

10.1140/epjc/s10052-021-09675-8 ◽

2021 ◽

Vol 81 (10) ◽

Author(s):

Xiangyang Ju ◽

Daniel Murnane ◽

Paolo Calafiura ◽

Nicholas Choma ◽

Sean Conlon ◽

...

Keyword(s):

Neural Networks ◽

Particle Tracking ◽

Metric Learning ◽

Liquid Argon ◽

Gpu Acceleration ◽

New Developments ◽

Tracking Detector ◽

Computing Performance ◽

Graph Neural Networks ◽

Number Of Particles

AbstractThe Exa.TrkX project has applied geometric learning concepts such as metric learning and graph neural networks to HEP particle tracking. Exa.TrkX’s tracking pipeline groups detector measurements to form track candidates and filters them. The pipeline, originally developed using the TrackML dataset (a simulation of an LHC-inspired tracking detector), has been demonstrated on other detectors, including DUNE Liquid Argon TPC and CMS High-Granularity Calorimeter. This paper documents new developments needed to study the physics and computing performance of the Exa.TrkX pipeline on the full TrackML dataset, a first step towards validating the pipeline using ATLAS and CMS data. The pipeline achieves tracking efficiency and purity similar to production tracking algorithms. Crucially for future HEP applications, the pipeline benefits significantly from GPU acceleration, and its computational requirements scale close to linearly with the number of particles in the event.

Download Full-text

An efficient GPU acceleration technique for CBCT based on memory aware optimization scheme

Ain Shams Engineering Journal ◽

10.1016/j.asej.2021.08.010 ◽

2021 ◽

Author(s):

Hassan Youness ◽

Marwa Abbas ◽

Ammar Hassan

Keyword(s):

Gpu Acceleration ◽

Optimization Scheme ◽

Acceleration Technique

Download Full-text

gpu acceleration
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

A STUDY ON GPU-ACCELERATION OF PRESSURE SOLVER IN COMPRESSIBLE PARTICLE METHOD

Evaluation of GPU Acceleration for WRF–SFIRE

Fast, cheap, & turbulent — Global ocean modelling with GPU acceleration in Python

FL-MISR: fast large-scale multi-image super-resolution for computed tomography based on multi-GPU acceleration

GPU acceleration of rank-reduced coupled-cluster singles and doubles

Decision support for GPU acceleration by predicting energy savings and programming effort

Adding GPU Acceleration to an Industrial CPU-Based Simulator, Development Strategy and Results

GPU acceleration of finite state machine input execution: Improving scale and performance

Performance of a geometric deep learning pipeline for HL-LHC particle tracking

An efficient GPU acceleration technique for CBCT based on memory aware optimization scheme

Export Citation Format

gpu accelerationRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

A STUDY ON GPU-ACCELERATION OF PRESSURE SOLVER IN COMPRESSIBLE PARTICLE METHOD

Evaluation of GPU Acceleration for WRF–SFIRE

Fast, cheap, & turbulent — Global ocean modelling with GPU acceleration in Python

FL-MISR: fast large-scale multi-image super-resolution for computed tomography based on multi-GPU acceleration

GPU acceleration of rank-reduced coupled-cluster singles and doubles

Decision support for GPU acceleration by predicting energy savings and programming effort

Adding GPU Acceleration to an Industrial CPU-Based Simulator, Development Strategy and Results

GPU acceleration of finite state machine input execution: Improving scale and performance

Performance of a geometric deep learning pipeline for HL-LHC particle tracking

An efficient GPU acceleration technique for CBCT based on memory aware optimization scheme

gpu acceleration
Recently Published Documents