Multiple GPUs
Recently Published Documents


TOTAL DOCUMENTS: 185 (Five Years: 55)

H-INDEX: 16 (Five Years: 2)

2022 ◽  
Vol 27 (1) ◽  
pp. 114-126
Author(s):  
Yu Tang ◽  
Zhigang Kan ◽  
Lujia Yin ◽  
Zhiquan Lai ◽  
Zhaoning Zhang ◽  
...  
Sensors ◽  
2021 ◽  
Vol 21 (24) ◽  
pp. 8258
Author(s):  
Seokwon Lee ◽  
Inmo Ban ◽  
Myeongjin Lee ◽  
Yunho Jung ◽  
Wookyung Lee

This paper explores novel architectures for fast backprojection-based video synthetic aperture radar (BP-VISAR) with multiple GPUs. The video SAR frame rate is analyzed for non-overlapped and overlapped aperture modes. For parallelization of the backprojection process, a processing data unit is defined as the phase-history or range-profile data from partial synthetic apertures divided from the full-resolution target data. Considering whether full-aperture processing is performed and whether range compression or backprojection is parallelized on a per-GPU basis, we propose six distinct architectures, each having a single-stream pipeline with a single GPU. The performance of these architectures is evaluated in both non-overlapped and overlapped modes. The efficiency of the BP-VISAR architecture with sub-aperture processing in the overlapped mode is further accelerated by filling the processing gap left by idling GPU resources with multi-stream backprojection on multiple GPUs. The frame rate of the proposed BP-VISAR architecture with sub-aperture processing is scalable with the number of GPU devices for large pixel resolutions: it can generate 4096 × 4096 video SAR frames of 0.5 m cross-range resolution at 23.0 Hz on a single GPU and 73.5 Hz on quad GPUs.
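The overlapped sub-aperture mode described above can be sketched in a few lines: the full pulse stream is partitioned into overlapping sub-apertures (one per video frame), and the per-frame backprojection work is distributed round-robin across devices. This is a minimal pure-Python sketch of that partitioning idea; the function names, aperture sizes, and device count are illustrative assumptions, not taken from the paper.

```python
# Hypothetical sketch of overlapped sub-aperture partitioning for video SAR.

def sub_apertures(num_pulses, aperture_len, overlap):
    """Yield (start, end) pulse ranges for overlapped sub-apertures.

    Each frame reuses `overlap` pulses from the previous frame, so only
    aperture_len - overlap new pulses are needed per frame.
    """
    step = aperture_len - overlap
    start = 0
    while start + aperture_len <= num_pulses:
        yield (start, start + aperture_len)
        start += step

def assign_to_gpus(frames, num_gpus):
    """Round-robin assignment: GPU g backprojects every num_gpus-th frame."""
    return {g: [f for i, f in enumerate(frames) if i % num_gpus == g]
            for g in range(num_gpus)}

# With a 256-pulse aperture and 192-pulse overlap, each frame adds 64 new
# pulses, so 1024 pulses yield (1024 - 256) // 64 + 1 = 13 frames.
frames = list(sub_apertures(num_pulses=1024, aperture_len=256, overlap=192))
work = assign_to_gpus(frames, num_gpus=4)
print(len(frames))   # 13
print(len(work[0]))  # 4 (frames 0, 4, 8, 12 on GPU 0)
```

Larger overlaps raise the frame rate (more frames per pulse) at the cost of more redundant backprojection per frame, which is why idle-GPU multi-streaming pays off in the overlapped mode.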


2021 ◽  
pp. 108263
Author(s):  
Joseph O'Connor ◽  
José M. Domínguez ◽  
Benedict D. Rogers ◽  
Steven J. Lind ◽  
Peter K. Stansby

2021 ◽  
Vol E104.D (12) ◽  
pp. 2057-2067
Author(s):  
Tomoya ITSUBO ◽  
Michihiro KOIBUCHI ◽  
Hideharu AMANO ◽  
Hiroki MATSUTANI
2021 ◽  
Author(s):  
Tomohiro Imanaga ◽  
Koji Nakano ◽  
Ryota Yasudo ◽  
Yasuaki Ito ◽  
Yuya Kawamata ◽  
...  
Author(s):  
Bahzad Taha Chicho ◽  
Amira Bibo Sallow ◽  

Python is one of the most widely adopted programming languages, having displaced a number of its predecessors in the field. Python is popular with developers for a variety of reasons, one of which is its incredibly diverse collection of libraries. The most compelling reasons for adopting Keras come from its guiding principles, particularly those related to usability. Beyond ease of learning and model construction, Keras offers a wide variety of production deployment options and robust support for multiple GPUs and distributed training. A strong, easy-to-use, free, and open-source Python library is the most important tool for developing and evaluating deep learning models. The aim of this paper is to provide the most current survey of Keras, a Python-based deep learning Application Programming Interface (API) that runs on top of the machine learning framework TensorFlow, in its different aspects. The library is used in conjunction with TensorFlow, PyTorch, CODEEPNEATM, and Pygame to integrate deep learning models into applied areas such as cardiovascular disease diagnostics, graph neural networks, health-issue identification, COVID-19 recognition, skin tumors, and image detection. Furthermore, the paper covers Keras's details, goals, challenges, and significant outcomes, as well as the findings obtained using it.
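The multi-GPU and distributed-training support mentioned above is, at its core, data parallelism: each replica computes gradients on its shard of a batch, and the gradients are averaged before one synchronized weight update. Below is a minimal NumPy sketch of that idea (the mechanism behind approaches like TensorFlow's MirroredStrategy); the linear model and data here are illustrative assumptions, not code from the surveyed library.

```python
# Sketch of data-parallel gradient averaging, the idea behind multi-GPU
# (mirrored) training. Two "replicas" are simulated on the CPU.
import numpy as np

def grad_mse(w, X, y):
    """Gradient of mean-squared error for the linear model y_hat = X @ w."""
    return 2.0 * X.T @ (X @ w - y) / len(y)

rng = np.random.default_rng(0)
X = rng.normal(size=(8, 3))
y = rng.normal(size=8)
w = np.zeros(3)

# Split the batch across 2 equal shards (one per replica), then average
# the per-replica gradients, as an all-reduce would.
shards = [(X[:4], y[:4]), (X[4:], y[4:])]
g_replica = np.mean([grad_mse(w, Xs, ys) for Xs, ys in shards], axis=0)

# With equal-size shards, the averaged gradient equals the full-batch
# gradient, so data-parallel training reproduces single-device training.
g_full = grad_mse(w, X, y)
print(np.allclose(g_replica, g_full))  # True
```

This equivalence is why data parallelism is the default multi-GPU strategy: only the gradient exchange is added, while the optimization trajectory matches single-device training for the same effective batch size.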


Algorithms ◽  
2021 ◽  
Vol 14 (7) ◽  
pp. 204
Author(s):  
Wenpeng Ma ◽  
Wu Yuan ◽  
Xiazhen Liu

Incomplete Sparse Approximate Inverses (ISAI) have shown advantages over sparse triangular solves on GPUs when used in incomplete-LU-based preconditioners. In this paper, we extend the single-GPU Block–ISAI method to a multi-GPU algorithm by coupling it with a Block–Jacobi preconditioner, and we describe the detailed implementation in the open-source numerical package PETSc. In the experiments, two representative cases are run, and a comparative study of Block–ISAI on up to four GPUs is conducted on two major generations of NVIDIA GPUs (Tesla K20 and Tesla V100). Block–Jacobi preconditioning with Block–ISAI (BJPB-ISAI) outperforms the level-scheduling-based triangular solves from the cuSPARSE library on these cases, and both the overhead of setting up Block–ISAI and the total wall-clock time of GMRES are greatly reduced on Tesla V100 GPUs compared to Tesla K20 GPUs.
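The block-Jacobi structure into which the paper couples Block–ISAI can be sketched briefly: the preconditioner applies only the (approximate) inverse of each diagonal block, so each GPU works on its local block independently. This minimal NumPy sketch inverts the diagonal blocks exactly for illustration; in the paper's method, Block–ISAI would replace these exact inverses with sparse approximate inverses. Matrix sizes and values are illustrative assumptions.

```python
# Sketch of a block-Jacobi preconditioner M^{-1} = blockdiag(inv(A_11), ...).
# Each block could be handled by a separate GPU with no cross-block coupling.
import numpy as np

def block_jacobi_inverse(A, block_size):
    """Build the block-diagonal inverse used as the preconditioner."""
    n = A.shape[0]
    M_inv = np.zeros_like(A)
    for s in range(0, n, block_size):
        e = s + block_size
        M_inv[s:e, s:e] = np.linalg.inv(A[s:e, s:e])
    return M_inv

rng = np.random.default_rng(1)
n, bs = 8, 4
# A diagonally dominant test matrix, so the diagonal blocks carry most
# of the action and block-Jacobi is effective.
A = np.eye(n) * 4.0 + rng.normal(scale=0.3, size=(n, n))
M_inv = block_jacobi_inverse(A, bs)

# Preconditioning pulls M^{-1} A toward the identity, which is what
# accelerates GMRES convergence in the experiments above.
dist_plain = np.linalg.norm(A - np.eye(n))
dist_prec = np.linalg.norm(M_inv @ A - np.eye(n))
print(dist_prec < dist_plain)  # True
```

Replacing `np.linalg.inv` with a sparse approximate inverse trades a little preconditioner quality for a purely sparse-matrix-product application step, which maps far better onto GPUs than a sequential triangular solve.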

