massively parallel computing
Recently Published Documents

TOTAL DOCUMENTS: 155 (FIVE YEARS: 16)
H-INDEX: 13 (FIVE YEARS: 1)

2022 ◽  
Author(s):  
Shijie Yan ◽  
Steven L Jacques ◽  
Jessica C. Ramella-Roman ◽  
Qianqian Fang

Significance: Monte Carlo (MC) methods have been applied to study interactions between polarized light and biological tissues, but most existing MC codes that support polarization modeling can only simulate homogeneous or multi-layered domains, resulting in approximations when handling realistic tissue structures. Aim: Over the past decade, the speed of MC simulations has improved dramatically with massively parallel computing techniques. Developing hardware-accelerated MC simulation algorithms that can accurately model polarized light inside 3-D heterogeneous tissues can greatly expand the utility of polarization in biophotonics applications. Approach: Here we report a highly efficient polarized MC algorithm capable of modeling arbitrarily complex media defined over a voxelated domain. Each voxel of the domain can be associated with spherical scatterers of various radii and densities. The Stokes vector of each simulated photon packet is updated throughout photon propagation, producing spatially resolved polarization measurements over the detectors or the domain surface. Results: We have implemented this algorithm in our widely disseminated MC simulator, Monte Carlo eXtreme (MCX). It is validated against a reference CPU-based simulator in both homogeneous and layered domains, showing excellent agreement and a 931-fold speedup. Conclusion: The polarization-enabled MCX (pMCX) offers the biophotonics community an efficient tool for exploring polarized light in biological tissues, and is freely available at http://mcx.space/.
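As an illustration of the Stokes-vector bookkeeping described in the abstract, the following minimal Python sketch updates a photon packet's Stokes vector at a single scattering event, assuming a Rayleigh-limit Mueller matrix and a reference-frame rotation. It is not the pMCX/MCX implementation; the function names and parameter values are illustrative assumptions.

# Minimal sketch (not the MCX/pMCX implementation): updating a photon
# packet's Stokes vector at one scattering event, assuming a Rayleigh-like
# single-scattering Mueller matrix and a reference-frame rotation by phi.
import numpy as np

def rotation_matrix(phi):
    """Rotate the Stokes reference frame by angle phi about the propagation axis."""
    c, s = np.cos(2 * phi), np.sin(2 * phi)
    return np.array([[1, 0, 0, 0],
                     [0, c, s, 0],
                     [0, -s, c, 0],
                     [0, 0, 0, 1]])

def rayleigh_mueller(theta):
    """Single-scattering Mueller matrix for small spherical scatterers (Rayleigh limit)."""
    ct = np.cos(theta)
    a, b = 0.5 * (ct**2 + 1), 0.5 * (ct**2 - 1)
    return np.array([[a, b, 0, 0],
                     [b, a, 0, 0],
                     [0, 0, ct, 0],
                     [0, 0, 0, ct]])

def scatter(stokes, theta, phi):
    """One scattering event: rotate into the scattering plane, then scatter."""
    s = rayleigh_mueller(theta) @ rotation_matrix(phi) @ stokes
    return s / s[0]  # renormalize so I = 1

# Example: a linearly polarized photon packet scattered at 30 degrees.
print(scatter(np.array([1.0, 1.0, 0.0, 0.0]), np.deg2rad(30), np.deg2rad(45)))

In a full simulation this update would be applied at every scattering event, with the angles sampled from the phase function of the scatterers assigned to the current voxel.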


2022 ◽  
pp. 105030
Author(s):  
Octavio Castillo-Reyes ◽  
David Modesto ◽  
Pilar Queralt ◽  
Alex Marcuello ◽  
Juanjo Ledo ◽  
...  

2021 ◽  
Vol 181 (2-3) ◽  
pp. 213-238
Author(s):  
Benedek Nagy ◽  
Sándor Vályi

Interval-valued computing is a kind of massively parallel computing. It operates on specific subsets of the interval [0,1) – unions of subintervals – which serve as the basic data units and are called interval-values. It was established in [9], by a rather simple observation, that interval-valued computing, as a digital computing model, has computing power equivalent to that of Turing machines. However, this equivalence involves an unlimited number of interval-valued variables. In [14], the equivalence with Turing machines is established using a simulation that uses only a fixed number of interval-valued variables, and this number depends only logarithmically on the number of states of the Turing machine. The simulation given there allows us to extend interval-valued computations to infinite length in order to capture the computing power of red-green Turing machines. In this extension of [14], based on the quasi-periodic techniques used in the simulations of that paper, a reformulation of interval-valued computations is given, named circular interval-valued computers. This reformulation enforces the finiteness of the number of interval-valued variables used by building it into the syntax rules.
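To make the data model concrete, the following toy Python sketch represents interval-values as finite unions of half-open subintervals of [0,1) and implements two Boolean operators on them; the shift and product operators of the full model are omitted, and all function names here are ours, not taken from [9] or [14].

# Toy sketch of interval-values as finite unions of half-open subintervals of [0, 1).
# Only the data representation and two Boolean operators are illustrated; the full
# model also has shift and product operators that are not reproduced here.
from typing import List, Tuple

Interval = Tuple[float, float]          # half-open [a, b) with 0 <= a < b <= 1

def normalize(iv: List[Interval]) -> List[Interval]:
    """Sort and merge overlapping or adjacent subintervals."""
    merged: List[Interval] = []
    for a, b in sorted(iv):
        if merged and a <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], b))
        else:
            merged.append((a, b))
    return merged

def complement(iv: List[Interval]) -> List[Interval]:
    """Complement of an interval-value within [0, 1)."""
    out, start = [], 0.0
    for a, b in normalize(iv):
        if a > start:
            out.append((start, a))
        start = b
    if start < 1.0:
        out.append((start, 1.0))
    return out

def intersection(x: List[Interval], y: List[Interval]) -> List[Interval]:
    """Pointwise intersection (Boolean AND) of two interval-values."""
    out = []
    for a, b in normalize(x):
        for c, d in normalize(y):
            lo, hi = max(a, c), min(b, d)
            if lo < hi:
                out.append((lo, hi))
    return normalize(out)

# Example: AND of the "first half" and the "middle half" of [0, 1).
print(intersection([(0.0, 0.5)], [(0.25, 0.75)]))   # -> [(0.25, 0.5)]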


2021 ◽  
Vol 11 (14) ◽  
pp. 6552
Author(s):  
Christopher Lange ◽  
Patrick Barthelmäs ◽  
Tobias Rosnitschek ◽  
Stephan Tremmel ◽  
Frank Rieg

High-performance computing (HPC) enables both academia and industry to accelerate simulation-driven product development processes by providing a massively parallel computing infrastructure. In particular, the automation of high-fidelity computational fluid dynamics (CFD) analyses aided by HPC systems can be beneficial, since computing time decreases while the number of significant design iterations increases. However, no studies have yet quantified these effects from a product development point of view. This article evaluates the impact of HPC and automation on product development by studying a Formula Student racing team as a representative example of a small or medium-sized company. Over several seasons, we accompanied the team and provided HPC infrastructure and methods to automate their CFD simulation processes. By comparing the team's key performance indicators (KPIs) before and after the HPC implementation, we were able to quantify a significant increase in development efficiency in both qualitative and quantitative terms. The major aerodynamic KPI increased by up to 115%. Simultaneously, the number of expedient design iterations within one season increased by 600% while utilizing HPC. These results demonstrate the substantial benefits of HPC and automation of numerically intensive simulation processes for product development.


2021 ◽  
Vol 1 (132) ◽  
pp. 116-123
Author(s):  
Alexey Gnilenko

The hardware implementation of an artificial neuron is a key problem in the design of neuromorphic chips, which are promising new architectural solutions for massively parallel computing. In this paper, an analog neuron circuit design is presented for use as a building element of spiking neural networks. The design of the neuron is performed at the transistor level based on the Leaky Integrate-and-Fire neuron model. The neuron is simulated using an EDA tool to verify the design. Signal waveforms at key nodes of the neuron are obtained and the neuron's functionality is demonstrated.
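For readers unfamiliar with the model, the following short Python sketch shows the behaviour of a Leaky Integrate-and-Fire neuron at the software level; it is a behavioural reference only, not the transistor-level analog circuit described in the paper, and all parameter values are illustrative assumptions.

# Behavioural sketch of a Leaky Integrate-and-Fire neuron (software model only).
# Parameter values (tau, R, thresholds) are illustrative, not from the paper.
import numpy as np

def lif(input_current, dt=1e-4, tau=20e-3, r=1e7,
        v_rest=0.0, v_thresh=0.02, v_reset=0.0):
    """Integrate dV/dt = (-(V - v_rest) + R*I) / tau and spike at threshold."""
    v = v_rest
    spikes, trace = [], []
    for t, i_in in enumerate(input_current):
        v += dt * (-(v - v_rest) + r * i_in) / tau
        if v >= v_thresh:          # threshold crossing -> emit spike, then reset
            spikes.append(t * dt)
            v = v_reset
        trace.append(v)
    return np.array(trace), spikes

# A constant 3 nA input over 100 ms produces a regular spike train.
trace, spikes = lif(np.full(1000, 3e-9))
print(len(spikes), "spikes")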


2021 ◽  
Vol 8 (1) ◽  
pp. 200531
Author(s):  
Karen Larson ◽  
Georgios Arampatzis ◽  
Clark Bowman ◽  
Zhizhong Chen ◽  
Panagiotis Hadjidoukas ◽  
...  

Effective intervention strategies for epidemics rely on the identification of their origin and on the robustness of the predictions made by network disease models. We introduce a Bayesian uncertainty quantification framework to infer model parameters for a disease spreading on a network of communities from limited, noisy observations; the state-of-the-art computational framework compensates for the model complexity by exploiting massively parallel computing architectures. Using noisy, synthetic data, we show the potential of the approach to perform robust model fitting, and we additionally demonstrate that the disease origin can be effectively identified via Bayesian model selection. As disease-related data become increasingly available, the proposed framework has broad practical relevance for the prediction and management of epidemics.
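The following minimal Python sketch conveys the flavour of Bayesian origin identification on a toy network: a deterministic hop-count spread model, Gaussian observation noise, and a posterior over candidate origin nodes. The graph, noise model, and spread model are illustrative assumptions and are far simpler than the network disease models and parallel sampling framework used in the paper.

# Toy Bayesian origin identification on a small network (illustrative only).
from collections import deque
import numpy as np

ADJ = {0: [1, 2], 1: [0, 3], 2: [0, 3], 3: [1, 2, 4], 4: [3]}   # toy community graph

def infection_times(origin, delay=1.0):
    """Deterministic spread: infection time = shortest-path hops * per-hop delay."""
    dist = {origin: 0.0}
    q = deque([origin])
    while q:
        u = q.popleft()
        for v in ADJ[u]:
            if v not in dist:
                dist[v] = dist[u] + delay
                q.append(v)
    return np.array([dist[n] for n in sorted(ADJ)])

def log_likelihood(observed, origin, sigma=0.5):
    """Gaussian observation noise around the model-predicted infection times."""
    predicted = infection_times(origin)
    return -0.5 * np.sum(((observed - predicted) / sigma) ** 2)

# Synthetic noisy data generated from origin node 2, then a posterior over origins
# under a uniform prior (normalized likelihoods).
rng = np.random.default_rng(0)
observed = infection_times(2) + rng.normal(0, 0.3, size=len(ADJ))
logp = np.array([log_likelihood(observed, o) for o in sorted(ADJ)])
posterior = np.exp(logp - logp.max())
posterior /= posterior.sum()
print(dict(zip(sorted(ADJ), np.round(posterior, 3))))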


Author(s):  
Edward A. Miller ◽  
Michael J. Cave ◽  
David M. Williams ◽  
Khandan Thayalakhandan

Abstract Computational fluid dynamics (CFD) of industrial-scale, axial compressor geometries has traditionally been performed using steady-state methods such as the mixing-plane approach. With the surge in the development of large-scale, massively parallel computing platforms, fully 3D unsteady approaches are rapidly growing in popularity. The fully 3D, unsteady approach involves building a full 3D domain for each blade row and then coupling the stationary and rotating domains using a sliding interface. In the literature, there are various methods for solving this 3D unsteady problem, such as the Unsteady Reynolds-Averaged Navier-Stokes (URANS) and Detached Eddy Simulation (DES) methods. While these methods are well documented for a variety of real-world problems, there have been limited efforts to compare their effectiveness for fully 3D, unsteady turbomachinery problems. In this study, the first stage of an industrial-scale axial compressor was simulated using (i) the URANS approach and (ii) the DES approach. The compressor geometry consisted of an inlet housing, inlet guide vanes (IGV), a rotor, and a stator. The RANS model for both simulations was the k-epsilon model. In both cases, sliding mesh interfaces were located between the IGV and the rotor, and between the rotor and the stator. The results of the URANS and DES approaches were time-averaged and their predictions were compared. Throughout the study, our goal was to provide important insights into the performance of the URANS and DES approaches and to highlight their essential differences.


2020 ◽  
Author(s):  
Mario Reja ◽  
Ciprian Pungila ◽  
Viorel Negru

Abstract The decoding of the human genome over the past decades has brought DNA profiling into focus as a computationally intensive operation. The typical search space for these kinds of problems is extremely large and requires specialized hardware and algorithms to perform the necessary sequence analysis. In this paper, we propose an innovative and scalable approach to exact multi-pattern matching of nucleotide sequences by harnessing the massively parallel computing power found in commodity graphics processing units. Our approach places careful consideration on the preprocessing of DNA datasets and on runtime performance, while exploiting the full capabilities of the heterogeneous platform it runs on. Finally, we evaluate our models against real-world DNA sequences.
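As a plain CPU reference for what exact multi-pattern matching computes, the following Python sketch groups patterns by length and checks every window of the sequence against a set; it is illustrative only and does not reflect the GPU data structures or preprocessing proposed in the paper.

# CPU reference sketch of exact multi-pattern matching over a nucleotide sequence.
# Patterns are grouped by length and each window is checked with a set lookup.
from collections import defaultdict

def multi_pattern_match(text, patterns):
    """Return a sorted list of (position, pattern) for every exact occurrence."""
    by_length = defaultdict(set)
    for p in patterns:
        by_length[len(p)].add(p)
    hits = []
    for length, pset in by_length.items():
        for i in range(len(text) - length + 1):
            window = text[i:i + length]
            if window in pset:
                hits.append((i, window))
    return sorted(hits)

sequence = "ACGTACGTTAGCACGT"
print(multi_pattern_match(sequence, ["ACGT", "TAG", "GTT"]))
# -> [(0, 'ACGT'), (4, 'ACGT'), (6, 'GTT'), (8, 'TAG'), (12, 'ACGT')]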


Author(s):  
Steven J. Lind ◽  
Benedict D. Rogers ◽  
Peter K. Stansby

This paper presents a review of the progress of smoothed particle hydrodynamics (SPH) towards high-order converged simulations. As a mesh-free Lagrangian method suitable for complex flows with interfaces and multiple phases, SPH has developed considerably in the past decade. While the original applications were in astrophysics, early engineering applications showed the versatility and robustness of the method without emphasis on accuracy and convergence. The early method was of weakly compressible form, resulting in noisy pressures due to spurious pressure waves. This was effectively removed in the incompressible (divergence-free) form which followed; since then the weakly compressible form has also been advanced, reducing pressure noise. Numerical convergence studies are now standard. While the method is computationally demanding on conventional processors, it is well suited to parallel processing on massively parallel architectures and graphics processing units. Applications are diverse and encompass wave–structure interaction, geophysical flows due to landslides, nuclear sludge flows, welding, gearbox flows and many others. In the state of the art, convergence is typically between the first- and second-order theoretical limits. Recent advances improving convergence to fourth order (and higher) are also outlined; this can be necessary to resolve multi-scale aspects of turbulent flow.
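To illustrate the core SPH approximation underlying the review, the following minimal Python sketch performs a 1-D density summation with the standard cubic spline kernel; it is an O(N^2) reference, not a production (3-D, neighbour-list, GPU-parallel) SPH code, and the particle spacing and smoothing length are illustrative choices.

# Minimal 1-D SPH density summation with the standard cubic spline kernel.
import numpy as np

def cubic_spline_1d(r, h):
    """1-D cubic spline kernel with smoothing length h (support radius 2h)."""
    q = np.abs(r) / h
    sigma = 2.0 / (3.0 * h)                      # 1-D normalization constant
    w = np.where(q < 1.0, 1.0 - 1.5 * q**2 + 0.75 * q**3,
        np.where(q < 2.0, 0.25 * (2.0 - q)**3, 0.0))
    return sigma * w

def sph_density(x, mass, h):
    """Density summation: rho_i = sum_j m_j W(x_i - x_j, h)."""
    dx = x[:, None] - x[None, :]
    return (mass[None, :] * cubic_spline_1d(dx, h)).sum(axis=1)

# Uniform particle spacing 0.01 with unit nominal density:
# interior densities should come out close to 1.
x = np.arange(0.0, 1.0, 0.01)
mass = np.full_like(x, 0.01)
print(sph_density(x, mass, h=0.013)[40:45])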

