parallel computer Latest Research Papers

The Multigrid Beta Function Approach for Modeling of Background Error Covariance in the Real-Time Mesoscale Analysis (RTMA)

Monthly Weather Review ◽

10.1175/mwr-d-20-0405.1 ◽

2022 ◽

Keyword(s):

Spatial Scales ◽

Beta Function ◽

Parallel Computer ◽

Covariance Operator ◽

General Covariance ◽

Background Error ◽

Recursive Filters ◽

Error Covariance ◽

Covariance Operators ◽

Helmholtz Operators

Abstract We describe a method for the efficient generation of the covariance operators of a variational data assimilation scheme which is suited to implementation on a massively parallel computer. The elementary components of this scheme are what we call ‘beta filters’, since they are based on the same spatial profiles possessed by the symmetric beta distributions of probability theory. These approximately Gaussian (bell-shaped) polynomials blend smoothly to zero at the ends of finite intervals, which makes them better suited to parallelization than the present quasi-Gaussian ‘recursive filters’ used in operations at NCEP. These basic elements are further combined at a hierarchy of different spatial scales into an overall multigrid structure formulated to preserve the necessary self-adjoint attribute possessed by any valid covariance operator. This paper describes the underlying idea of the beta filter and discusses how generalized Helmholtz operators can be enlisted to weight the elementary contributions additively in such a way that the covariance operators may exhibit realistic negative sidelobes, which are not easily obtained through the recursive filter paradigm. The main focus of the paper is on the basic logistics of the multigrid structure by which more general covariance forms are synthesized from the basic quasi-Gaussian elements. We describe several ideas on how best to organize computation, which led us to a generalization of this structure which made it practical so that it can efficiently perform with any rectangular arrangement of processing elements. Some simple idealized examples of the applications of these ideas are given.

Genetic programming for iterative numerical methods

Genetic Programming and Evolvable Machines ◽

10.1007/s10710-021-09425-5 ◽

2021 ◽

Author(s):

Dominik Sobania ◽

Jonas Schmitt ◽

Harald Köstler ◽

Franz Rothlauf

Keyword(s):

Genetic Programming ◽

Linear Systems ◽

Spectral Radius ◽

Linear Equations ◽

Parallel Computer ◽

Iteration Matrix ◽

Mathematical Expressions ◽

Lower Spectral Radius ◽

Problem Instances ◽

Stationary Iterative Methods

AbstractWe introduce GPLS (Genetic Programming for Linear Systems) as a GP system that finds mathematical expressions defining an iteration matrix. Stationary iterative methods use this iteration matrix to solve a system of linear equations numerically. GPLS aims at finding iteration matrices with a low spectral radius and a high sparsity, since these properties ensure a fast error reduction of the numerical solution method and enable the efficient implementation of the methods on parallel computer architectures. We study GPLS for various types of system matrices and find that it easily outperforms classical approaches like the Gauss–Seidel and Jacobi methods. GPLS not only finds iteration matrices for linear systems with a much lower spectral radius, but also iteration matrices for problems where classical approaches fail. Additionally, solutions found by GPLS for small problem instances show also good performance for larger instances of the same problem.

A Novel Tradeoff Analysis between Traffic Congestion and Packing Density of Interconnection Networks for Massively Parallel Computers

Applied Sciences ◽

10.3390/app112210798 ◽

2021 ◽

Vol 11 (22) ◽

pp. 10798

Author(s):

M. M. Hafizur Rahman ◽

Mohammed Al-Naeem ◽

Mohammed Mustafa Ghowanem ◽

Eklas Hossain

Keyword(s):

Packing Density ◽

Traffic Congestion ◽

Interconnection Networks ◽

Network Performance ◽

Interconnection Network ◽

Future Generation ◽

Parallel Computer ◽

Massively Parallel ◽

Traffic Density ◽

Trade Off

From disaster prevention to mitigation, drug analysis to drug design, agriculture to food security, IoT to AI, and big data analysis to knowledge or sentiment mining, a high computation power is a prime necessity at present. As such, massively parallel computer (MPC) systems comprising a large number of nodes are gaining popularity. To interconnect these huge numbers of nodes efficiently, hierarchical interconnection networks are an attractive and feasible option. A Tori-connected flattened butterfly network (TFBN) has been proposed by the authors in a prior work for future generation MPC systems. In the previous study, the static network performance and static cost-effectiveness were evaluated. In this research, a novel trade-off factor named message traffic congestion vs. packing density trade-off factor has been proposed, which characterizes the message congestion in the network and its packing density. The factor is used to statically assess the suitability of the implementation of an interconnection network. The message traffic density, packing density, and new factor have been evaluated for the proposed network and similar competitive networks such as TTN, TESH, 2D-Mesh, 3D-Mesh, 2D-Torus, and 3D-Torus. It has been found that the performance of the TFBN is superior to the other networks.

Solution of the Neutron Transport Equation on Unstructured Grids Using the Parallel Block Jacobi-Integral Transport Matrix Method via the Novel Green’s Function ITMM Construction Algorithm on Massively Parallel Computer Systems

Nuclear Science and Engineering ◽

10.1080/00295639.2021.1898309 ◽

2021 ◽

pp. 1-29

Author(s):

Dylan S. Hoagland ◽

Raffi A. Yessayan ◽

Yousry Y. Azmy

Keyword(s):

Transport Equation ◽

Matrix Method ◽

Unstructured Grids ◽

Neutron Transport ◽

Parallel Computer ◽

Massively Parallel ◽

The Novel ◽

Neutron Transport Equation ◽

Jacobi Integral ◽

Integral Transport

Fractal dimension of coastline of Australia

Scientific Reports ◽

10.1038/s41598-021-85405-0 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Akhlaq Husain ◽

Jaideep Reddy ◽

Deepika Bisht ◽

Mohammad Sajid

Keyword(s):

Fractal Dimension ◽

Parallel Processing ◽

Programming Language ◽

Power Law ◽

Fractal Geometry ◽

Parallel Computer ◽

Processing Algorithm ◽

Computational Results ◽

Random Fractal

AbstractCoastlines are irregular in nature having (random) fractal geometry and are formed by various natural activities. Fractal dimension is a measure of degree of geometric irregularity present in the coastline. A novel multicore parallel processing algorithm is presented to calculate the fractal dimension of coastline of Australia. The reliability of the coastline length of Australia is addressed by recovering the power law from our computational results. For simulations, the algorithm is implemented on a parallel computer for multi-core processing using the QGIS software, R-programming language and Python codes.

The Graph of Our Mind

Brain Sciences ◽

10.3390/brainsci11030342 ◽

2021 ◽

Vol 11 (3) ◽

pp. 342 ◽

Cited By ~ 1

Author(s):

Balázs Szalkai ◽

Bálint Varga ◽

Vince Grolmusz

Keyword(s):

Human Brain ◽

Interconnection Networks ◽

Vertex Cover ◽

Parallel Computer ◽

Quality Data ◽

Computer Engineering ◽

Human Connectome Project ◽

Graph Parameters ◽

Human Brains ◽

Male Subjects

Graph theory in the last two decades penetrated sociology, molecular biology, genetics, chemistry, computer engineering, and numerous other fields of science. One of the more recent areas of its applications is the study of the connections of the human brain. By the development of diffusion magnetic resonance imaging (diffusion MRI), it is possible today to map the connections between the 1–1.5 cm2 regions of the gray matter of the human brain. These connections can be viewed as a graph. We have computed 1015-vertex graphs with thousands of edges for hundreds of human brains from one of the highest quality data sources: the Human Connectome Project. Here we analyze the male and female braingraphs graph-theoretically and show statistically significant differences in numerous parameters between the sexes: the female braingraphs are better expanders, have more edges, larger bipartition widths, and larger vertex cover than the braingraphs of the male subjects. These parameters are closely related to the quality measures of highly parallel computer interconnection networks: the better expanding property, the large bipartition width, and the large vertex cover characterize high-quality interconnection networks. We apply the data of 426 subjects and demonstrate the statistically significant (corrected) differences in 116 graph parameters between the sexes.

High Performance Computing in Parallel Electromagnetics Simulation Code suite ACE3P

Applied Computational Electromagnetics Society ◽

10.47037/2020.aces.j.351135 ◽

2021 ◽

Vol 35 (11) ◽

pp. 1332-1333

Author(s):

Lixin Ge ◽

Zenghai Li ◽

Cho-Kuen Ng ◽

Liling Xiao

Keyword(s):

High Performance Computing ◽

High Performance ◽

Large Scale ◽

Parallel Computer ◽

Particle Accelerators ◽

Simulation Code ◽

Computer Center ◽

Component Design ◽

Code Performance ◽

Performance Computing

A comprehensive set of parallel finite-element codes suite ACE3P (Advanced Computational Electromagnetics 3D Parallel) is developed by SLAC for multi-physics modeling of particle accelerators running on massively parallel computer platforms for high fidelity and high accuracy simulation. ACE3P enables rapid virtual prototyping of accelerator and RF component design, optimization and analysis. Advanced modeling capabilities have been facilitated by implementations of novel algorithms for numerical solvers. Code performance on state-of-the-art high performance computing (HPC) platforms for large-scale RF modeling in accelerator applications will be presented in this paper. All the simulations have been performed on the supercomputers at National Energy Research Computer Center (NERSC).

Midimew Connected Torus Network for Next Generation Massively Parallel Computer System

Procedia Computer Science ◽

10.1016/j.procs.2021.01.044 ◽

2021 ◽

Vol 179 ◽

pp. 590-597

Author(s):

Maryam Manaa Al-Shammari ◽

Asrar Haque ◽

M.M. Hafizur Rahman

Keyword(s):

Computer System ◽

Parallel Computer ◽

Massively Parallel ◽

Next Generation ◽

Torus Network

Dual-tool multiplexing model of parallel computer controlled optical surfacing

Optics Letters ◽

10.1364/ol.404575 ◽

2020 ◽

Vol 45 (23) ◽

pp. 6426

Author(s):

Xiaolong Ke ◽

Tianyi Wang ◽

Heejoo Choi ◽

Weslin Pullen ◽

Lei Huang ◽

...

Keyword(s):

Parallel Computer ◽

Computer Controlled Optical Surfacing ◽

Computer Controlled

TFBN: A Cost Effective High Performance Hierarchical Interconnection Network

Applied Sciences ◽

10.3390/app10228252 ◽

2020 ◽

Vol 10 (22) ◽

pp. 8252

Author(s):

M. M. Hafizur Rahman ◽

Mohammed Al-Naeem ◽

Mohammed N. M. Ali ◽

Abu Sufian

Keyword(s):

Cost Effectiveness ◽

High Performance ◽

Mesh Networks ◽

Interconnection Network ◽

Cost Effective ◽

Parallel Computer ◽

Next Generation ◽

Basic Module ◽

Bisection Width ◽

Butterfly Network

In order to fulfill the increasing demand for computation power to process a boundless data concurrently within a very short time or real-time in many areas such as IoT, AI, machine learning, smart grid, and big data analytics, we need exa-scale or zetta-scale computation in the near future. Thus, to have this level of computation, we need a massively parallel computer (MPC) system that shall consist of millions of nodes; and, for the interconnection of these massive numbers of nodes, conventional topologies are infeasible. Thus, a hierarchical interconnection network (HIN) is a rational way to connect huge nodes. Through this article, we are proposing a new HIN, which is a tori-connected flattened butterfly network (TFBN) for the next generation MPC system. Numerous basic modules are hierarchically interconnected as a toroidal connection, whereby the basic modules are flattened butterfly networks. We have studied the network architecture, static network performance, and static cost-effectiveness of the proposed TFBN in detail; and compared static network and cost-effectiveness performance of the TFBN to those of TTN, torus, TESH, and mesh networks. It is depicted that TFBN possesses low diameter and average distance, high arc connectivity, and temperate bisection width. It also has better cost-effectiveness and cost-performance trade-off factor compared to those of TTN, torus, TESH, and mesh networks. The only shortcoming is that the complexity of wiring of the TFBN is higher than that of those networks; this is because the basic module necessitates some extra short length link to form the flattened butterfly network. Therefore, TFBN is a high performance and cost-effective HIN, and it will be a good option for the next generation MPC system.

parallel computer
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

The Multigrid Beta Function Approach for Modeling of Background Error Covariance in the Real-Time Mesoscale Analysis (RTMA)

Genetic programming for iterative numerical methods

A Novel Tradeoff Analysis between Traffic Congestion and Packing Density of Interconnection Networks for Massively Parallel Computers

Solution of the Neutron Transport Equation on Unstructured Grids Using the Parallel Block Jacobi-Integral Transport Matrix Method via the Novel Green’s Function ITMM Construction Algorithm on Massively Parallel Computer Systems

Fractal dimension of coastline of Australia

The Graph of Our Mind

High Performance Computing in Parallel Electromagnetics Simulation Code suite ACE3P

Midimew Connected Torus Network for Next Generation Massively Parallel Computer System

Dual-tool multiplexing model of parallel computer controlled optical surfacing

TFBN: A Cost Effective High Performance Hierarchical Interconnection Network

Export Citation Format

parallel computerRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

The Multigrid Beta Function Approach for Modeling of Background Error Covariance in the Real-Time Mesoscale Analysis (RTMA)

Genetic programming for iterative numerical methods

A Novel Tradeoff Analysis between Traffic Congestion and Packing Density of Interconnection Networks for Massively Parallel Computers

Solution of the Neutron Transport Equation on Unstructured Grids Using the Parallel Block Jacobi-Integral Transport Matrix Method via the Novel Green’s Function ITMM Construction Algorithm on Massively Parallel Computer Systems

Fractal dimension of coastline of Australia

The Graph of Our Mind

High Performance Computing in Parallel Electromagnetics Simulation Code suite ACE3P

Midimew Connected Torus Network for Next Generation Massively Parallel Computer System

Dual-tool multiplexing model of parallel computer controlled optical surfacing

TFBN: A Cost Effective High Performance Hierarchical Interconnection Network

parallel computer
Recently Published Documents