parallel architectures Latest Research Papers

AUTOMATION OF CONSTRUCTION OF A SCHEME FOR SOLVING COMPUTE- INTENSIVE PROBLEMS OF MATHEMATICAL PHYSICS ON SUPERCOMPUTERS

Информационные и математические технологии в науке и управлении ◽

10.38028/esi.2021.24.4.005 ◽

2022 ◽

pp. 50-59

Author(s):

Борис Михайлович Глинский ◽

Анна Федоровна Сапетина ◽

Алексей Владимирович Снытников ◽

Галина Борисовна Загорулько ◽

Юрий Алексеевич Загорулько ◽

...

Keyword(s):

Numerical Methods ◽

Semantic Web ◽

Mathematical Models ◽

Mathematical Physics ◽

Subject Area ◽

Problem Area ◽

Parallel Architectures ◽

Semantic Web Technology ◽

Parallel Code ◽

Analytical System

В статье представлен подход к разработке информационно-аналитической системы, помогающей исследователю решать вычислительно сложные задачи математической физики на суперкомпьютерах. Система автоматически строит схему решения задачи по спецификации пользователя, введенной им в режиме диалога. Схема включает наиболее подходящие математические модели для решения задачи, численные методы, алгоритмы и параллельные архитектуры, ссылки на доступные фрагменты параллельного кода, которые пользователь может использовать при разработке собственного кода. Построение схемы осуществляется на основе онтологии проблемной области «Решение вычислительно сложных задач математической физики», онтологии заданной предметной области и экспертных правил, построенных с использованием технологии Semantic Web. The paper presents an approach to the development of an information-analytical system that helps a researcher to solve compute-intensive problems of mathematical physics on supercomputers. The system automatically builds a scheme for solving the problem according to the user's specification entered by him in the dialogue mode. The scheme includes the most suitable mathematical models for solving the problem, numerical methods, algorithms and parallel architectures, links to available fragments of parallel code that the user can use when developing their own code. The construction of the scheme is carried out on the basis of the ontology of the problem area "Solving compute-intensive problems of mathematical physics", the ontology of a given subject area and expert rules built using the Semantic Web technology.

Aggregation of clans to speed-up solving linear systems on parallel architectures

International Journal of Parallel Emergent and Distributed Systems ◽

10.1080/17445760.2021.2004412 ◽

2021 ◽

pp. 1-22

Author(s):

Dmitry A. Zaitsev ◽

Tatiana R. Shmeleva ◽

Piotr Luszczek

Keyword(s):

Linear Systems ◽

Parallel Architectures ◽

Speed Up

Series Architecture on Hybrid Electric Vehicles: A Review

Energies ◽

10.3390/en14227672 ◽

2021 ◽

Vol 14 (22) ◽

pp. 7672

Author(s):

Alessandro Benevieri ◽

Lorenzo Carbone ◽

Simone Cosso ◽

Krishneel Kumar ◽

Mario Marchesoni ◽

...

Keyword(s):

Hybrid Electric Vehicles ◽

New Technologies ◽

Combustion Engine ◽

Mechanical Energy ◽

Electrical Energy ◽

Wide Bandgap ◽

Parallel Architectures ◽

Medium Size ◽

Power Electronic Converters ◽

Additional Losses

The use of series architecture nowadays is mainly on hybrid buses. In comparison with series-parallel and parallel architectures, which are usually exploited on medium-size cars, the series architecture allows achieving internal combustion engine higher efficiency. The downside of this architecture, due to a double energy conversion (i.e., mechanical energy converted in electrical energy and electrical energy converted again in mechanical energy), is that additional losses are introduced. For this reason, the parallel and the series/parallel architectures were considered more suitable for hybrid medium-size cars. Nevertheless, the use of new technologies can change this scenario. Regarding storage systems, supercapacitors achieved a significant energy density, and they guarantee much higher efficiency than battery storage. Moreover, the use of wide-bandgap components for power electronic converters, such as silicon carbide devices, assure lower losses. In this scenario, the series architecture can become competitive on medium-size cars. This paper shows a review of various studies performed on this topic.

The automated construction of a scheme for solving compute-intensive problems based on the ontological approach and Semantic Web technologies

Journal of Physics Conference Series ◽

10.1088/1742-6596/2099/1/012022 ◽

2021 ◽

Vol 2099 (1) ◽

pp. 012022

Author(s):

B M Glinskiy ◽

A F Sapetina ◽

A V Snytnikov ◽

Y A Zagorulko ◽

G B Zagorulko

Keyword(s):

Numerical Methods ◽

Semantic Web ◽

Mathematical Physics ◽

Parallel Architectures ◽

Semantic Web Technologies ◽

Web Technologies ◽

Semantic Web Technology ◽

Ontological Approach ◽

Knowledge Area ◽

Parallel Code

Abstract This paper describes the tools for supporting researchers in the development of a parallel code. The tools are based on the ontology of the knowledge area “Support for solving compute-intensive problems of mathematical physics on supercomputers”. The main result of these tools operation is a scheme for solving the problem, built according to its specification provided by the user. The scheme includes the most suitable mathematical models for solving the problem, numerical methods, algorithms, and parallel architectures, links to available fragments of a parallel code that the user can use when developing his own code. The scheme construction is carried out on the basis of ontology and expert rules built using the Semantic Web technology.

Proceedings 30th International Conference on Parallel Architectures and Compilation Techniques [Title page]

10.1109/pact52795.2021.00001 ◽

2021 ◽

Keyword(s):

Parallel Architectures ◽

Title Page ◽

International Conference ◽

Compilation Techniques

Search by triplet: An efficient local track reconstruction algorithm for parallel architectures

Journal of Computational Science ◽

10.1016/j.jocs.2021.101422 ◽

2021 ◽

pp. 101422

Author(s):

Daniel Hugo Cámpora Pérez ◽

Niko Neufeld ◽

Agustín Riscos Núñez

Keyword(s):

Reconstruction Algorithm ◽

Parallel Architectures ◽

Track Reconstruction

Solving the Bethe-Salpeter equation on massively parallel architectures

Computer Physics Communications ◽

10.1016/j.cpc.2021.108081 ◽

2021 ◽

pp. 108081

Author(s):

Xiao Zhang ◽

Sebastian Achilles ◽

Jan Winkelmann ◽

Roland Haas ◽

André Schleife ◽

...

Keyword(s):

Parallel Architectures ◽

Massively Parallel ◽

Massively Parallel Architectures

Efficient curvature-constrained least cost route optimization on parallel architectures

Engineering With Computers ◽

10.1007/s00366-021-01343-5 ◽

2021 ◽

Author(s):

Sébastien Blaise ◽

Benoit Spinewine

Keyword(s):

Parallel Architectures ◽

Route Optimization

Area-Time Efficient Two-Dimensional Reconfigurable Integer DCT Architecture for HEVC

Electronics ◽

10.3390/electronics10050603 ◽

2021 ◽

Vol 10 (5) ◽

pp. 603

Author(s):

Pramod Kumar Meher ◽

Siew-Kei Lam ◽

Thambipillai Srikanthan ◽

Dong Hwan Kim ◽

Sang Yoon Park

Keyword(s):

High Efficiency ◽

Unit Area ◽

Critical Path ◽

Matrix Multiplication ◽

Parallel Architectures ◽

Reconfigurable Architectures ◽

High Efficiency Video Coding ◽

Precise Estimation ◽

Propagation Delays ◽

Integer Dct

In this paper, we present area-time efficient reconfigurable architectures for the implementation of the integer discrete cosine transform (DCT), which supports all the transform lengths to be used in High Efficiency Video Coding (HEVC). We propose three 1D reconfigurable architectures that can be configured for the computation of the DCT of any of the prescribed lengths such as 4, 8, 16, and 32. It is shown that matrix multiplication schemes involving fewer adders can be used to derive parallel architectures for 1D integer DCT of different lengths. A novel transposition buffer is designed to be used for the proposed 2D DCT architecture, which offers double the throughput without increasing the size of the transposition buffer. We determine the optimal pipeline locations in the proposed design through the precise estimation of propagation delays and the critical path so that the area-delay-product is optimized and all the output samples are obtained in the same cycle in spite of the recursive nature of the structure. Implementation results show that the proposed 2D integer DCT architectures provide significantly higher throughput per unit area than the existing designs for HEVC.

Parallelization of the K-Means++ Clustering Algorithm

Ingénierie des systèmes d information ◽

10.18280/isi.260106 ◽

2021 ◽

Vol 26 (1) ◽

pp. 59-66

Author(s):

Sara Daoudi ◽

Chakib Mustapha Anouar Zouaoui ◽

Miloud Chikr El-Mezouar ◽

Nasreddine Taleb

Keyword(s):

Graphics Processing Units ◽

Clustering Algorithm ◽

Large Data ◽

Parallel Architectures ◽

Programming Environment ◽

Sequential Mode ◽

Point Distance ◽

Graphics Processing ◽

Sequential Implementation

K-means++ is the clustering algorithm that is created to improve the process of getting initial clusters in the K-means algorithm. The k-means++ algorithm selects initial k-centroids arbitrarily dependent on a probability that is proportional to each data-point distance to the existing centroids. The most noteworthy problem of this algorithm is when running happens in sequential mode, as this reduces the speed of clustering. In this paper, we develop a new parallel k-means++ algorithm using the graphics processing units (GPU) where the Open Computing Language (OpenCL) platform is used as the programming environment to perform the data assignment phase in parallel while the Streaming SIMD Extension (SSE) technology is used to perform the initialization step to select the initial centroids in parallel on CPU. The focus is on optimizations directly targeted to this architecture to exploit the most of the available computing capabilities. Our objective is to minimize runtime while keeping the quality of the serial implementation. Our outcomes demonstrate that the implementation of targeting hybrid parallel architectures (CPU & GPU) is the most appropriate for large data. We have been able to achieve a 152 times higher throughput than that of the sequential implementation of k-means ++.

parallel architectures
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

AUTOMATION OF CONSTRUCTION OF A SCHEME FOR SOLVING COMPUTE- INTENSIVE PROBLEMS OF MATHEMATICAL PHYSICS ON SUPERCOMPUTERS

Aggregation of clans to speed-up solving linear systems on parallel architectures

Series Architecture on Hybrid Electric Vehicles: A Review

The automated construction of a scheme for solving compute-intensive problems based on the ontological approach and Semantic Web technologies

Proceedings 30th International Conference on Parallel Architectures and Compilation Techniques [Title page]

Search by triplet: An efficient local track reconstruction algorithm for parallel architectures

Solving the Bethe-Salpeter equation on massively parallel architectures

Efficient curvature-constrained least cost route optimization on parallel architectures

Area-Time Efficient Two-Dimensional Reconfigurable Integer DCT Architecture for HEVC

Parallelization of the K-Means++ Clustering Algorithm

Export Citation Format

parallel architecturesRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

AUTOMATION OF CONSTRUCTION OF A SCHEME FOR SOLVING COMPUTE- INTENSIVE PROBLEMS OF MATHEMATICAL PHYSICS ON SUPERCOMPUTERS

Aggregation of clans to speed-up solving linear systems on parallel architectures

Series Architecture on Hybrid Electric Vehicles: A Review

The automated construction of a scheme for solving compute-intensive problems based on the ontological approach and Semantic Web technologies

Proceedings 30th International Conference on Parallel Architectures and Compilation Techniques [Title page]

Search by triplet: An efficient local track reconstruction algorithm for parallel architectures

Solving the Bethe-Salpeter equation on massively parallel architectures

Efficient curvature-constrained least cost route optimization on parallel architectures

Area-Time Efficient Two-Dimensional Reconfigurable Integer DCT Architecture for HEVC

Parallelization of the K-Means++ Clustering Algorithm

parallel architectures
Recently Published Documents