Static Fault-Tolerant Strategy for High Performance Computing Platform

It is an important research issue to ensure the computation correctness for parallel application and enhance the using rate of dynamic computing resource in distributed computing system. Based on the previous high performance distributing computing system, a fault-tolerant and task scheduler was developed, which combined the breathe mechanism, fault-discover mechanism and subtask reschedule mechanism. Experiments show that the fault-tolerant and task-scheduler has good performance and ensures the computation correctness even if when some computing resources fail.

Download Full-text

The “Chimera”: An Off-The-Shelf CPU/GPGPU/FPGA Hybrid Computing Platform

International Journal of Reconfigurable Computing ◽

10.1155/2012/241439 ◽

2012 ◽

Vol 2012 ◽

pp. 1-10 ◽

Cited By ~ 23

Author(s):

Ra Inta ◽

David J. Bowman ◽

Susan M. Scott

Keyword(s):

Parallel Computing ◽

High Performance ◽

Hardware Acceleration ◽

Real Data ◽

Computing System ◽

Pci Express ◽

Computing Platform ◽

Commercial Off The Shelf ◽

Performance Computing ◽

Modern Astronomy

The nature of modern astronomy means that a number of interesting problems exhibit a substantial computational bound and this situation is gradually worsening. Scientists, increasingly fighting for valuable resources on conventional high-performance computing (HPC) facilities—often with a limited customizable user environment—are increasingly looking to hardware acceleration solutions. We describe here a heterogeneous CPU/GPGPU/FPGA desktop computing system (the “Chimera”), built with commercial-off-the-shelf components. We show that this platform may be a viable alternative solution to many common computationally bound problems found in astronomy, however, not without significant challenges. The most significant bottleneck in pipelines involving real data is most likely to be the interconnect (in this case the PCI Express bus residing on the CPU motherboard). Finally, we speculate on the merits of our Chimera system on the entire landscape of parallel computing, through the analysis of representative problems from UC Berkeley’s “Thirteen Dwarves.”

Download Full-text

Electronic Voting System with Cloud Based High Performance Computing

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2019.7807 ◽

2019 ◽

Vol 16 (2) ◽

pp. 768-772 ◽

Cited By ~ 1

Author(s):

R. Jothikumar ◽

Kumar Subramaniam ◽

Siva G. Shanmugam ◽

S. Susi

Keyword(s):

Cloud Computing ◽

High Performance Computing ◽

High Performance ◽

Computing System ◽

Electronic Voting ◽

System A ◽

Voting System ◽

Electronic Voting System ◽

Super Computing ◽

Performance Computing

Traditional voting system has been replaced by electronic voting systems in most places increasingly. It is efficient, but not efficient enough in terms of cost and capacity. High Performance Computing (HPC) in Cloud computing is a relatively new concept which has been replacing the traditional systems. The HPC has been widely used over the recent years because of its efficiency, reliability, speed and cost. Whereas in the traditional super computing system a lot of cost is involved. The ability of integration of HPC with cloud provided enormous growth in the area of parallel processing and computing. This system may advocate to push the case of promoting electronic voting system for higher traffic scenarios with lower cost requirements. This paper proposes an idea of implementing a fully formed e-voting system integrated with both HPC and Cloud Computing.

Download Full-text

Construction of high performance computing system for fusion research using cluster technology

Journal of Computer Applications ◽

10.3724/sp.j.1087.2009.02132 ◽

2009 ◽

Vol 29 (8) ◽

pp. 2132-2135

Author(s):

Wei PAN ◽

Liao-yuan CHEN ◽

Yong-ge LI ◽

Jin-hua ZHANG ◽

Li PAN ◽

...

Keyword(s):

High Performance Computing ◽

High Performance ◽

Computing System ◽

Fusion Research ◽

High Performance Computing System ◽

Performance Computing ◽

Cluster Technology

Download Full-text

Investigating the usefulness of a micro high performance computing system as an educational tool

Proceedings of the 2nd International Conference on Intelligent and Innovative Computing Applications ◽

10.1145/3415088.3415105 ◽

2020 ◽

Author(s):

Nkundwe Moses Mwasaga ◽

Mike Joy

Keyword(s):

High Performance Computing ◽

High Performance ◽

Computing System ◽

Educational Tool ◽

High Performance Computing System ◽

Performance Computing

Download Full-text

High-performance computing system and artificial language recognition in the visual application of intangible cultural heritage art design

Personal and Ubiquitous Computing ◽

10.1007/s00779-021-01619-z ◽

2021 ◽

Author(s):

Xiaodan Peng

Keyword(s):

Cultural Heritage ◽

High Performance Computing ◽

High Performance ◽

Computing System ◽

Intangible Cultural Heritage ◽

Artificial Language ◽

Language Recognition ◽

High Performance Computing System ◽

Art Design ◽

Performance Computing

Download Full-text

A High-Performance Computing System for Probabilistic Weather Forecasts

10.1002/essoar.10500383.1 ◽

2019 ◽

Author(s):

Weiming Hu ◽

Guido Cervone ◽

Vivek Balasubramanian ◽

Matteo Turilli ◽

Shantenu Jha

Keyword(s):

High Performance Computing ◽

High Performance ◽

Computing System ◽

Weather Forecasts ◽

High Performance Computing System ◽

Performance Computing

Download Full-text

A SURVEY OF CHECKPOINT/RESTART TECHNIQUES ON DISTRIBUTED MEMORY SYSTEMS

Parallel Processing Letters ◽

10.1142/s0129626413400112 ◽

2013 ◽

Vol 23 (04) ◽

pp. 1340011 ◽

Cited By ~ 7

Author(s):

FAISAL SHAHZAD ◽

MARKUS WITTMANN ◽

MORITZ KREUTZER ◽

THOMAS ZEISER ◽

GEORG HAGER ◽

...

Keyword(s):

High Performance ◽

Building Blocks ◽

Memory Systems ◽

Time To Failure ◽

Flow Solver ◽

The Road ◽

System A ◽

Node Level ◽

Mean Time ◽

Performance Computing

The road to exascale computing poses many challenges for the High Performance Computing (HPC) community. Each step on the exascale path is mainly the result of a higher level of parallelism of the basic building blocks (i.e., CPUs, memory units, networking components, etc.). The reliability of each of these basic components does not increase at the same rate as the rate of hardware parallelism. This results in a reduction of the mean time to failure (MTTF) of the whole system. A fault tolerance environment is thus indispensable to run large applications on such clusters. Checkpoint/Restart (C/R) is the classic and most popular method to minimize failure damage. Its ease of implementation makes it useful, but typically it introduces significant overhead to the application. Several efforts have been made to reduce the C/R overhead. In this paper we compare various C/R techniques for their overheads by implementing them on two different categories of applications. These approaches are based on parallel-file-system (PFS)-level checkpoints (synchronous/asynchronous) and node-level checkpoints. We utilize the Scalable Checkpoint/Restart (SCR) library for the comparison of node-level checkpoints. For asynchronous PFS-level checkpoints, we use the Damaris library, the SCR asynchronous feature, and application-based checkpointing via dedicated threads. Our baseline for overhead comparison is the naïve application-based synchronous PFS-level checkpointing method. A 3D lattice-Boltzmann (LBM) flow solver and a Lanczos eigenvalue solver are used as prototypical applications in which all the techniques considered here may be applied.

Download Full-text

Taxonomic assignment for large-scale metagenomic data on high-perfomance systems

Journal of Computer Science and Cybernetics ◽

10.15625/1813-9663/33/2/10753 ◽

2017 ◽

Vol 33 (2) ◽

pp. 119-130

Author(s):

Vinh Van Le ◽

Hoai Van Tran ◽

Hieu Ngoc Duong ◽

Giang Xuan Bui ◽

Lang Van Tran

Keyword(s):

High Performance Computing ◽

Assignment Problem ◽

High Performance ◽

Large Scale ◽

Computing System ◽

Metagenomic Data ◽

Taxonomic Assignment ◽

High Performance Computing System ◽

Powerful Approach ◽

Performance Computing

Metagenomics is a powerful approach to study environment samples which do not require the isolation and cultivation of individual organisms. One of the essential tasks in a metagenomic project is to identify the origin of reads, referred to as taxonomic assignment. Due to the fact that each metagenomic project has to analyze large-scale datasets, the metatenomic assignment is very much computation intensive. This study proposes a parallel algorithm for the taxonomic assignment problem, called SeMetaPL, which aims to deal with the computational challenge. The proposed algorithm is evaluated with both simulated and real datasets on a high performance computing system. Experimental results demonstrate that the algorithm is able to achieve good performance and utilize resources of the system efficiently. The software implementing the algorithm and all test datasets can be downloaded at http://it.hcmute.edu.vn/bioinfo/metapro/SeMetaPL.html.

Download Full-text