VERIFYING VERY LARGE INDUSTRIAL CIRCUITS USING 100 PROCESSES AND BEYOND

Recent advances in scheduling and networking have paved the way for efficient exploitation of large-scale distributed computing platforms such as computational grids and huge clusters. Such infrastructures hold great promise for the highly resource-demanding task of verifying and checking large models, given that model checkers would be designed with a high degree of scalability and flexibility in mind. In this paper we focus on the mechanisms required to execute a high-performance, distributed, symbolic model checker on top of a large-scale distributed environment. We develop a hybrid algorithm for slicing the state space and dynamically distribute the work among the worker processes. We show that the new approach is faster, more effective, and thus much more scalable than previous slicing algorithms. We then present a checkpoint-restart module that has very low overhead. This module can be used to combat failures, the likelihood of which increases with the size of the computing plat-form. However, checkpoint-restart is even more handy for the scheduling system: it can be used to avoid reserving large numbers of workers, thus making the distributed computation work-efficient. Finally, we discuss for the first time the effect of reorder on the distributed model checker and show how the distributed system performs more efficient reordering than the sequential one. We implemented our contributions on a network of 200 processors, using a distributed scalable scheme that employs a high-performance industrial model checker from Intel. Our results show that the system was able to verify real-life models much larger than was previously possible.

Download Full-text

Measuring and tuning energy efficiency on large scale high performance computing platforms.

10.2172/1035312 ◽

2011 ◽

Cited By ~ 1

Author(s):

James H., III Laros

Keyword(s):

Energy Efficiency ◽

High Performance Computing ◽

High Performance ◽

Large Scale ◽

Computing Platforms ◽

Performance Computing

Download Full-text

Adaptive Threshold Based Scheduler for Batch of Independent Jobs for Cloud Computing System

Research Anthology on Architectures, Frameworks, and Integration Strategies for Distributed and Cloud Computing ◽

10.4018/978-1-7998-5339-8.ch110 ◽

2021 ◽

pp. 2246-2266

Author(s):

TAJ ALAM ◽

PARITOSH DUBEY ◽

ANKIT KUMAR

Keyword(s):

Cloud Computing ◽

Distributed Systems ◽

High Performance ◽

Large Scale ◽

Real Life ◽

Interval Estimation ◽

Computing System ◽

Adaptive Threshold ◽

Batch Simulation ◽

Heterogeneous Distributed Systems

Distributed systems are efficient means of realizing high-performance computing (HPC). They are used in meeting the demand of executing large-scale high-performance computational jobs. Scheduling the tasks on such computational resources is one of the prime concerns in the heterogeneous distributed systems. Scheduling jobs on distributed systems are NP-complete in nature. Scheduling requires either heuristic or metaheuristic approach for sub-optimal but acceptable solutions. An adaptive threshold-based scheduler is one such heuristic approach. This work proposes adaptive threshold-based scheduler for batch of independent jobs (ATSBIJ) with the objective of optimizing the makespan of the jobs submitted for execution on cloud computing systems. ATSBIJ exploits the features of interval estimation for calculating the threshold values for generation of efficient schedule of the batch. Simulation studies on CloudSim ensures that the ATSBIJ approach works effectively for real life scenario.

Download Full-text

An Efficient 3-D FNPF Numerical Wave Tank for Virtual Large-Scale Wave Basin Experiment

Volume 4: Offshore Geotechnics; Ronald W. Yeung Honoring Symposium on Offshore and Ship Hydrodynamics ◽

10.1115/omae2012-83760 ◽

2012 ◽

Author(s):

Seshu Nimmala ◽

Solomon Yim ◽

Stephan Grilli

Keyword(s):

High Performance ◽

Large Scale ◽

Mathematical Formulation ◽

Three Dimensional ◽

Numerical Wave Tank ◽

Wave Tank ◽

Wave Basin ◽

Nonlinear Potential ◽

Numerical Wave ◽

Computing Platforms

This paper presents an accurate and efficient three-dimensional computational model (3D numerical wave tank), based on fully nonlinear potential flow (FNPF) theory, and its extension to incorporate the motion of a laboratory snake piston wavemaker, to simulate experiments in a large-scale 3D wave basin (i.e. to conduct “virtual” or numerical experiments). The code is based on a higher-order boundary element method combined with a Fast Multipole Algorithm (FMA). Particular efforts were devoted to making the code efficient for large-scale simulations using high-performance computing platforms to complement experimental 3D wave basins. The numerical simulation capability can serve as an optimization tool at the experimental planning and detailed design stages. To date, waves that can be generated in the NWT include solitary, Cnoidal, and Airy waves. In this paper, we detail the model, mathematical formulation, and wave generation. Experimental or analytical comparisons with NWT results are provided for several cases to assess the accuracy and applicability of the numerical model to practical engineering problems.

Download Full-text

APPLICATION OF GENETIC ALGORITHMS TO A LARGE-SCALE MULTIPLE-CONSTRAINT VEHICLE ROUTING PROBLEM

International Journal of Computational Intelligence and Applications ◽

10.1142/s1469026803000835 ◽

2003 ◽

Vol 03 (01) ◽

pp. 1-21 ◽

Cited By ~ 3

Author(s):

GEORGE MOURKOUSIS ◽

MATHEW PROTONOTARIOS ◽

THEODORA VARVARIGOU

Keyword(s):

Vehicle Routing ◽

Vehicle Routing Problem ◽

High Performance ◽

Large Scale ◽

Hybrid Genetic Algorithm ◽

Real Life ◽

Search Space ◽

Genetic Operators ◽

Actual Problem ◽

Routing Problem

This paper presents a study on the application of a hybrid genetic algorithm (HGA) to an extended instance of the Vehicle Routing Problem. The actual problem is a complex real-life vehicle routing problem regarding the distribution of products to customers. A non homogenous fleet of vehicles with limited capacity and allowed travel time is available to satisfy the stochastic demand of a set of different types of customers with earliest and latest time for servicing. The objective is to minimize distribution costs respecting the imposed constraints (vehicle capacity, customer time windows, driver working hours and so on). The approach for solving the problem was based on a "cluster and route" HGA. Several genetic operators, selection and replacement methods were tested until the HGA became efficient for optimization of a multi-extrema search space system (multi-modal optimization). Finally, High Performance Computing (HPC) has been applied in order to provide near-optimal solutions in a sensible amount of time.

Download Full-text

Large-scale machine learning based on functional networks for biomedical big data with high performance computing platforms

Journal of Computational Science ◽

10.1016/j.jocs.2015.09.008 ◽

2015 ◽

Vol 11 ◽

pp. 69-81 ◽

Cited By ~ 32

Author(s):

Emad Elsebakhi ◽

Frank Lee ◽

Eric Schendel ◽

Anwar Haque ◽

Nagarajan Kathireason ◽

...

Keyword(s):

Machine Learning ◽

Big Data ◽

High Performance Computing ◽

High Performance ◽

Large Scale ◽

Functional Networks ◽

Computing Platforms ◽

Performance Computing

Download Full-text

Full-wave moment method simulation of large-scale antenna arrays on high performance computing platforms

IEEE Antennas and Propagation Society International Symposium. Digest. Held in conjunction with: USNC/CNC/URSI North American Radio Sci. Meeting (Cat. No.03CH37450) ◽

10.1109/aps.2003.1219276 ◽

2004 ◽

Author(s):

Donghoon Chun ◽

K. Sabet

Keyword(s):

High Performance Computing ◽

Antenna Arrays ◽

Moment Method ◽

High Performance ◽

Large Scale ◽

Full Wave ◽

Computing Platforms ◽

Performance Computing

Download Full-text

Significant Contribution of Single Atomic Mn Implanted in Carbon Nanosheets to High-performance Sodium-ion Hybrid Capacitors

Energy & Environmental Science ◽

10.1039/d1ee00370d ◽

2021 ◽

Author(s):

Xiang Hu ◽

Genxiang Wang ◽

Junwei Li ◽

Junheng Huang ◽

Yangjie Liu ◽

...

Keyword(s):

Energy Storage ◽

High Performance ◽

Large Scale ◽

Significant Contribution ◽

Electrochemical Capacitors ◽

Sodium Ion ◽

Great Promise ◽

Sodium Ion Batteries ◽

Carbon Nanosheets ◽

Hybrid Capacitors

Sodium-ion hybrid capacitors (SIHCs) hold great promise in large-scale energy storage by compromising the merits of sodium-ion batteries and electrochemical capacitors, the mismatch of kinetic and capacity between battery-type anode...

Download Full-text

Analytical Performance Estimation for Large-Scale Reconfigurable Dataflow Platforms

ACM Transactions on Reconfigurable Technology and Systems ◽

10.1145/3452742 ◽

2021 ◽

Vol 14 (3) ◽

pp. 1-21

Author(s):

Ryota Yasudo ◽

José G. F. Coutinho ◽

Ana-Lucia Varbanescu ◽

Wayne Luk ◽

Hideharu Amano ◽

...

Keyword(s):

High Performance Computing ◽

High Performance ◽

Large Scale ◽

Heterogeneous Systems ◽

Performance Estimation ◽

Performance Impact ◽

Accurate Performance ◽

Computing Platforms ◽

Reduced Power Consumption ◽

Performance Computing

Next-generation high-performance computing platforms will handle extreme data- and compute-intensive problems that are intractable with today’s technology. A promising path in achieving the next leap in high-performance computing is to embrace heterogeneity and specialised computing in the form of reconfigurable accelerators such as FPGAs, which have been shown to speed up compute-intensive tasks with reduced power consumption. However, assessing the feasibility of large-scale heterogeneous systems requires fast and accurate performance prediction. This article proposes Performance Estimation for Reconfigurable Kernels and Systems (PERKS), a novel performance estimation framework for reconfigurable dataflow platforms. PERKS makes use of an analytical model with machine and application parameters for predicting the performance of multi-accelerator systems and detecting their bottlenecks. Model calibration is automatic, making the model flexible and usable for different machine configurations and applications, including hypothetical ones. Our experimental results show that PERKS can predict the performance of current workloads on reconfigurable dataflow platforms with an accuracy above 91%. The results also illustrate how the modelling scales to large workloads, and how performance impact of architectural features can be estimated in seconds.

Download Full-text

NETWORK MODELING ISSUES FOR GRID APPLICATION SCHEDULING

International Journal of Foundations of Computer Science ◽

10.1142/s0129054105002929 ◽

2005 ◽

Vol 16 (02) ◽

pp. 145-162 ◽

Cited By ~ 20

Author(s):

HENRI CASANOVA

Keyword(s):

High Performance ◽

Large Scale ◽

Network Modeling ◽

Network Models ◽

Wide Area Networks ◽

Scheduling Problems ◽

Simulation Framework ◽

Bandwidth Sharing ◽

Main Challenge ◽

Computing Platforms

The dominant trend in scientific computing today is the establishment of platforms that span multiple institutions to support applications at unprecedented scales. On most distributed computing platforms a requirement to achieve high performance is the careful scheduling of distributed application components onto the available resources. While scheduling has been an active area of research for many decades most of the platform models traditionally used in scheduling research, and in particular network models, break down for platforms spanning wide-area networks. In this paper we examine network modeling issues for large-scale platforms from the perspective of scheduling. The main challenge we address is the development of models that are sophisticated enough to be more realistic than those traditionally used in the field, but simple enough that they are still amenable to analysis. In particular, we discuss issues of bandwidth sharing and topology modeling. Also, while these models can be used to define and reason about realistic scheduling problems, we show that they also provide a good basis for fast simulation, which is the typical method to evaluate scheduling algorithms, as demonstrated in our implementation of the SIMGRID simulation framework.

Download Full-text

swHPFM: Refactoring and Optimizing the Structured Grid Fluid Mechanical Algorithm on the Sunway TaihuLight Supercomputer

Applied Sciences ◽

10.3390/app10010072 ◽

2019 ◽

Vol 10 (1) ◽

pp. 72 ◽

Cited By ~ 1

Author(s):

Jingbo Li ◽

Xingjun Zhang ◽

Jianfeng Zhou ◽

Xiaoshe Dong ◽

Chuhua Zhang ◽

...

Keyword(s):

High Performance ◽

Large Scale ◽

Axial Compressor ◽

Computing Power ◽

Structured Grid ◽

Compressor Rotor ◽

Sunway Taihulight ◽

Computing Performance ◽

Computing Platforms ◽

Many Core

Fluid mechanical simulation is a typical high-performance computing problem. Due to the development of high-precision parallel algorithms, traditional computing platforms are unable to satisfy the computing requirements of large-scale algorithms. The Sunway TaihuLight supercomputer, which uses the SW26010 processor as its computing node, provides a powerful computing performance for this purpose. In this paper, the Sunway hierarchical parallel fluid machinery (swHPFM) framework and algorithm are proposed. Using the proposed framework and algorithm, engineers can exploit the parallelism of the existing fluid mechanical algorithm and achieve a satisfactory performance on the Sunway TaihuLight. In the framework, a suitable mapping of the model and the system architecture is developed, and the computing power of the SW26010 processor is fully utilized via the scratch pad memory (SPM) access strategy and serpentine register communication. In addition, the framework is implemented and tested by the axial compressor rotor simulation algorithm on a real-world dataset with Sunway many-core processors. The results demonstrate that we can achieve a speedup of up to 8.2×, compared to the original ported version, which only uses management processing elements (MPEs), as well as a 1.3× speedup compared to an Intel Xeon E5 processor. The proposed framework is useful for the optimization of fluid mechanical algorithm programs on computing platforms with a heterogeneous many-core architecture.

Download Full-text