XcalableMP 2.0 and Future Directions

Abstract This chapter presents XcalableMP on Fugaku, the Japanese flagship supercomputer developed by the FLAGSHIP2020 project at RIKEN R-CCS. The porting and performance evaluation were done as part of this project, and XcalableMP is available to Fugaku users for improving the productivity and performance of parallel programming. The performance of XcalableMP on Fugaku is enhanced by the manycore processor and the new Tofu-D interconnect. We are now working on the next version, XcalableMP 2.0, which targets cutting-edge high-performance systems with manycore processors through multithreading and multitasking, integrating the PGAS model with synchronization models. We conclude this book with retrospectives and challenges for future PGAS models.

Self-assembly of porphyrin on the surface of a novel composite high performance photocatalyst for the degradation of organic dye from water: Characterization and performance evaluation

Journal of Environmental Chemical Engineering ◽

10.1016/j.jece.2021.106034 ◽

2021 ◽

pp. 106034

Author(s):

Duong Duc La ◽

Tuan Anh Nguyen ◽

X. Sang Nguyen ◽

Tuan N. Truong ◽

H. Phuong Nguyen T. ◽

...

Keyword(s):

Performance Evaluation ◽

Self Assembly ◽

High Performance ◽

Organic Dye ◽

And Performance

High-Performance Workwear for Coal Miners in Northern China: Design and Performance Evaluation

Autex Research Journal ◽

10.2478/aut-2021-0020 ◽

2021 ◽

Vol 0 (0) ◽

Author(s):

Ying Ke ◽

Qing Zheng ◽

Faming Wang ◽

Min Wang ◽

Yi Wang

Keyword(s):

Performance Evaluation ◽

Thermal Comfort ◽

High Performance ◽

Northern China ◽

Coal Miners ◽

Climate Chamber ◽

Local Skin ◽

Worker Performance ◽

And Performance ◽

Skin Temperatures

Abstract The design of workwear has significant effects on worker performance. However, the current workwear for coal miners in Northern China is poor in fit and thermal comfort. In this study, new workwear (NEW) for coal miners was developed with design features providing better cold protection and movement comfort than a commonly worn workwear (CON). To evaluate the effectiveness of NEW, we conducted human trials using simulated work movements (i.e., sitting, shoveling, squatting, and crawling) in a climate chamber (10°C, 75% RH). Physiological measurements and perceptual responses were obtained. The results demonstrated that the local skin temperatures at the chest, scapula, thigh, and calf; the mean skin temperatures; and thermal comfort in NEW were significantly higher than those in CON. NEW also improved movement comfort. We conclude that NEW could meet the cold protection and mobility requirements well.

AN ASSOCIATIVE DATA PARALLEL COMPILATION MODEL FOR TIGHT INTEGRATION OF HIGH PERFORMANCE KNOWLEDGE RETRIEVAL AND COMPUTING

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213094000078 ◽

1994 ◽

Vol 03 (01) ◽

pp. 97-125 ◽

Cited By ~ 3

Author(s):

ARVIND K. BANSAL

Keyword(s):

Performance Evaluation ◽

High Performance ◽

Loose Coupling ◽

Abstract Machine ◽

Data Movement ◽

Left Hand ◽

Low Level ◽

Data Parallel ◽

Data Alignment ◽

And Performance

Associative computation is characterized by the intertwining of search by content and data parallel computation. An algebra for associative computation is described. A compilation-based model and a novel abstract machine for associative logic programming are presented. The model uses loose coupling of the left hand side of the program, treated as data, and the right hand side of the program, treated as low level code. This representation achieves efficiency by associative computation and data alignment during goal reduction and during execution of low level abstract instructions. Data alignment reduces the overhead of data movement. Novel schemes are presented for associative manipulation of aliased uninstantiated variables and for data parallel goal reduction in the presence of multiple occurrences of the same variables in a goal. The architecture, behavior, and performance evaluation of the model are presented.

VoD Service Model and Performance Evaluation on the China's High Performance Broadband Information Network (3Tnet)

Sixth International Conference on Parallel and Distributed Computing Applications and Technologies (PDCAT'05) ◽

10.1109/pdcat.2005.249 ◽

2005 ◽

Author(s):

Feng Lin ◽

Xin Wang ◽

Xiangyang Xue

Keyword(s):

Performance Evaluation ◽

High Performance ◽

Information Network ◽

Service Model ◽

And Performance

The design and performance evaluation of a new high-performance mass-analyzed ion kinetic energy (MIKE) spectrometer

International Journal of Mass Spectrometry and Ion Physics ◽

10.1016/0020-7381(80)80090-8 ◽

1980 ◽

Vol 35 (3-4) ◽

pp. 381-391 ◽

Cited By ~ 38

Author(s):

D.H. Russell ◽

D.H. Smith ◽

R.J. Warmack ◽

L.K. Bertram

Keyword(s):

Performance Evaluation ◽

Kinetic Energy ◽

High Performance ◽

Ion Kinetic Energy ◽

And Performance

Parallelization and Performance Evaluation of an Edge Detection Algorithm on a Streaming Multi-Core Engine

Journal of Information Technology Research ◽

10.4018/jitr.2009062906 ◽

2009 ◽

Vol 2 (4) ◽

pp. 81-91 ◽

Cited By ~ 1

Author(s):

Hashir Karim Kidwai ◽

Fadi N. Sibai ◽

Tamer Rabie

Keyword(s):

Performance Evaluation ◽

Edge Detection ◽

High Performance ◽

Detection Algorithm ◽

Cell Processor ◽

Edge Detector ◽

Processing Application ◽

Image Processing Application ◽

Host Processor ◽

And Performance

In the world of multi-core processors, the STI Cell Broadband Engine (BE) stands out as a heterogeneous 9-core processor with a PowerPC host processor (PPE) and 8 synergistic processing elements (SPEs). The Cell BE architecture is designed to improve upon conventional processors in graphics and related areas by integrating 8 computation engines, each with multiple execution units and large register sets, to achieve a high performance per area return. In this paper, we discuss the parallelization, implementation, and performance evaluation of an edge detection image processing application based on the Roberts edge detector on the Cell BE. We report the edge detection performance measured on a computer with one Cell processor and with varying numbers of SPEs enabled. These results are compared to the results obtained on the Cell's single PPE with all 8 SPEs disabled. The results indicate that edge detection performs 10 times faster on the Cell BE than on modern RISC processors.

Challenges Faced in Enhancing the Performance and Scalability in Parallel Computing Architecture

Advances in Computer and Electrical Engineering - Handbook of Research on Advanced Computational Techniques for Simulation-Based Engineering ◽

10.4018/978-1-4666-9479-8.ch010 ◽

2016 ◽

pp. 252-269

Author(s):

M. Narayana Moorthi ◽

R. Manjula

Keyword(s):

Parallel Computing ◽

High Performance ◽

Cache Coherence ◽

Computing Architecture ◽

High Performance Systems ◽

Practical Limit ◽

Workload Distribution ◽

And Performance ◽

The Given ◽

Processor Cores

Nowadays the architecture of high performance systems is improving, with more and more processor cores on the chip. This brings both benefits and challenges. The benefit is that more tasks run simultaneously, which reduces the running time of the program or application. The challenges are: what is the maximum number of cores on a given chip; how existing and future software will make use of all the cores; which parallel programming language to choose; what memory and cache coherence issues arise as the number of cores increases; how to solve the power and performance issues; how the cores are connected and how they communicate to solve a single problem; and how to handle workload distribution and load balancing in terms of scalability. There is a practical limit to the speedup and scalability achievable with the number of cores on a chip, which needs to be analyzed. This chapter therefore focuses on an introduction to and overview of parallel computing and the challenges faced in enhancing the performance and scalability of parallel computing architecture.

Synthesis and performance evaluation of a water-soluble copolymer as high-performance fluid loss additive for water-based drilling fluid at high temperature

Russian Journal of Applied Chemistry ◽

10.1134/s1070427216100190 ◽

2016 ◽

Vol 89 (10) ◽

pp. 1694-1705 ◽

Cited By ~ 2

Author(s):

Ma Xiping ◽

Zhu Zhongxiang ◽

Hou Daiyong ◽

Shi Wei

Keyword(s):

Performance Evaluation ◽

High Temperature ◽

High Performance ◽

Drilling Fluid ◽

Water Soluble ◽

Fluid Loss ◽

Water Based ◽

Water Soluble Copolymer ◽

And Performance ◽

Fluid Loss Additive

An Innovative Design of an Integrated MED-TVC and Reverse Osmosis System for Seawater Desalination: Process Explanation and Performance Evaluation

Processes ◽

10.3390/pr8050607 ◽

2020 ◽

Vol 8 (5) ◽

pp. 607

Author(s):

Omer Mohamed Abubaker Al-hotmani ◽

Mudhar Abdul Alwahab Al-Obaidi ◽

Yakubu Mandafiya John ◽

Raj Patel ◽

Iqbal Mohammed Mujtaba

Keyword(s):

Reverse Osmosis ◽

High Performance ◽

Integrated System ◽

Operating Conditions ◽

The Novel ◽

Innovative Design ◽

High Performance Systems ◽

Total Productivity ◽

And Performance ◽

Vapour Compression

In recent times, two or more desalination processes have been combined to form integrated systems, which have been widely used to resolve the limitations of individual processes and to produce high performance systems. In this regard, a simple integrated system of the Multi Effect Distillation (MED)/Thermal Vapour Compression (TVC) and Permeate Reprocessing Reverse Osmosis (PRRO) process was developed by the same authors, and its validity was confirmed by a comparison study against other developed configurations. However, this design has a considerable retentate flowrate and low productivity. To resolve this issue, two novel designs of MED and double reverse osmosis (RO) processes including Permeate and Retentate Reprocessing designs (PRRP and RRRO) are developed and modelled in this paper. To systematically assess the consistency of the presented designs, the performance indicators of the novel designs are compared against previous simple designs of MED and PRRO processes at a specified set of operating conditions. Results show the superiority of the integrated MED and double permeate reprocessing design. It achieves both economic and environmental advantages: total productivity is increased by around 9% and total retentate flowrate (disposed to water bodies) is reduced by 5%, with marginally reduced energy consumption.

RAPID for high-performance computing systems: architecture and performance evaluation

Applied Optics ◽

10.1364/ao.45.006326 ◽

2006 ◽

Vol 45 (25) ◽

pp. 6326 ◽

Cited By ~ 7

Author(s):

Avinash Karanth Kodi ◽

Ahmed Louri

Keyword(s):

Performance Evaluation ◽

High Performance Computing ◽

High Performance ◽

Computing Systems ◽

Systems Architecture ◽

And Performance ◽

Performance Computing
