XcalableMP 2.0 and Future Directions

Author(s):  
Mitsuhisa Sato ◽  
Hitoshi Murai ◽  
Masahiro Nakao ◽  
Keisuke Tsugane ◽  
Tetsuya Odajima ◽  
...  

Abstract This chapter presents XcalableMP on the Fugaku supercomputer, the Japanese flagship supercomputer developed by the FLAGSHIP2020 project at RIKEN R-CCS. The porting and performance evaluation were done as part of this project, and XcalableMP is available to Fugaku users for improving the productivity and performance of parallel programming. The performance of XcalableMP on Fugaku is enhanced by the manycore processor and the new Tofu-D interconnect. We are now working on the next version, XcalableMP 2.0, which targets cutting-edge high-performance systems with manycore processors through multithreading and multitasking, integrating the PGAS model with synchronization models. We conclude this book with retrospectives and challenges for future PGAS models.

2021 ◽  
Vol 0 (0) ◽  
Author(s):  
Ying Ke ◽  
Qing Zheng ◽  
Faming Wang ◽  
Min Wang ◽  
Yi Wang

Abstract The design of workwear has significant effects on worker performance. However, the current workwear for coal miners in Northern China fits poorly and provides poor thermal comfort. In this study, new workwear (NEW) for coal miners was developed with design features that provide better cold protection and movement comfort than a commonly worn workwear (CON). To evaluate the effectiveness of NEW, we conducted human trials using simulated work movements (i.e., sitting, shoveling, squatting, and crawling) in a climate chamber (10°C, 75% RH). Physiological measurements and perceptual responses were obtained. The results demonstrated that the local skin temperatures at the chest, scapula, thigh, and calf; the mean skin temperatures; and thermal comfort in NEW were significantly higher than those in CON. NEW also improved movement comfort. We conclude that NEW could well meet the cold-protection and mobility requirements.


1994 ◽  
Vol 03 (01) ◽  
pp. 97-125 ◽  
Author(s):  
ARVIND K. BANSAL

Associative computation is characterized by the intertwining of search by content and data-parallel computation. An algebra for associative computation is described. A compilation-based model and a novel abstract machine for associative logic programming are presented. The model uses loose coupling of the left-hand side of the program, treated as data, and the right-hand side of the program, treated as low-level code. This representation achieves efficiency through associative computation and data alignment during goal reduction and during execution of low-level abstract instructions. Data alignment reduces the overhead of data movement. Novel schemes are presented for associative manipulation of aliased uninstantiated variables and for data-parallel goal reduction in the presence of multiple occurrences of the same variables in a goal. The architecture, behavior, and performance evaluation of the model are presented.


2009 ◽  
Vol 2 (4) ◽  
pp. 81-91 ◽  
Author(s):  
Hashir Karim Kidwai ◽  
Fadi N. Sibai ◽  
Tamer Rabie

In the world of multi-core processors, the STI Cell Broadband Engine (BE) stands out as a heterogeneous 9-core processor with a PowerPC host processor (PPE) and 8 synergistic processing elements (SPEs). The Cell BE architecture is designed to improve upon conventional processors in graphics and related areas by integrating 8 computation engines, each with multiple execution units and large register sets, to achieve a high performance-per-area return. In this paper, the authors discuss the parallelization, implementation, and performance evaluation of an edge detection image processing application based on the Roberts edge detector on the Cell BE. They report the edge detection performance measured on a computer with one Cell processor and with varying numbers of SPEs enabled. These results are compared to the results obtained on the Cell's single PPE with all 8 SPEs disabled. The results indicate that edge detection performs 10 times faster on the Cell BE than on modern RISC processors.
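For readers unfamiliar with the Roberts edge detector, the following is a minimal Python sketch of the operator and of a row-wise split of the work. It is not the authors' Cell BE implementation: the function names `roberts_edges` and `roberts_edges_rows` are hypothetical, and the row-band split is only an assumption about how such work could be divided among workers such as SPEs.

```python
import math


def roberts_edges(image):
    """Return the Roberts cross gradient magnitude of a 2-D grayscale image.

    `image` is a list of equal-length rows of numbers. The result has one
    fewer row and one fewer column, because each output value is computed
    from a 2x2 neighbourhood.
    """
    rows, cols = len(image), len(image[0])
    out = []
    for y in range(rows - 1):
        out_row = []
        for x in range(cols - 1):
            # Differences along the two diagonals of the 2x2 window at (y, x).
            gx = image[y][x] - image[y + 1][x + 1]
            gy = image[y][x + 1] - image[y + 1][x]
            out_row.append(math.hypot(gx, gy))
        out.append(out_row)
    return out


def roberts_edges_rows(image, start, stop):
    """Compute output rows [start, stop) only (hypothetical unit of work).

    Output row y needs input rows y and y + 1, so the band passed on
    includes one extra input row at the bottom.
    """
    return roberts_edges(image[start:stop + 1])


if __name__ == "__main__":
    # A vertical step edge between columns 1 and 2.
    img = [[0, 0, 10, 10] for _ in range(4)]
    for row in roberts_edges(img):
        print([round(v, 2) for v in row])
```

Because each output pixel depends only on a fixed 2x2 window, bands of rows can be processed independently and concatenated, which is the property that makes the operator a natural fit for data-parallel hardware.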


Author(s):  
M. Narayana Moorthi ◽  
R. Manjula

Nowadays, the architecture of high-performance systems is improving, with more and more processor cores on a chip. This brings both benefits and challenges. The benefit is that more tasks run simultaneously, which reduces the running time of a program or application. The challenges include: what the maximum number of cores on a given chip is; how existing and future software will make use of all the cores; which parallel programming language to choose; what memory and cache coherence issues arise as the number of cores increases; how to solve power and performance issues; how the cores are connected and how they communicate to solve a single problem; and how to handle workload distribution and load balancing in terms of scalability. There is a practical limit to the speedup and scalability achievable with the number of cores on a chip, and this limit needs to be analyzed. This chapter therefore focuses on an introduction to and overview of parallel computing and on the challenges faced in enhancing performance and scalability in parallel computing architectures.
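One common way to reason about a practical limit on speedup is Amdahl's law, which the abstract does not name; it is used here only as an illustrative assumption. The sketch below, with the hypothetical function name `amdahl_speedup`, computes the ideal speedup when a fixed fraction of a program can run in parallel.

```python
def amdahl_speedup(parallel_fraction, cores):
    """Ideal speedup under Amdahl's law.

    parallel_fraction: share of the run time that can be parallelized (0..1).
    cores: number of cores used for the parallel part (>= 1).
    """
    if not 0.0 <= parallel_fraction <= 1.0:
        raise ValueError("parallel_fraction must be between 0 and 1")
    if cores < 1:
        raise ValueError("cores must be at least 1")
    serial_fraction = 1.0 - parallel_fraction
    # The serial part is unaffected; the parallel part is divided among cores.
    return 1.0 / (serial_fraction + parallel_fraction / cores)


if __name__ == "__main__":
    for n in (1, 2, 8, 64, 1024):
        print(n, round(amdahl_speedup(0.9, n), 2))
```

Under this model, a program that is 90% parallel can never run more than 10 times faster, however many cores are added, because the serial 10% remains.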


Processes ◽  
2020 ◽  
Vol 8 (5) ◽  
pp. 607
Author(s):  
Omer Mohamed Abubaker Al-hotmani ◽  
Mudhar Abdul Alwahab Al-Obaidi ◽  
Yakubu Mandafiya John ◽  
Raj Patel ◽  
Iqbal Mohammed Mujtaba

In recent times, two or more desalination processes have been combined to form integrated systems, which have been widely used to overcome the limitations of individual processes and to produce high-performance systems. In this regard, a simple integrated system of the Multi Effect Distillation (MED)/Thermal Vapour Compression (TVC) and Permeate Reprocessing Reverse Osmosis (PRRO) process was developed by the same authors, and its validity was confirmed in a comparison study against other developed configurations. However, this design has a considerable retentate flowrate and low productivity. To resolve this issue, two novel designs of MED and double reverse osmosis (RO) processes, including Permeate and Retentate Reprocessing designs (PRRO and RRRO), are developed and modelled in this paper. To systematically assess the consistency of the presented designs, the performance indicators of the novel designs are compared against previous simple designs of MED and PRRO processes at a specified set of operating conditions. Results show the superiority of the integrated MED and double permeate reprocessing design. Specifically, it achieves both economic and environmental advantages: total productivity is increased by around 9% and total retentate flowrate (disposed to water bodies) is reduced by 5%, with marginally reduced energy consumption.

