scholarly journals Adaptive control in roll-forward recovery for extreme scale multigrid

Author(s):  
Markus Huber ◽  
Ulrich Rüde ◽  
Barbara Wohlmuth

With the increasing number of compute components, failures in future exa-scale computer systems are expected to become more frequent. This motivates the study of novel resilience techniques. Here, we extend a recently proposed algorithm-based recovery method for multigrid iterations by introducing an adaptive control. After a fault, the healthy part of the system continues the iterative solution process, while the solution in the faulty domain is reconstructed by an asynchronous online recovery. The computations in both the faulty and the healthy subdomains must be coordinated in a sensitive way, in particular, both under- and over-solving must be avoided. Both of these waste computational resources and will therefore increase the overall time-to-solution. To control the local recovery and guarantee an optimal recoupling, we introduce a stopping criterion based on a mathematical error estimator. It involves hierarchically weighted sums of residuals within the context of uniformly refined meshes and is well-suited in the context of parallel high-performance computing. The recoupling process is steered by local contributions of the error estimator before the fault. Failure scenarios when solving up to 6.9 × 1011 unknowns on more than 245,766 parallel processes will be reported on a state-of-the-art peta-scale supercomputer demonstrating the robustness of the method.

2021 ◽  
Vol 16 (1) ◽  
Author(s):  
Ting Zhang ◽  
Shibin Li

AbstractIn this manuscript, the inorganic perovskite CsPbI2Br and CsPbIBr2 are investigated as photoactive materials that offer higher stability than the organometal trihalide perovskite materials. The fabrication methods allow anti-solvent processing the CsPbIxBr3−x films, overcoming the poor film quality that always occur in a single-step solution process. The introduced diethyl ether in spin-coating process is demonstrated to be successful, and the effects of the anti-solvent on film quality are studied. The devices fabricated using the methods achieve high-performance, self-powered and the stabilized photodetectors show fast response speed. The results illustrate a great potential of all-inorganic CsPbIxBr3−x perovskites in visible photodetection and provide an effective way to achieve high performance devices with self-powered capability.


2015 ◽  
Vol 3 (38) ◽  
pp. 19294-19298 ◽  
Author(s):  
Xichang Bao ◽  
Qianqian Zhu ◽  
Meng Qiu ◽  
Ailing Yang ◽  
Yujin Wang ◽  
...  

High-quality CH3NH3PbI3 perovskite films were directly prepared on simple treated ITO glass in air under a relative humidity of lower than 30%.


1990 ◽  
Vol 27 (1) ◽  
pp. 145-151 ◽  
Author(s):  
R. N. Chowdhury ◽  
S. Zhang

This note is concerned with the multiplicity of solutions for the factor of safety that may be obtained on the basis of the method of slices. Discontinuities in the function for the factor of safety are discussed and the reasons for false convergence in any iterative solution process are explored, with particular reference to the well-known Bishop simplified method (circular slip surfaces) and Janbu simplified or generalized method (slip surfaces of arbitrary shape). The note emphasizes that both the solution method and the method of searching for the critical slip surface must be considered in assessing the potential for numerical difficulties and false convergence. Direct search methods for optimization (e.g., the simplex reflection method) appear to be superior to the grid search or repeated trial methods in this respect. To avoid false convergence, the initially assumed value of factor of safety F0 should be greater than β1(=−tan α1 tan [Formula: see text]) where α1 and [Formula: see text] are respectively the base inclination and internal friction angle of the first slice near the toe of a slope, the slice with the largest negative reverse inclination. A value of F0 = 1 + β1, is recommended on the basis of experience. If there is no slice with a negative slope for any of the slip surfaces generated in the automatic, search process, then any positive value of F0 will lead to true convergence for F. It is necessary to emphasize that no slip surface needs to be rejected for computational reasons except for Sarma's methods and similarly no artificial changes need to be made to the value of [Formula: see text] except for Sarma's methods. Key words: slope stability, convergence, limit equilibrium, analysis, optimization, slip surfaces, geological discontinuity, simplex reflection technique.


2021 ◽  
Author(s):  
Emilie Macherel ◽  
Yuri Podladchikov ◽  
Ludovic Räss ◽  
Stefan M. Schmalholz

<p>Power-law viscous flow describes well the first-order features of long-term lithosphere deformation. Due to the ellipticity of the Earth, the lithosphere is mechanically analogous to a shell, characterized by a double curvature. The mechanical characteristics of a shell are fundamentally different to the characteristics of plates, having no curvature in their undeformed state. The systematic quantification of the magnitude and the spatiotemporal distribution of strain, strain-rate and stress inside a deforming lithospheric shell is thus of major importance: stress is for example a key physical quantity that controls geodynamic processes such as metamorphic reactions, decompression melting, lithospheric flexure, subduction initiation or earthquakes.</p><p>Stress calculations in a geometrically and mechanically heterogeneous 3-D lithospheric shell require high-resolution and high-performance computing. The pseudo-transient finite difference (PTFD) method recently enabled efficient simulations of high-resolution 3-D deformation processes, implementing an iterative implicit solution strategy of the governing equations for power-law viscous flow. Main challenges for the PTFD method is to guarantee convergence, minimize the required iteration count and speed-up the iterations.</p><p>Here, we present PTFD simulations for simple mechanically heterogeneous (weak circular inclusion) incompressible 2-D power-law viscous flow in cartesian and cylindrical coordinates. The flow laws employ a pseudo-viscoelastic behavior to optimize the iterative solution by exploiting the fundamental characteristics of viscoelastic wave propagation.</p><p>The developed PTFD algorithm executes in parallel on CPUs and GPUs. The development was done in Matlab (mathworks.com), then translated into the Julia language (julialang.org), and finally made compatible for parallel GPU architectures using the ParallelStencil.jl package (https://github.com/omlins/ParallelStencil.jl). We may unveil preliminary results for 3-D spherical configurations including gravity-controlled lithospheric stress distributions around continental plateaus.</p>


2017 ◽  
Vol 5 (35) ◽  
pp. 9138-9145 ◽  
Author(s):  
Zhaobing Tang ◽  
Jie Lin ◽  
Lishuang Wang ◽  
Ying Lv ◽  
Yongsheng Hu ◽  
...  

High performance top-emitting green quantum dot light-emitting diodes have been developed based on an all-solution process and with a bottom Al anode.


Author(s):  
Simon McIntosh–Smith ◽  
Rob Hunt ◽  
James Price ◽  
Alex Warwick Vesztrocy

High-performance computing systems continue to increase in size in the quest for ever higher performance. The resulting increased electronic component count, coupled with the decrease in feature sizes of the silicon manufacturing processes used to build these components, may result in future exascale systems being more susceptible to soft errors caused by cosmic radiation than in current high-performance computing systems. Through the use of techniques such as hardware-based error-correcting codes and checkpoint-restart, many of these faults can be mitigated at the cost of increased hardware overhead, run-time, and energy consumption that can be as much as 10–20%. Some predictions expect these overheads to continue to grow over time. For extreme scale systems, these overheads will represent megawatts of power consumption and millions of dollars of additional hardware costs, which could potentially be avoided with more sophisticated fault-tolerance techniques. In this paper we present new software-based fault tolerance techniques that can be applied to one of the most important classes of software in high-performance computing: iterative sparse matrix solvers. Our new techniques enables us to exploit knowledge of the structure of sparse matrices in such a way as to improve the performance, energy efficiency, and fault tolerance of the overall solution.


2019 ◽  
Vol 16 (3) ◽  
pp. 117-123
Author(s):  
Tsung-Ching Huang ◽  
Ting Lei ◽  
Leilai Shao ◽  
Sridhar Sivapurapu ◽  
Madhavan Swaminathan ◽  
...  

Abstract High-performance low-cost flexible hybrid electronics (FHE) are desirable for applications such as internet of things and wearable electronics. Carbon nanotube (CNT) thin-film transistor (TFT) is a promising candidate for high-performance FHE because of its high carrier mobility, superior mechanical flexibility, and material compatibility with low-cost printing and solution processes. Flexible sensors and peripheral CNT-TFT circuits, such as decoders, drivers, and sense amplifiers, can be printed and hybrid-integrated with thinned (<50 μm) silicon chips on soft, thin, and flexible substrates for a wide range of applications, from flexible displays to wearable medical devices. Here, we report (1) a process design kit (PDK) to enable FHE design automation for large-scale FHE circuits and (2) solution process-proven intellectual property blocks for TFT circuits design, including Pseudo-Complementary Metal-Oxide-Semiconductor (Pseudo-CMOS) flexible digital logic and analog amplifiers. The FHE-PDK is fully compatible with popular silicon design tools for design and simulation of hybrid-integrated flexible circuits.


2019 ◽  
Vol 9 (21) ◽  
pp. 4541
Author(s):  
Syed Asif Raza Shah ◽  
Seo-Young Noh

Large scientific experimental facilities currently are generating a tremendous amount of data. In recent years, the significant growth of scientific data analysis has been observed across scientific research centers. Scientific experimental facilities are producing an unprecedented amount of data and facing new challenges to transfer the large data sets across multi continents. In particular, these days the data transfer is playing an important role in new scientific discoveries. The performance of distributed scientific environment is highly dependent on high-performance, adaptive, and robust network service infrastructures. To support large scale data transfer for extreme-scale distributed science, there is the need of high performance, scalable, end-to-end, and programmable networks that enable scientific applications to use the networks efficiently. We worked on the AmoebaNet solution to address the problems of a dynamic programmable network for bulk data transfer in extreme-scale distributed science environments. A major goal of the AmoebaNet project is to apply software-defined networking (SDN) technology to provide “Application-aware” network to facilitate bulk data transfer. We have prototyped AmoebaNet’s SDN-enabled network service that allows application to dynamically program the networks at run-time for bulk data transfers. In this paper, we evaluated AmoebaNet solution with real world test cases and shown that how it efficiently and dynamically can use the networks for bulk data transfer in large-scale scientific environments.


Sign in / Sign up

Export Citation Format

Share Document