Design and evaluation of fault-tolerant shared file system for cluster systems

Author(s):  
S. Sumimoto
1974 ◽  
Author(s):  
Warren Juran ◽  
Charles Moore ◽  
Carl Orndorff ◽  
Larry Rice
Keyword(s):  

1992 ◽  
Author(s):  
Glenn Meredith ◽  
Kenneth R. Anderson ◽  
Emil Wirsz ◽  
Fred W. Prior ◽  
Dennis L. Wilson

1991 ◽  
Vol 30 (1) ◽  
pp. 52-71
Author(s):  
R. L. Stone ◽  
T. S. Nettleship ◽  
J. Curtiss
Keyword(s):  

2008 ◽  
Vol 1 (1) ◽  
pp. 574-585 ◽  
Author(s):  
YongChul Kwon ◽  
Magdalena Balazinska ◽  
Albert Greenberg

2020 ◽  
Vol 245 ◽  
pp. 09010
Author(s):  
Michal Svatoš ◽  
Jiří Chudoba ◽  
Petr Vokáč

The distributed computing system of the ATLAS experiment at LHC is allowed to opportunistically use resources at the Czech national HPC center IT4Innovations in Ostrava. The jobs are submitted via an ARC Compute Element (ARC-CE) installed at the grid site in Prague. Scripts and input files are shared between the ARC-CE and a shared file system located at the HPC centre via sshfs. This basic submission system has worked there since the end of 2017. Several improvements were made to increase the amount of resource that ATLAS can use. The most significant change was the migration of the submission system to enable pre-emptable jobs, to adapt to the HPC management’s decision to start pre-empting opportunistic jobs. Another improvement of the submission system was related to the sshfs connection which seemed to be a limiting factor of the system. Now, the submission system consists of several ARC-CE machines. Also, various parameters of sshfs were tested in an attempt to increase throughput. As a result of the improvements, the utilisation of the Czech national HPC center by the ATLAS distributed computing increased.


1989 ◽  
Vol 22 (15) ◽  
pp. 55-61
Author(s):  
R. Mori ◽  
M. Nozaki ◽  
E. Ishibashi

Sign in / Sign up

Export Citation Format

Share Document