Improvements in utilisation of the Czech national HPC center

Michal Svatoš, Jiří Chudoba, Petr Vokáč, G.A. Stewart, W. Kamleh, C. Doglioni, D. Kim, P. Jackson, L. Silvestris
2020 EPJ Web of Conferences  
The distributed computing system of the ATLAS experiment at LHC is allowed to opportunistically use resources at the Czech national HPC center IT4Innovations in Ostrava. The jobs are submitted via an ARC Compute Element (ARC-CE) installed at the grid site in Prague. Scripts and input files are shared between the ARC-CE and a shared file system located at the HPC centre via sshfs. This basic submission system has worked there since the end of 2017. Several improvements were made to increase the
more » ... de to increase the amount of resource that ATLAS can use. The most significant change was the migration of the submission system to enable pre-emptable jobs, to adapt to the HPC management's decision to start pre-empting opportunistic jobs. Another improvement of the submission system was related to the sshfs connection which seemed to be a limiting factor of the system. Now, the submission system consists of several ARC-CE machines. Also, various parameters of sshfs were tested in an attempt to increase throughput. As a result of the improvements, the utilisation of the Czech national HPC center by the ATLAS distributed computing increased.
doi:10.1051/epjconf/202024509010 fatcat:ea5hrjr5z5b7heibj65rjsd53y