Dynamic Distribution of High-Rate Data Processing from CERN to Remote HPC Data Centers

T. Boccali, D. Cameron, N. Cardo, D. Conciatore, A. Di Girolamo, G. Dissertori, P. Fernandez, A. Filipcic, M. Gila, C. Grab, J. Elmsheuser, V. Jankauskas (+9 others)
2021 Computing and Software for Big Science  
AbstractThe prompt reconstruction of the data recorded from the Large Hadron Collider (LHC) detectors has always been addressed by dedicated resources at the CERN Tier-0. Such workloads come in spikes due to the nature of the operation of the accelerator and in special high load occasions experiments have commissioned methods to distribute (spill-over) a fraction of the load to sites outside CERN. The present work demonstrates a new way of supporting the Tier-0 environment by provisioning
more » ... ces elastically for such spilled-over workflows onto the Piz Daint Supercomputer at CSCS. This is implemented using containers, tuning the existing batch scheduler and reinforcing the scratch file system, while still using standard Grid middleware. ATLAS, CMS and CSCS have jointly run selected prompt data reconstruction on up to several thousand cores on Piz Daint into a shared environment, thereby probing the viability of the CSCS high performance computer site as on demand extension of the CERN Tier-0, which could play a role in addressing the future LHC computing challenges for the high luminosity LHC.
doi:10.1007/s41781-020-00052-w fatcat:xjkjc7ps5ncvhkvvjnv6d2svni