The Data Ocean Project

Martin Barisits, Fernando Barreiro, Thomas Beermann, Karan Bhatia, Kaushik De, Arnaud Dubreuil, Johannes Elmsheuser, Alexei Klimentov, Mario Lassnig, Peter Love, Tadashi Maeno, Andrea Manzi (+10 others)
2019 EPJ Web of Conferences  
Transparent use of commercial cloud resources for scientific experiments is a hard problem. In this article, we describe the first steps of the Data Ocean R&D collaboration between the high-energy physics experiment ATLAS and Google Cloud Platform, which aims to allow seamless use of Google Compute Engine and Google Cloud Storage for physics analysis. We start by describing the three preliminary use cases that were identified at the beginning of the project. The following sections then detail work done in the data management system Rucio and the workflow management systems PanDA and Harvester to interface Google Cloud Platform with the ATLAS distributed computing environment, and show the results of the integration tests. Afterwards, we describe the setup and results from a full ATLAS user analysis that was executed natively on Google Cloud Platform, and give estimates of projected costs. We close with a summary and an outlook on future work.
doi:10.1051/epjconf/201921404020 fatcat:bjvgjrgczrezjk247kylpc4v3e