Scalability tests of R-GMA based grid job monitoring system for CMS Monte Carlo data production

D. Bonacorsi, D. Colling, L. Field, S. Fisher, C. Grandi, P.R. Hobson, P. Kyberd, B. MacEvoy, J.J. Nebrensky, H. Tallini, S. Traylen
2003 2003 IEEE Nuclear Science Symposium. Conference Record (IEEE Cat. No.03CH37515)  
High-energy physics experiments, such as the compact muon solenoid (CMS) at the large hadron collider (LHC), have large-scale data processing computing requirements. The grid has been chosen as the solution. One important challenge when using the grid for large-scale data processing is the ability to monitor the large numbers of jobs that are being executed simultaneously at multiple remote sites. The relational grid monitoring architecture (R-GMA) is a monitoring and information management
more » ... ice for distributed resources based on the GMA of the Global Grid Forum. In this paper, we report on the first measurements of R-GMA as part of a monitoring architecture to be used for batch submission of multiple Monte Carlo simulation jobs running on a CMS-specific LHC computing grid test bed. Monitoring information was transferred in real time from remote execution nodes back to the submitting host and stored in a database. In scalability tests, the job submission rates supported by successive releases of R-GMA improved significantly, approaching that expected in full-scale production.
doi:10.1109/nssmic.2003.1352190 fatcat:kz7atyts5zbozbusfbh2ehacgy