Distributed data access and resource management in the D0 SAM system

I. Terekhov, R. Pordes, V. White, L. Lueking, L. Carpenter, H. Schellman, J. Trumbo, S. Veseli, M. Vranicar, S. White
Proceedings 10th IEEE International Symposium on High Performance Distributed Computing  
SAM (Sequential Access through Meta-data) is the data access and job management system for the D0 high energy physics experiment at Fermilab. The SAM system is being developed and used to handle the Petabyte-scale experiment data, accessed by hundreds of D0 collaborators scattered around the world. In this paper, we present solutions to some of the distributed data processing problems from the perspective of real experience dealing with mission-critical data. We concentrate on the distributed
more » ... sk caching, resource management and job control. The system has elements of the Grid Computing and has features applicable to data-intensive computing in general.
doi:10.1109/hpdc.2001.945179 dblp:conf/hpdc/TerekhovPWLLTVVWS01 fatcat:nog27wuxnvhsvj5mpdphfc2644