Stork: making data placement a first class citizen in the grid

T. Kosar, M. Livny
2004 24th International Conference on Distributed Computing Systems, 2004. Proceedings.  
Todays scientific applications have huge data requirements which continue to increase drastically every year. These data are generally accessed by many users from all across the the globe. This implies a major necessity to move huge amounts of data around wide area networks to complete the computation cycle, which brings with it the problem of efficient and reliable data placement. The current approach to solve this problem of data placement is either doing it manually, or employing simple
more » ... ploying simple scripts which do not have any automation or fault tolerance capabilities. Our goal is to make data placement activities first class citizens in the Grid just like the computational jobs. They will be queued, scheduled, monitored, managed, and even check-pointed. More importantly, it will be made sure that they complete successfully and without any human interaction. We also believe that data placement jobs should be treated differently from computational jobs, since they may have different semantics and different characteristics. For this purpose, we have developed Stork, a scheduler for data placement activities in the Grid.
doi:10.1109/icdcs.2004.1281599 dblp:conf/icdcs/KosarL04 fatcat:shw3dt5ddrgflehwawvvvru76e