Integrating Computing Resources on Multiple Grid-enabled Job Scheduling Systems Through a Grid RPC System

Y. Nakajima, M. Sato, Y. Aida, T. Boku, F. Cappello
2006 Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID'06)  
We present a framework for a parallel programming model based on remote procedure calls that bridges large-scale computing resource pools managed by multiple grid-enabled job scheduling systems. With this system, the user can exploit not only individual remote servers and clusters, but also computing resources provided by grid-enabled job scheduling systems located at different sites. This framework requires a Grid RPC system to decouple the computation in a remote node from the Grid RPC mechanism and uses document-based communication rather than connection-based communication. We implemented the proposed framework as an extension of the OmniRPC system, which is a Grid RPC system for parallel programming in a grid environment. We designed a general interface that makes it easy to adapt the OmniRPC system to various grid-enabled job scheduling systems, and applied the proposed system to several of them, including XtremWeb, CyberGRIP, Condor and Grid Engine. We show the preliminary performance of these implementations using a phylogenetic application. We found that the proposed system can achieve approximately the same performance as OmniRPC alone and can handle interruptions in worker programs on remote nodes.
doi:10.1109/ccgrid.2006.60 dblp:conf/ccgrid/NakajimaSABC06 fatcat:7ltcx7fq4fcuzmltfca44nnw7m