High-Throughput Scientific Workflow Scheduling under Deadline Constraint in Clouds

Michelle M. Zhu, Fei Cao, Chase Q. Wu
2014 Journal of Communications  
Abstract-Cloud computing is a paradigm shift in service delivery that promises a leap in efficiency and flexibility in using computing resources. As cloud infrastructures are widely deployed around the globe, many data-and computeintensive scientific workflows have been moved from traditional highperformance computing platforms and grids to clouds. With the rapidly increasing number of cloud users in various science domains, it has become a critical task for the cloud service provider to
more » ... e provider to perform efficient job scheduling while still guaranteeing the workflow completion time as specified in the Service Level Agreement (SLA). Based on practical models for cloud utilization, we formulate a delay-constrained workflow optimization problem to maximize resource utilization for high system throughput and propose a two-step scheduling algorithm to minimize the cloud overhead under a user-specified execution time bound. Extensive simulation results illustrate that the proposed algorithm achieves lower computing overhead or higher resource utilization than existing methods under the execution time bound, and also significantly reduces the total workflow execution time by strategically selecting appropriate mapping nodes for prioritized modules.
doi:10.12720/jcm.9.4.312-321 fatcat:6ljhc3hsqrfkjkcisbnaqeee4e