3 Hits in 1.3 sec

RUMR: robust scheduling for divisible workloads

Yang Yang, H. Casanova
High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on  
In essence, we combine ideas from multi-round divisible workload scheduling, for performance, and from factoring-based scheduling, for robustness.  ...  Our goal is to develop a practical divisible workload scheduling strategy.  ...  Conclusion In this paper we have presented RUMR (Robust Uniform Multi-Round), a scheduling algorithm for minimizing the makespan of divisible workload applications under uncertainties of resource performance  ... 
doi:10.1109/hpdc.2003.1210021 dblp:conf/hpdc/YangC03 fatcat:oku6yejmezbtxexszv26bse4za

rDLB: A Novel Approach for Robust Dynamic Load Balancing of Scientific Applications with Parallel Independent Tasks [article]

Ali Mohammed, Aurelien Cavelan, Florina M. Ciorba
2019 arXiv   pre-print
In this work, a robust dynamic load balancing (rDLB) approach is proposed for the robust self scheduling of independent tasks.  ...  Most self-scheduling approaches do not consider fault-tolerant scheduling or depend on failure or perturbation detection and react by rescheduling failed tasks.  ...  RUMR [32] was introduced as a robust scheduling method to unpredictable task execution time and unpredictable communication time.  ... 
arXiv:1905.08073v3 fatcat:nfnobykxjzg7xegobihxxys2yi

SimAS: A Simulation-assisted Approach for the Scheduling Algorithm Selection under Perturbations [article]

Ali Mohammed, Florina M. Ciorba
2019 arXiv   pre-print
Dynamic loop self-scheduling (DLS) techniques are used to parallelize and to balance the load during the execution of such applications.  ...  A Simulator-assisted scheduling (SimAS) is introduced as a new control-theoretic-inspired approach to dynamically select DLS techniques that improve the performance of applications executing on heterogeneous  ...  Casanova, “Rumr: Robust Scheduling for Divisible Workloads,” in Pro- ceedings of the 12th IEEE International Symposium on High Performance Distributed Computing, pp. 114–123, 2003. [24] N.  ... 
arXiv:1912.02050v1 fatcat:brqtlxbsxfgpthnf6lfvoy2tzm