HALO 1.0: A Hardware-agnostic Accelerator Orchestration Framework for Enabling Hardware-agnostic Programming with True Performance Portability for Heterogeneous HPC [article]

Michael Riera, Erfan Bank Tavakoli, Masudul Hassan Quraishi, Fengbo Ren
arXiv pre-print, 2021-10-19
This paper presents HALO 1.0, an open-ended, extensible multi-agent software framework that implements a set of proposed hardware-agnostic accelerator orchestration (HALO) principles. HALO implements a novel compute-centric message passing interface (C^2MPI) specification that enables performance-portable execution of a hardware-agnostic host application across heterogeneous accelerators. Experimental results from evaluating eight widely used HPC subroutines on Intel Xeon E5-2620 CPUs, Intel Arria 10 GX FPGAs, and NVIDIA GeForce RTX 2080 Ti GPUs show that HALO 1.0 gives host programs a unified control flow across all of these computing devices with a consistently top performance portability score, up to five orders of magnitude higher than that of the OpenCL-based solution.
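
The abstract does not define the performance portability score. As a minimal sketch, assuming it resembles the commonly used harmonic-mean metric (efficiency averaged harmonically over all platforms, and zero if any platform is unsupported), the following shows why a single poorly performing device can lower a score by several orders of magnitude. The function name, platform labels, and efficiency values are hypothetical and are not taken from the paper.

```python
def performance_portability(efficiencies):
    """Harmonic mean of per-platform efficiencies, each expected in (0, 1].

    Assumed definition (not confirmed by the abstract): the score is 0.0
    if the application fails to run on any platform in the set.
    """
    if not efficiencies:
        raise ValueError("need at least one platform")
    values = list(efficiencies.values())
    # An unsupported platform is marked with None or a non-positive efficiency.
    if any(e is None or e <= 0 for e in values):
        return 0.0
    return len(values) / sum(1.0 / e for e in values)


# Hypothetical efficiencies for illustration only, not results from the paper.
balanced = {"cpu": 0.9, "fpga": 0.8, "gpu": 0.9}
lopsided = {"cpu": 0.9, "fpga": 1e-5, "gpu": 0.9}

print(performance_portability(balanced))
print(performance_portability(lopsided))
```

Because the harmonic mean is dominated by its smallest term, the second score is set almost entirely by the one slow device, which is the kind of gap an orders-of-magnitude comparison between frameworks would reflect under this assumed definition.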
arXiv:2011.10896v4 (https://arxiv.org/abs/2011.10896v4), fatcat:3z3g4gbhozhh5jpaaqgjmg6k44 (https://fatcat.wiki/release/3z3g4gbhozhh5jpaaqgjmg6k44)
