Challenges in Deeply Heterogeneous High Performance Systems

Giovanni Agosta, William Fornaciari, David Atienza, Ramon Canal, Alessandro Cilardo, Jose Flich, Carles Hernandez Luz, Michal Kulczewski, Giuseppe Massari, Rafael Tornero Gavila, Marina Zapater Sancho
2019 22nd Euromicro Conference on Digital System Design (DSD)
RECIPE (REliable power and time-ConstraInts-aware Predictive management of heterogeneous Exascale systems) is a recently started project funded within the H2020 FETHPC programme, which is expressly targeted at exploring new High-Performance Computing (HPC) technologies. RECIPE aims at introducing a hierarchical runtime resource management infrastructure to optimize energy efficiency and minimize the occurrence of thermal hotspots, while enforcing the time constraints imposed by the applications and ensuring reliability for both time-critical and throughput-oriented computations that run on deeply heterogeneous accelerator-based systems. This paper presents a detailed overview of RECIPE, identifying the fundamental challenges as well as the key innovations addressed by the project, which span run-time management, heterogeneous computing architectures, HPC memory/interconnection infrastructures, thermal modelling, reliability, programming models, and timing analysis. For each of these areas, the paper describes the relevant state of the art as well as the specific actions that the project will take to effectively address the identified technological challenges.
doi:10.1109/dsd.2019.00068 dblp:conf/dsd/AgostaFACCFHKMG19 fatcat:3zmdras2iraq3nmyw7gn2qaesu