A multi-agent architecture for scheduling of high performance services in a GPU cluster

Joel Antonio Trejo-Sánchez, José Luis López-Martínez, José Octavio Gutiérrez García, Julio Cesar Ramírez Pacheco, Daniel Fajardo-Delgado
2018 International Journal of Combinatorial Optimization Problems and Informatics  
Nowadays, clusters containing multiple GPU nodes are widely used to execute high-performance computing applications. Diverse disciplines use these clusters to improve the performance of several services that consume high computational resources. The challenge of executing high-performance computing applications becomes harder when the applications are executed concurrently and each one of them may demand multiple GPU nodes for different periods of time. To tackle this challenge, we propose a multi-agent architecture for scheduling multiple services in a heterogeneous GPU cluster. We provide simulation results of our agent-based system utilizing three commonly used scheduling heuristics for several configuration settings.
dblp:journals/ijcopi/Trejo-SanchezLG18 fatcat:fohnvjp4nzf7vb2qxuqkcbi7hi