Temperature Aware Dynamic Workload Scheduling in Multisocket CPU Servers

Raid Ayoub, Krishnam Indukuri, Tajana Simunic Rosing
2011 IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems  
In this paper, we propose a multitier approach for significantly lowering the cooling costs associated with fan subsystems without compromising the system performance. Our technique manages the fan speed by intelligently allocating the workload at the core level as well as at the CPU socket level. At the core level we propose a proactive dynamic thermal management scheme. We introduce a new predictor that utilizes the band-limited property of the temperature frequency spectrum. A big advantage
more » ... f our predictor is that it does not require the costly training phase and still maintains high accuracy. At the socket level, we use control theoretic approach to develop a stable scheduler that reduces the cooling costs further by providing a better thermal distribution. Our thermal management scheme incorporates runtime workload characterization to perform efficient thermally aware scheduling. The experimental results show that our approach delivers an average cooling energy savings of 80% compared to the state of the art techniques. The reported results also show that our formal technique maintains stability while heuristic solutions fail in this aspect.
doi:10.1109/tcad.2011.2153852 fatcat:xelp5k7vkbeunhvah2meu6jype