Challenges in Deeply Heterogeneous High Performance Systems

Giovanni Agosta, William Fornaciari, David Atienza, Ramon Canal, Alessandro Cilardo, Jose Flich, Carles Hernandez Luz, Michal Kulczewski, Giuseppe Massari, Rafael Tornero Gavila, Marina Zapater Sancho
2019 2019 22nd Euromicro Conference on Digital System Design (DSD)  
RECIPE (REliable power and time-ConstraInts-aware Predictive management of heterogeneous Exascale systems) is a recently started project funded within the H2020 FETHPC programme, which is expressly targeted  ...  architectures, HPC memory/interconnection infrastructures, thermal modelling, reliability, programming models, and timing analysis.  ...  Thermal modelling Efficient thermal management requires accurate knowledge about the thermal profile of the chip in both steady and transient states.  ... 
doi:10.1109/dsd.2019.00068 dblp:conf/dsd/AgostaFACCFHKMG19 fatcat:3zmdras2iraq3nmyw7gn2qaesu

Energy Efficiency in HPC Systems [chapter]

Ivan Rodero, Manish Parashar
2012 Energy-Efficient Distributed Computing Systems  
Description of proactive, component-based power management (Section 1.3) • Quantification of possible power savings through component-based power management (Section 1.4)  ...  0723594, by Department of Energy via the grant number DE-FG02-06ER54857, by The Extreme Scale Systems Center at ORNL and the Department of Defense, and by an IBM Faculty Award, and was conducted as part  ...  Acknowledgments The research presented in this work is supported in part by National Science Foundation (NSF) via grants numbers IIP 0758566, CCF-0833039, DMS-0835436, CNS 0426354, IIS 0430826, and CNS  ... 
doi:10.1002/9781118342015.ch3 fatcat:r3tlvuyynfh4fimncjw5r77mqa

Data Mining for Big Dataset-Related Thermal Analysis of High Performance Computing (HPC) Data Center [chapter]

Davide De Chiara, Marta Chinnici, Ah-Lian Kor
2020 Lecture Notes in Computer Science  
Greening of Data Centers could be achieved through energy savings in two significant areas, namely: compute systems and cooling systems.  ...  Servers' dissipated heat effects a strain on the cooling systems. Consequently, it is necessary to identify hotspots that frequently occur in the server zones.  ...  In [5], Parolini and colleagues propose a heat model; provide a brief overview of power and thermal efficiency from microprocessor micro-level to DC macro-level.  ... 
doi:10.1007/978-3-030-50436-6_27 fatcat:7siqpxvmyzaynpmxycemwnnnpm

Energy- and Heat-Aware HPC Benchmarks

Georges Da Costa, Mateusz Jarus, Thomas Zilio, Ariel Oleksiak
2013 2013 International Conference on Cloud and Green Computing  
for HPC systems.  ...  We also provide insight into several classical benchmarks and method to select an adequate and small number of benchmarks in order to provide a sensible and minimum set of energy- and thermal-aware benchmarks  ...  monitoring. 1) System monitoring: As applications are managed by the OS, all high level values are measured by it.  ... 
doi:10.1109/cgc.2013.75 dblp:conf/cgc/CostaJZO13 fatcat:zeasi7ey45agno4redizsr6eou

A survey of the research on power management techniques for high-performance systems

Yongpeng Liu, Hong Zhu
2010 Software, Practice & Experience  
In this paper, we first review the basic mechanisms that underlie power management techniques. Then we survey two fundamental techniques for power management: metrics and profiling.  ...  This paper surveys the research on power management techniques for high performance systems.  ...  Acknowledgement The authors would like to thank their colleagues in the Applied Formal Methods Research Group at the Department of Computing and Electronics of the Oxford Brookes University for valuable  ... 
doi:10.1002/spe.952 fatcat:64xblwxg2zc75ed6nztwzahde4

The RECIPE Approach to Challenges in Deeply Heterogeneous High Performance Systems

Giovanni Agosta, William Fornaciari, David Atienza, Ramon Canal, Alessandro Cilardo, José Flich Cardo, Carles Hernandez Luz, Michal Kulczewski, Giuseppe Massari, Rafael Tornero Gavilá, Marina Zapater
2020 Microprocessors and microsystems  
RECIPE (REliable power and time-ConstraInts-aware Predictive management of heterogeneous Exascale systems) is a recently started project funded within the H2020 FETHPC programme, which is expressly targeted  ...  resource management of the heterogeneous architectural components of the system, driven by estimates of the application latency and hardware reliability obtained respectively through timing analysis and  ...  power and time-ConstraInts-aware Predictive management of heterogeneous Exascale systems.  ... 
doi:10.1016/j.micpro.2020.103185 fatcat:kevagp2vdvf6hnus4qnxgxbnxq

Energy-Aware High-Performance Computing: Survey of State-of-the-Art Tools, Techniques, and Environments

Pawel Czarnul, Jerzy Proficz, Adam Krzywaniak
2019 Scientific Programming  
We discuss tools and APIs for energy/power management as well as tools and environments for prediction and/or simulation of energy/power consumption in modern HPC systems.  ...  The paper presents state of the art of energy-aware high-performance computing (HPC), in particular identification and classification of approaches by system and device types, optimization metrics, and  ...  Tools for Energy/Power Management in Modern HPC Systems Available tools for energy/power management can be considered in two categories: monitoring and controlling.  ... 
doi:10.1155/2019/8348791 fatcat:ib3dvjzg2bhhjnnklb4kaj2eqi

Energy-Efficient Thermal-Aware Autonomic Management of Virtualized HPC Cloud Infrastructure

Ivan Rodero, Hariharasudhan Viswanathan, Eun Kyung Lee, Marc Gamell, Dario Pompili, Manish Parashar
2012 Journal of Grid Computing  
However, maximizing energy efficiency and utilization of datacenter resources, and minimizing undesired thermal behavior while ensuring application performance and other Quality of Service (QoS) guarantees  ...  Virtualized datacenters and clouds are being increasingly considered for traditional High-Performance Computing (HPC) workloads that have typically targeted Grids and conventional HPC platforms.  ...  [41] studied application-centric aggressive power management of data centers resources for HPC workloads considering power management mechanisms and controls available at different levels and for different  ... 
doi:10.1007/s10723-012-9219-2 fatcat:zxxyef4dwzftjeaaqs2r5feb6u

Predictive Reliability and Fault Management in Exascale Systems

Ramon Canal, Carles Hernandez, Rafa Tornero, Alessandro Cilardo, Giuseppe Massari, Federico Reghenzani, William Fornaciari, Marina Zapater, David Atienza, Ariel Oleksiak, Wojciech Piątek, Jaume Abella
2020 ACM Computing Surveys  
In an HPC system, noisy input signals (1) and, specially, power noise can be managed by extra power regulators on the board. On the other side, ambient temperature (2) is controlled in the long run.  ...  INTRODUCTION AND MOTIVATION Exascale-grade HPC systems require dramatic improvements in energy efficiency to provide an unprecedented performance level within a strict power envelope.  ... 
doi:10.1145/3403956 fatcat:77xcpnevmnc5jfpj6ynhwdng3m

ANTAREX -- AutoTuning and Adaptivity appRoach for Energy Efficient eXascale HPC Systems

Cristina Silvano, Giovanni Agosta, Andrea Bartolini, Andrea Beccari, Luca Benini, Joao M. P. Cardoso, Carlo Cavazzoni, Radim Cmar, Jan Martinovic, Gianluca Palermo, Martin Palkovic, Erven Rohou (+2 others)
2015 2015 IEEE 18th International Conference on Computational Science and Engineering  
Computing (HPC) systems up to the Exascale level.  ...  The DSL approach will allow the definition of energy-efficiency, performance, and adaptivity strategies as well as their enforcement at runtime through application autotuning and resource and power management  ...  Self-Adaptivity & Autotuning The management of system adaptivity and autotuning is a key issue in HPC systems, where the computing infrastructure can easily evolve and the system needs to react promptly  ... 
doi:10.1109/cse.2015.58 dblp:conf/cse/SilvanoABBBCCCM15 fatcat:relhamsj2zgjdns43rfl7fimby

Minimizing Thermal Variation Across System Components

Kaicheng Zhang, Seda Ogrenci-Memik, Gokhan Memik, Kazutomo Yoshii, Rajesh Sankaran, Pete Beckman
2015 2015 IEEE International Parallel and Distributed Processing Symposium  
Elevated temperature levels reduce the reliability and the lifetime of the underlying hardware and increase their power consumption.  ...  On a two-node configuration, these models achieve 72.5% and 78.8% success rates in their predictions, respectively.  ...  We also gratefully acknowledge the computing resources provided and operated by the Joint Laboratory for System Evaluation (JLSE) at Argonne National Laboratory.  ... 
doi:10.1109/ipdps.2015.37 dblp:conf/ipps/ZhangMMYSB15 fatcat:i3um73sgsvct3ay5lkxely75hq

Coordinated energy management in heterogeneous processors

Indrani Paul, Vignesh Ravi, Srilatha Manne, Manish Arora, Sudhakar Yalamanchili
2013 Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis on - SC '13  
multi-node HPC system.  ...  We examine the intra-node CPU-GPU frequency sensitivity of HPC applications on tightly coupled CPU-GPU architectures as the first step in understanding power and performance optimization for a heterogeneous  ...  This research was supported in part by the Semiconductor Research Corporation under contract 2012-HJ-2318.  ... 
doi:10.1145/2503210.2503227 dblp:conf/sc/PaulRMAY13 fatcat:gqy377iu5bdmvm2e2zx64suao4

Energy and thermal models for simulation of workload and resource management in computing systems

Wojciech Piątek, Ariel Oleksiak, Georges Da Costa
2015 Simulation modelling practice and theory  
As the density of both computing power and heat is growing, proper energy and thermal management becomes crucial in terms of overall system efficiency.  ...  To cite this version : Piatek, Wojciech and Oleksiak, Ariel and Da Costa, Georges Energy and thermal models for simulation of workload and resource management in computing systems. (2015) Simulation Modelling  ...  The work presented in this paper was also supported by the COST Action IC1305, 'Network for Sustainable Ultrascale Computing (NESUS)'.  ... 
doi:10.1016/j.simpat.2015.04.008 fatcat:euhtiiumpncj7pf7rhku7fm2a4

Benefits in Relaxing the Power Capping Constraint

Daniele Cesarini, Andrea Bartolini, Luca Benini
2017 Proceedings of the 1st Workshop on AutotuniNg and aDaptivity AppRoaches for Energy efficient HPC Systems - ANDARE '17  
RAPL, which enforces the power constraint on a few ms time scale, fails on sharing power budget between more demanding and less demanding application phases.  ...  In this manuscript we evaluate the impact of HW power capping mechanisms on a real scientific application composed by parallel execution.  ...  ACKNOWLEDGMENTS Work supported by the EU FETHPC project ANTAREX (g.a. 671623), EU project ExaNoDe (g.a. 671578), and EU ERC Project MULTI-THERMAN (g.a. 291125).  ... 
doi:10.1145/3152821.3152878 dblp:conf/IEEEpact/CesariniBB17 fatcat:bwulgarainbxdnicrnx7yn5iwm

Self-Aware Thermal Management for High Performance Computing Processors

Andrea Bartolini, Roberto Diversi, Daniele Cesarini, Francesco Beneventi
2017 IEEE design & test  
To preserve a safe working temperature, state-of-the-art processors for this market segment integrates many cores on the same die and feature fine-grain power management and thermal management feedback  ...  In this paper, we present a self-aware thermal management framework for making each chip self-aware of its thermal and workload peculiarities and use it to maximize performance and thermal sustainability  ...  The Thermal/Power Management run-time operates at the node level and it is composed of two main components: the thermal-aware thread mapper and controller (TMC) and an energy-aware Message Passing Interface  ... 
doi:10.1109/mdat.2017.2774774 fatcat:smi6vjadvngzrkaaqtcsltlt4a