Simulation-Based Performance Analysis and Tuning for a Two-Level Directly Connected System

Ehsan Totoni, Abhinav Bhatele, Eric J. Bohm, Nikhil Jain, Celso L. Mendes, Ryan M. Mokos, Gengbin Zheng, Laxmikant V. Kale
2011 IEEE 17th International Conference on Parallel and Distributed Systems
Hardware and software co-design is becoming increasingly important due to complexities in supercomputing architectures. Simulating applications before there is access to the real hardware can assist machine architects in making better design decisions that can optimize application performance. At the same time, the application and runtime can be optimized and tuned beforehand. BigSim is a simulation-based performance prediction framework designed for these purposes. It can be used to perform packet-level network simulations of parallel applications using existing parallel machines. In this paper, we demonstrate the utility of BigSim in analyzing and optimizing parallel application performance for future systems based on the PERCS network. We present simulation studies using benchmarks and real applications expected to run on future supercomputers. Future petascale systems will have more than 100,000 cores, and we present simulations at that scale.
doi:10.1109/icpads.2011.121 dblp:conf/icpads/TotoniBBJMMZK11 fatcat:hqmy4uhbmvcp3lizv422jjbg5y