Design and analysis of the Alliance/University of New Mexico Roadrunner Linux SMP SuperCluster

D.A. Bader, A.B. Maccabe, J.R. Mastaler, J.K. McIver, P.A. Kovatch
1999 ICWC 99. IEEE Computer Society International Workshop on Cluster Computing  
This paper will discuss high performance clustering from a series of critical topics: architectural design, system software infrastructure, and programming environment. This will be accomplished through an overview of a large scale, high performance SuperCluster (named Roadrunner) in production at The University of New Mexico (UNM) Albuquerque High Performance Computing Center (AHPCC). This SuperCluster, sponsored by the U.S. National Science Foundation (NSF) and the National Computational
more » ... ce Alliance (NCSA), is based almost entirely on freelyavailable, vendor-independent software. For example, its operating system (Linux), job scheduler (PBS), compilers (GNU/EGCS), and parallel programming libraries (MPI). The Globus toolkit, also available for this platform, allows high performance distributed computing applications to use geographically distributed resources such as this Su-perCluster. In addition to describing the design and analysis of the Roadrunner SuperCluster, we provide experimental analyses from grand challenge applications and future directions for SuperClusters. Please see www.alliance.unm.edu for further information.
doi:10.1109/iwcc.1999.810804 dblp:conf/iwcc/BaderMMMK99 fatcat:dqb6wty4xvaafm4t3fixk676lq