Software-based fault-tolerant routing algorithm in multidimensional networks

F. Safaei, M. Rezazad, A. Khonsari, M. Fathy, M. Ould-Khaoua, N. Alzeidi
2006 Proceedings 20th IEEE International Parallel & Distributed Processing Symposium  
Massively parallel computing systems are being built with hundreds or thousands of components such as nodes, links, memories, and connectors. The failure of a component in such systems will not only reduce the computational power but also alter the network's topology. The Software-Based fault-tolerant routing algorithm is a popular routing to achieve faulttolerance capability in networks. This algorithm is initially proposed only for two dimensional networks [1] . Since, higher dimensional
more » ... rks have been widely employed in many contemporary massively parallel systems; this paper proposes an approach to extend this routing scheme to these indispensable higher dimensional networks. Deadlock and livelock freedom and the performance of presented algorithm, have been investigated for networks with different dimensionality and various fault regions. Furthermore, performance results have been presented through simulation experiments.
doi:10.1109/ipdps.2006.1639644 dblp:conf/ipps/SafaeiRKFOA06 fatcat:47x7p32zqfa2jmjtsg7ouhmaca