Achieving predictable performance through better memory controller placement in many-core CMPs

Dennis Abts, Natalie D. Enright Jerger, John Kim, Dan Gibson, Mikko H. Lipasti
2009 SIGARCH Computer Architecture News  
In the near term, Moore's law will continue to provide an increasing number of transistors and therefore an increasing number of on-chip cores. Limited pin bandwidth prevents the integration of a large number of memory controllers on-chip. With many cores, and few memory controllers, where to locate the memory controllers in the on-chip interconnection fabric becomes an important and as yet unexplored question. In this paper, we show how the location of the memory controllers can reduce
more » ... on (hot spots) in the on-chip fabric, as well as lower the variance in reference latency which provides for predictable performance of memory-intensive applications regardless of the processing core on which a thread is scheduled. We explore the design space of onchip fabrics to find optimal memory controller placement relative to different topologies (i.e. mesh and torus), routing algorithms, and workloads. * This research was supported in part by NSF grants under ..., an IBM PhD fellowship, as well as grant and equipment donations from IBM and Intel. The authors would like to thank the anonymous reviewers for thsir comments and suggestions for improving this work
doi:10.1145/1555815.1555810 fatcat:zwagwcxsdnhd3mnks6f7vibaqe