A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2008; you can also visit the original URL.
The file type is application/pdf
.
Reducing the latency of L2 misses in shared-memory multiprocessors through on-chip directory integration
Proceedings 10th Euromicro Workshop on Parallel, Distributed and Network-based Processing
Recent technology improvements allow multiprocessor designers to put some key components inside the processor chip, such as the memory controller and the network interface. In this work we exploit such integration scale, presenting a new three-level directory architecture aimed at reducing the long L2 miss latencies and the memory overhead that characterize cc-NUMA machines and limit their scalability. The proposed architecture is based on the integration into the processor chip of the
doi:10.1109/empdp.2002.994312
dblp:conf/pdp/AcacioGGD02
fatcat:zvkipbonzvdcjitwag67545nq4