Improving Multiprocessor Performance with Coarse-Grain Coherence Tracking

J.F. Cantin, M.H. Lipasti, J.E. Smith
32nd International Symposium on Computer Architecture (ISCA'05)  
To maintain coherence in conventional shared-memory multiprocessor systems, processors first check other processors' caches before obtaining data from memory. This coherence checking adds latency to memory requests and leads to large amounts of interconnect traffic in broadcastbased systems. Our results for a set of commercial, scientific and multiprogrammed workloads show that on average 67% (and up to 94%) of broadcasts are unnecessary. Coarse-Grain Coherence Tracking is a new technique that
more » ... new technique that supplements a conventional coherence mechanism and optimizes the performance of coherence enforcement. The Coarse-Grain Coherence mechanism monitors the coherence status of large regions of memory, and uses that information to avoid unnecessary broadcasts. Coarse-Grain Coherence Tracking is shown to eliminate 55-97% of the unnecessary broadcasts, and improve performance by 8.8% on average (and up to 21.7%).
doi:10.1109/isca.2005.31 dblp:conf/isca/CantinLS05 fatcat:x5574s4kxba3bab65cx2l5i4xa