Efficient and scalable barrier synchronization for many-core CMPs

José L. Abellán, Juan Fernández, Manuel E. Acacio
2010 Proceedings of the 7th ACM international conference on Computing frontiers - CF '10  
We present in this work a novel hardware-based barrier mechanism for synchronization on many-core CMPs. In particular, we leverage global interconnection lines (G-lines) and S-CSMA technique, which have been used to overcome some limitations of a flow control mechanism (EVC) in the context of Networks-on-Chip, to develop a simple G-line-based network that operates independently of the main data network in order to carry out barrier synchronizations. Next, we evaluate our approach by running
more » ... ral applications on top of the Sim-PowerCMP performance simulator. Our method only takes 4 cycles to carry out the synchronization once all cores or threads have arrived at the barrier. Hence, we obtain much better performance results than softwarebased barrier implementations in terms of scalability and efficiency.
doi:10.1145/1787275.1787289 dblp:conf/cf/AbellanFA10 fatcat:hly3mh4uz5aphecx6o2gpnxfqq