Fine-Grained Treatment to Synchronizations in GPU-to-CPU Translation [chapter]

Ziyu Guo, Xipeng Shen
2013 Lecture Notes in Computer Science  
GPU-to-CPU translation may extend Graphics Processing Units (GPU) programs executions to multi-/many-core CPUs, and hence enable cross-device task migration and promote whole-system synergy. This paper describes some of our findings in treatment to GPU synchronizations during the translation process. We show that careful dependence analysis may allow a fine-grained treatment to synchronizations and reveal redundant computation at the instruction-instance level. Based on thread-level dependence
more » ... raphs, we present a method to enable such fine-grained treatment automatically. Experiments demonstrate that compared to existing translations, the new approach can yield speedup of a factor of integers.
doi:10.1007/978-3-642-36036-7_12 fatcat:n63sfbw3lzcw3clo2r2g3pzdyi