WorkQ: A many-core producer/consumer execution model applied to PGAS computations
2014
2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)
Partitioned global address space (PGAS) applications, such as the Tensor Contraction Engine (TCE) in NWChem, often apply a one-process-per-core mapping in which each process iterates through the following work-processing cycle: (1) determine a work-item dynamically, (2) get data via one-sided operations on remote blocks, (3) perform computation on the data locally, (4) put (or accumulate) resultant data into an appropriate remote location, and (5) repeat the cycle. However, this simple flow of
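As an illustration only, the five-step work-processing cycle listed in the abstract can be sketched in a few lines of Python. This is not NWChem, TCE, or Global Arrays code: the names `SimulatedGlobalArray`, `SharedCounter`, `worker`, and `run_cycle` are hypothetical, threads stand in for the one-process-per-core mapping, a lock-protected dictionary stands in for remote blocks, and doubling each element stands in for the real local computation.

```python
import threading


class SimulatedGlobalArray:
    """Hypothetical stand-in for a PGAS array: named blocks behind a lock."""

    def __init__(self, blocks):
        self._blocks = {key: list(values) for key, values in blocks.items()}
        self._lock = threading.Lock()

    def get(self, key):
        # Mimics a one-sided get: return a private copy of the block.
        with self._lock:
            return list(self._blocks[key])

    def accumulate(self, key, values):
        # Mimics a one-sided accumulate: element-wise add into the block.
        with self._lock:
            block = self._blocks[key]
            for i, value in enumerate(values):
                block[i] += value


class SharedCounter:
    """Hypothetical shared counter used to hand out work-items dynamically."""

    def __init__(self):
        self._next = 0
        self._lock = threading.Lock()

    def fetch_and_add(self):
        with self._lock:
            value = self._next
            self._next += 1
            return value


def worker(counter, tasks, inputs, outputs):
    while True:
        task_id = counter.fetch_and_add()   # (1) determine a work-item
        if task_id >= len(tasks):
            return                          # no work left
        src, dst = tasks[task_id]
        data = inputs.get(src)              # (2) get the input block
        result = [2 * x for x in data]      # (3) compute locally (placeholder)
        outputs.accumulate(dst, result)     # (4) accumulate the result
        # (5) repeat the cycle


def run_cycle(num_workers=4):
    inputs = SimulatedGlobalArray({k: [k, k + 1] for k in range(6)})
    outputs = SimulatedGlobalArray({0: [0, 0], 1: [0, 0]})
    tasks = [(k, k % 2) for k in range(6)]  # (input block, output block)
    counter = SharedCounter()
    threads = [
        threading.Thread(target=worker, args=(counter, tasks, inputs, outputs))
        for _ in range(num_workers)
    ]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return outputs.get(0), outputs.get(1)
```

In this sketch each worker performs its get, compute, and accumulate steps strictly one after another, so a worker cannot compute while it is waiting on its own communication. Because the accumulation is an integer sum, the final blocks do not depend on which worker picks up which work-item.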
doi:10.1109/padsw.2014.7097863
dblp:conf/icpads/OzogMHB14
fatcat:dhtsfj4ozbb7nm62adnctznxly