A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2008; you can also visit the original URL.
The file type is application/pdf
.
Efficient asynchronous memory copy operations on multi-core systems and I/OAT
2007
2007 IEEE International Conference on Cluster Computing
Bulk memory copies incur large overheads such as CPU stalling (i.e., no overlap of computation with memory copy operation), small register-size data movement, cache pollution, etc. Asynchronous copy engines introduced by Intel's I/O Acceleration Technology help in alleviating these overheads by offloading the memory copy operations using several DMA channels. However, the startup overheads associated with these copy engines such as pinning the application buffers, posting the descriptors and
doi:10.1109/clustr.2007.4629228
dblp:conf/cluster/VaidyanathanCHP07
fatcat:p2xw7l3ecvcgrovrzynlkyazum