Toward multi-target autotuning for accelerators
2014
2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)
Producing high-performance implementations from simple, portable computation specifications is a challenge that compilers have tried to address for several decades. More recently, a relatively stable architectural landscape has evolved into a set of increasingly diverging and rapidly changing CPU and accelerator designs, with the main common factor being dramatic increases in the levels of parallelism available. The growth of architectural heterogeneity and parallelism, combined with the very
doi:10.1109/padsw.2014.7097851
dblp:conf/icpads/ChaimovNM14
fatcat:ezx34pl57zarzpzdmkj4hmvc7i