A programming system for future proofing performance critical libraries

Li-Wen Chang, Izzat El Hajj, Hee-Seok Kim, Juan Gómez-Luna, Abdul Dakkak, Wen-mei Hwu
2016 Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming - PPoPP '16  
We present Tangram, a programming system for writing performanceportable programs. The language enables programmers to write computation and composition codelets, supported by tuning knobs and primitives for expressing data parallelism and work decomposition. The compiler and runtime use a set of techniques such as hierarchical composition, coarsening, data placement, tuning, and runtime selection based on input characteristics and microprofiling. The resulting performance is competitive with optimized vendor libraries.
doi:10.1145/2851141.2851178 dblp:conf/ppopp/ChangHKGDH16 fatcat:btc6btzamjco5fd6wy4lalrqti