A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is
Proceedings of the Platform for Advanced Scientific Computing Conference on - PASC '16
The sustained performance ratio of the main loop of the NICAM reached 0.87 PFLOPS with 81,920 nodes on the K computer. For GPUbased calculations, we applied OpenACC to the dynamical core of NICAM. ... We did not significantly change the loop and data ordering for sufficient usage of the features of the K computer, such as the hardware-aided thread barrier mechanism and the relatively high bandwidth ... The authors would like to thank Kiyotaka Sakamoto at Fujitsu Systems East Ltd. for his contribution to the optimization on the K computer. ...doi:10.1145/2929908.2929911 fatcat:n4zxi6uskzbw3e2ouezgw3ws64