OpenACC acceleration of the Nek5000 spectral element code
2015 (English)In: The international journal of high performance computing applications, ISSN 1094-3420, E-ISSN 1741-2846, Vol. 29, no 3, 311-319 p.Article in journal (Refereed) Published
We present a case study of porting NekBone, a skeleton version of the Nek5000 code, to a parallel GPU-accelerated system. Nek5000 is a computational fluid dynamics code based on the spectral element method used for the simulation of incompressible flow. The original NekBone Fortran source code has been used as the base and enhanced by OpenACC directives. The profiling of NekBone provided an assessment of the suitability of the code for GPU systems, and indicated possible kernel optimizations. To port NekBone to GPU systems required little effort and a small number of additional lines of code (approximately one OpenACC directive per 1000 lines of code). The naïve implementation using OpenACC leads to little performance improvement: on a single node, from 16 Gflops obtained with the version without OpenACC, we reached 20 Gflops with the naïve OpenACC implementation. An optimized NekBone version leads to a 43 Gflop performance on a single node. In addition, we ported and optimized NekBone to parallel GPU systems, reaching a parallel efficiency of 79.9% on 1024 GPUs of the Titan XK7 supercomputer at the Oak Ridge National Laboratory.
Place, publisher, year, edition, pages
2015. Vol. 29, no 3, 311-319 p.
Computer Science Computational Mathematics
IdentifiersURN: urn:nbn:se:kth:diva-171357DOI: 10.1177/1094342015576846ISI: 000358414200006ScopusID: 2-s2.0-84938095938OAI: oai:DiVA.org:kth-171357DiVA: diva2:843223
QC 201508042015-07-272015-07-272015-08-14Bibliographically approved