Change search
ReferencesLink to record
Permanent link

Direct link
An MPI/OpenACC implementation of a high-order electromagnetics solver with GPUDirect communication
KTH, School of Computer Science and Communication (CSC), Centres, Centre for High Performance Computing, PDC.
Show others and affiliations
2016 (English)In: The international journal of high performance computing applications, ISSN 1094-3420, E-ISSN 1741-2846, Vol. 30, no 3, 320-334 p.Article in journal (Refereed) Published
Abstract [en]

We present performance results and an analysis of a message passing interface (MPI)/OpenACC implementation of an electromagnetic solver based on a spectral-element discontinuous Galerkin discretization of the time-dependent Maxwell equations. The OpenACC implementation covers all solution routines, including a highly tuned element-by-element operator evaluation and a GPUDirect gather-scatter kernel to effect nearest neighbor flux exchanges. Modifications are designed to make effective use of vectorization, streaming, and data management. Performance results using up to 16,384 graphics processing units of the Cray XK7 supercomputer Titan show more than 2.5x speedup over central processing unit-only performance on the same number of nodes (262,144 MPI ranks) for problem sizes of up to 6.9 billion grid points. We discuss performance-enhancement strategies and the overall potential of GPU-based computing for this class of problems.

Place, publisher, year, edition, pages
Sage Publications, 2016. Vol. 30, no 3, 320-334 p.
Keyword [en]
Hybrid MPI, OpenACC, GPUDirect, spectral element-discontinuous Galerkin
National Category
Computer Science
URN: urn:nbn:se:kth:diva-194020DOI: 10.1177/1094342015626584ISI: 000382958000005ScopusID: 2-s2.0-84983414976OAI: diva2:1037621
Swedish eā€Science Research Center

QC 20161017

Available from: 2016-10-17 Created: 2016-10-14 Last updated: 2016-10-17Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Gong, Jing
By organisation
Centre for High Performance Computing, PDC
In the same journal
The international journal of high performance computing applications
Computer Science

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

ReferencesLink to record
Permanent link

Direct link