Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
An MPI/OpenACC implementation of a high-order electromagnetics solver with GPUDirect communication
KTH, School of Computer Science and Communication (CSC), Centres, Centre for High Performance Computing, PDC. KTH, Centres, SeRC - Swedish e-Science Research Centre.ORCID iD: 0000-0002-3859-9480
Show others and affiliations
2016 (English)In: The international journal of high performance computing applications, ISSN 1094-3420, E-ISSN 1741-2846, Vol. 30, no 3, p. 320-334Article in journal (Refereed) Published
Abstract [en]

We present performance results and an analysis of a message passing interface (MPI)/OpenACC implementation of an electromagnetic solver based on a spectral-element discontinuous Galerkin discretization of the time-dependent Maxwell equations. The OpenACC implementation covers all solution routines, including a highly tuned element-by-element operator evaluation and a GPUDirect gather-scatter kernel to effect nearest neighbor flux exchanges. Modifications are designed to make effective use of vectorization, streaming, and data management. Performance results using up to 16,384 graphics processing units of the Cray XK7 supercomputer Titan show more than 2.5x speedup over central processing unit-only performance on the same number of nodes (262,144 MPI ranks) for problem sizes of up to 6.9 billion grid points. We discuss performance-enhancement strategies and the overall potential of GPU-based computing for this class of problems.

Place, publisher, year, edition, pages
Sage Publications, 2016. Vol. 30, no 3, p. 320-334
Keyword [en]
Hybrid MPI, OpenACC, GPUDirect, spectral element-discontinuous Galerkin
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:kth:diva-194020DOI: 10.1177/1094342015626584ISI: 000382958000005Scopus ID: 2-s2.0-84983414976OAI: oai:DiVA.org:kth-194020DiVA, id: diva2:1037621
Funder
Swedish eā€Science Research Center
Note

QC 20161017

Available from: 2016-10-17 Created: 2016-10-14 Last updated: 2018-01-14Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Gong, Jing
By organisation
Centre for High Performance Computing, PDCSeRC - Swedish e-Science Research Centre
In the same journal
The international journal of high performance computing applications
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 62 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf