An MPI/OpenACC implementation of a high-order electromagnetics solver with GPUDirect communication
KTH, School of Computer Science and Communication (CSC), Centres, Center for High Performance Computing, PDC. KTH, Centres, SeRC - Swedish e-Science Research Centre. ORCID iD: 0000-0002-3859-9480
2016 (English). In: The International Journal of High Performance Computing Applications, ISSN 1094-3420, E-ISSN 1741-2846, Vol. 30, no. 3, pp. 320-334. Article in journal (Refereed). Published.
Abstract [en]

We present performance results and an analysis of a message passing interface (MPI)/OpenACC implementation of an electromagnetic solver based on a spectral-element discontinuous Galerkin discretization of the time-dependent Maxwell equations. The OpenACC implementation covers all solution routines, including a highly tuned element-by-element operator evaluation and a GPUDirect gather-scatter kernel to effect nearest neighbor flux exchanges. Modifications are designed to make effective use of vectorization, streaming, and data management. Performance results using up to 16,384 graphics processing units of the Cray XK7 supercomputer Titan show more than 2.5x speedup over central processing unit-only performance on the same number of nodes (262,144 MPI ranks) for problem sizes of up to 6.9 billion grid points. We discuss performance-enhancement strategies and the overall potential of GPU-based computing for this class of problems.

Place, publisher, year, edition, pages
Sage Publications, 2016. Vol. 30, no. 3, pp. 320-334
Keywords [en]
Hybrid MPI, OpenACC, GPUDirect, spectral element-discontinuous Galerkin
National category
Computer Sciences
Identifiers
URN: urn:nbn:se:kth:diva-194020
DOI: 10.1177/1094342015626584
ISI: 000382958000005
Scopus ID: 2-s2.0-84983414976
OAI: oai:DiVA.org:kth-194020
DiVA, id: diva2:1037621
Funder
Swedish e‐Science Research Center
Note

QC 20161017

Available from: 2016-10-17. Created: 2016-10-14. Last updated: 2018-01-14. Bibliographically approved.

Open Access in DiVA

No full text in DiVA

By author/editor
Gong, Jing