Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Nek5000 with OpenACC
KTH, Centres, SeRC - Swedish e-Science Research Centre. KTH, School of Computer Science and Communication (CSC), Centres, Centre for High Performance Computing, PDC.ORCID iD: 0000-0002-3859-9480
KTH, School of Computer Science and Communication (CSC), High Performance Computing and Visualization (HPCViz). KTH, Centres, SeRC - Swedish e-Science Research Centre.ORCID iD: 0000-0003-0639-0639
KTH, School of Computer Science and Communication (CSC), High Performance Computing and Visualization (HPCViz). KTH, Centres, SeRC - Swedish e-Science Research Centre.ORCID iD: 0000-0002-5415-1248
KTH, School of Computer Science and Communication (CSC), High Performance Computing and Visualization (HPCViz). KTH, Centres, SeRC - Swedish e-Science Research Centre.ORCID iD: 0000-0002-9901-9857
Show others and affiliations
2015 (English)In: Solving software challenges for exascale, 2015, 57-68 p.Conference paper, Published paper (Refereed)
Abstract [en]

Nek5000 is a computational fluid dynamics code based on the spectral element method used for the simulation of incompressible flows. We follow up on an earlier study which ported the simplified version of Nek5000 to a GPU-accelerated system by presenting the hybrid CPU/GPU implementation of the full Nek5000 code using OpenACC. The matrix-matrix multiplication, the Nek5000 gather-scatter operator and a preconditioned Conjugate Gradient solver have implemented using OpenACC for multi-GPU systems. We report an speed-up of 1.3 on single node of a Cray XK6 when using OpenACC directives in Nek5000. On 512 nodes of the Titan supercomputer, the speed-up can be approached to 1.4. A performance analysis of the Nek5000 code using Score-P and Vampir performance monitoring tools shows that overlapping of GPU kernels with host-accelerator memory transfers would considerably increase the performance of the OpenACC version of Nek5000 code.

Place, publisher, year, edition, pages
2015. 57-68 p.
Series
Lecture Notes in Computer Science, ISSN 0302-9743 ; 8759
Keyword [en]
GPU programming, Nek5000, OpenACC, Spectral element method
National Category
Computer Science
Identifiers
URN: urn:nbn:se:kth:diva-170716DOI: 10.1007/978-3-319-15976-8_4ISI: 000355749700004Scopus ID: 2-s2.0-84928882903ISBN: 978-3-319-15975-1 (print)ISBN: 978-3-319-15976-8 (print)OAI: oai:DiVA.org:kth-170716DiVA: diva2:839926
Conference
2nd International Conference on Exascale Applications and Software (EASC), APR 02-03, 2014, Stockholm, SWEDEN
Note

QC 20150706

Available from: 2015-07-06 Created: 2015-07-03 Last updated: 2017-08-16Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full textScopus

Authority records BETA

Gong, JingMarkidis, StefanoSchliephake, MichaelLaure, ErwinHenningson, DanSchlatter, Philipp

Search in DiVA

By author/editor
Gong, JingMarkidis, StefanoSchliephake, MichaelLaure, ErwinHenningson, DanSchlatter, Philipp
By organisation
SeRC - Swedish e-Science Research CentreCentre for High Performance Computing, PDCHigh Performance Computing and Visualization (HPCViz)Stability, Transition and Control
Computer Science

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 106 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf