Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Nek5000 with OpenACC
KTH, Centra, SeRC - Swedish e-Science Research Centre. KTH, Skolan för datavetenskap och kommunikation (CSC), Centra, Parallelldatorcentrum, PDC.ORCID-id: 0000-0002-3859-9480
KTH, Skolan för datavetenskap och kommunikation (CSC), High Performance Computing and Visualization (HPCViz). KTH, Centra, SeRC - Swedish e-Science Research Centre.ORCID-id: 0000-0003-0639-0639
KTH, Skolan för datavetenskap och kommunikation (CSC), High Performance Computing and Visualization (HPCViz). KTH, Centra, SeRC - Swedish e-Science Research Centre.ORCID-id: 0000-0002-5415-1248
KTH, Skolan för datavetenskap och kommunikation (CSC), High Performance Computing and Visualization (HPCViz). KTH, Centra, SeRC - Swedish e-Science Research Centre.ORCID-id: 0000-0002-9901-9857
Visa övriga samt affilieringar
2015 (Engelska)Ingår i: Solving software challenges for exascale, 2015, s. 57-68Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

Nek5000 is a computational fluid dynamics code based on the spectral element method used for the simulation of incompressible flows. We follow up on an earlier study which ported the simplified version of Nek5000 to a GPU-accelerated system by presenting the hybrid CPU/GPU implementation of the full Nek5000 code using OpenACC. The matrix-matrix multiplication, the Nek5000 gather-scatter operator and a preconditioned Conjugate Gradient solver have implemented using OpenACC for multi-GPU systems. We report an speed-up of 1.3 on single node of a Cray XK6 when using OpenACC directives in Nek5000. On 512 nodes of the Titan supercomputer, the speed-up can be approached to 1.4. A performance analysis of the Nek5000 code using Score-P and Vampir performance monitoring tools shows that overlapping of GPU kernels with host-accelerator memory transfers would considerably increase the performance of the OpenACC version of Nek5000 code.

Ort, förlag, år, upplaga, sidor
2015. s. 57-68
Serie
Lecture Notes in Computer Science, ISSN 0302-9743 ; 8759
Nyckelord [en]
GPU programming, Nek5000, OpenACC, Spectral element method
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
URN: urn:nbn:se:kth:diva-170716DOI: 10.1007/978-3-319-15976-8_4ISI: 000355749700004Scopus ID: 2-s2.0-84928882903ISBN: 978-3-319-15975-1 (tryckt)ISBN: 978-3-319-15976-8 (tryckt)OAI: oai:DiVA.org:kth-170716DiVA, id: diva2:839926
Konferens
2nd International Conference on Exascale Applications and Software (EASC), APR 02-03, 2014, Stockholm, SWEDEN
Anmärkning

QC 20150706

Tillgänglig från: 2015-07-06 Skapad: 2015-07-03 Senast uppdaterad: 2018-01-11Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

Förlagets fulltextScopus

Personposter BETA

Gong, JingMarkidis, StefanoSchliephake, MichaelLaure, ErwinHenningson, DanSchlatter, Philipp

Sök vidare i DiVA

Av författaren/redaktören
Gong, JingMarkidis, StefanoSchliephake, MichaelLaure, ErwinHenningson, DanSchlatter, Philipp
Av organisationen
SeRC - Swedish e-Science Research CentreParallelldatorcentrum, PDCHigh Performance Computing and Visualization (HPCViz)Stabilitet, Transition, Kontroll
Datavetenskap (datalogi)

Sök vidare utanför DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetricpoäng

doi
isbn
urn-nbn
Totalt: 272 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf