Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Strong Scaling of OpenACC enabled Nek5000 on several GPU based HPC systems
KTH, Skolan för elektroteknik och datavetenskap (EECS), Centra, Parallelldatorcentrum, PDC.
Uppsala University.
KTH, Skolan för elektroteknik och datavetenskap (EECS), Datavetenskap, Beräkningsvetenskap och beräkningsteknik (CST).ORCID-id: 0000-0003-3374-8093
KTH, Skolan för teknikvetenskap (SCI), Teknisk mekanik, Strömningsmekanik och Teknisk Akustik.ORCID-id: 0000-0002-7448-3290
Vise andre og tillknytning
2022 (engelsk)Inngår i: HPCAsia2022: International Conference on High Performance Computing in Asia-Pacific Region, Association for Computing Machinery (ACM) , 2022, s. 94-102Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

We present new results on the strong parallel scaling for the OpenACC-accelerated implementation of the high-order spectral element fluid dynamics solver Nek5000. The test case considered consists of a direct numerical simulation of fully-developed turbulent flow in a straight pipe, at two different Reynolds numbers Reτ = 360 and Reτ = 550, based on friction velocity and pipe radius. The strong scaling is tested on several GPU-enabled HPC systems, including the Swiss Piz Daint system, TACC's Longhorn, Jülich's JUWELS Booster, and Berzelius in Sweden. The performance results show that speed-up between 3-5 can be achieved using the GPU accelerated version compared with the CPU version on these different systems. The run-time for 20 timesteps reduces from 43.5 to 13.2 seconds with increasing the number of GPUs from 64 to 512 for Reτ = 550 case on JUWELS Booster system. This illustrates the GPU accelerated version the potential for high throughput. At the same time, the strong scaling limit is significantly larger for GPUs, at about 2000 - 5000 elements per rank; compared to about 50 - 100 for a CPU-rank.

sted, utgiver, år, opplag, sider
Association for Computing Machinery (ACM) , 2022. s. 94-102
Serie
ACM International Conference Proceeding Series
HSV kategori
Identifikatorer
URN: urn:nbn:se:kth:diva-309189DOI: 10.1145/3492805.3492818Scopus ID: 2-s2.0-85122621284OAI: oai:DiVA.org:kth-309189DiVA, id: diva2:1639938
Konferanse
HPC Asia2022: International Conference on High Performance Computing in Asia-Pacific Region Virtual Event Japan January 12 - 14, 2022
Merknad

QC 20220223

Part of conference proceedings: ISBN 978-145038498-8

Tilgjengelig fra: 2022-02-22 Laget: 2022-02-22 Sist oppdatert: 2024-03-18bibliografisk kontrollert

Open Access i DiVA

Fulltekst mangler i DiVA

Andre lenker

Forlagets fulltekstScopus

Person

Vincent, JonathanKarp, MartinPeplinski, AdamJansson, NiclasPodobas, ArturMarkidis, StefanoPleiter, DirkSchlatter, Philipp

Søk i DiVA

Av forfatter/redaktør
Vincent, JonathanKarp, MartinPeplinski, AdamJansson, NiclasPodobas, ArturMarkidis, StefanoPleiter, DirkSchlatter, Philipp
Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric

doi
urn-nbn
Totalt: 130 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf