kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Towards Performance Portable Kernels for Computational Fluid Dynamics Using DaCe
KTH, School of Electrical Engineering and Computer Science (EECS), Computer Science, Computational Science and Technology (CST).ORCID iD: 0000-0002-6384-2630
KTH, School of Electrical Engineering and Computer Science (EECS), Computer Science, Computational Science and Technology (CST).ORCID iD: 0000-0003-3374-8093
KTH, School of Electrical Engineering and Computer Science (EECS), Computer Science, Computational Science and Technology (CST).ORCID iD: 0000-0003-0639-0639
2024 (English)In: 53rd International Conference on Parallel Processing, ICPP 2024 - Workshops Proceedings, Association for Computing Machinery (ACM) , 2024, p. 110-111Conference paper, Published paper (Refereed)
Abstract [en]

With the rise of new high-performance computing (HPC) accelerators, such as Nvidia and AMD GPUs, the demand for efficient code targeting diverse hardware accelerators poses a critical challenge for HPC application developers. This hardware diversity in the HPC systems necessitates the development of new code tailored to specific architectures, which, in turn, hampers the sustainability of large scientific application development. In this work, we rely on DaCe [1, 2], a data-centric parallel programming framework, to automate the generation of high-performance kernels. DaCe can generate automatic code for multicore processors and various accelerators, alleviating the programmer burden of rewriting code for a new architecture. Our work demonstrates the automatic code generation capabilities of DaCe, applied to a critical high-performance computational kernel for Computational Fluid Dynamics code. Specifically, we focus on the Fortran-based solver, Neko [4] which is based on the Spectral Element Method. This method relies on small-sized matrix multiplications akin to BLAS dgemm operations. We describe the formulation of this computational kernel through DaCe's Stateful Dataflow Multigraph (SDFG) representation. We discuss how this representation facilitates high-performance code generation and detail the workflow for integration of DaCe's automatically generated code into the Neko solver. Initial work on Nvidia GH200. By showcasing the potential of automatic code generation, we highlight the feasibility of supporting the long-term sustainability of large-scale scientific applications by using portable solutions for critical computational kernels of large-scale codes.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM) , 2024. p. 110-111
Keywords [en]
FEM, High-Order Methods, Performance, Portability, SEM
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:kth:diva-353518DOI: 10.1145/3677333.3678270ISI: 001298775800004Scopus ID: 2-s2.0-85202810034OAI: oai:DiVA.org:kth-353518DiVA, id: diva2:1899193
Conference
53rd International Conference on Parallel Processing, ICPP 2024, August 12-15, 2024, Gotland, Sweden
Note

Part of ISBN 9798400718021

QC 20241008

Available from: 2024-09-19 Created: 2024-09-19 Last updated: 2024-10-08Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Andersson, MånsKarp, MartinMarkidis, Stefano

Search in DiVA

By author/editor
Andersson, MånsKarp, MartinMarkidis, Stefano
By organisation
Computational Science and Technology (CST)
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 61 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf