kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Task-decomposed Overlapped Preconditioner for Sustained Strong Scalability on Accelerated Exascale Systems
KTH, School of Electrical Engineering and Computer Science (EECS), Centres, Centre for High Performance Computing, PDC.ORCID iD: 0000-0002-5020-1631
KTH, School of Electrical Engineering and Computer Science (EECS), Computational Science and Technology.ORCID iD: 0000-0003-3374-8093
KTH, School of Electrical Engineering and Computer Science (EECS), Centres, Centre for High Performance Computing, PDC.ORCID iD: 0000-0003-0603-5514
KTH, School of Electrical Engineering and Computer Science (EECS), Computational Science and Technology.ORCID iD: 0000-0003-0639-0639
Show others and affiliations
2026 (English)In: Proceedings of Supercomputing Asia and International Conference on High Performance Computing in Asia Pacific Region, SCA/HPCAsia 2026, Association for Computing Machinery (ACM) , 2026, p. 186-193Conference paper, Published paper (Refereed)
Abstract [en]

We detail our work on improving the performance and scalability of key numerical methods in the high-fidelity spectral element code Neko on accelerated exascale machines. Eifficient preconditioners are essential in incompressible fluid dynamics; however, the most eifficient method (with respect to convergence) might be challenging to implement with good performance on an accelerator. We present our development of a GPU-optimised preconditioner with task overlapping for the pressure-Poisson equation, improving the preconditioner's throughput (in TDoF/s) by close to 60%. The new preconditioner is explained in detail, together with detailed performance studies on accelerated Cray EX platforms, including strong scalability studies on LUMI and Frontier.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM) , 2026. p. 186-193
Keywords [en]
Accelerators, Direct numerical simulation, Spectral element method
National Category
Computational Mathematics Computer Sciences Fluid Mechanics
Identifiers
URN: urn:nbn:se:kth:diva-378758DOI: 10.1145/3773656.3773690Scopus ID: 2-s2.0-105031765001OAI: oai:DiVA.org:kth-378758DiVA, id: diva2:2049391
Conference
Supercomputing Asia and International Conference on High Performance Computing in Asia Pacific Region, SCA/HPCAsia 2026, Osaka, Japan, January 26-29, 2026
Note

Part of ISBN 9798400720673

QC 20260330

Available from: 2026-03-30 Created: 2026-03-30 Last updated: 2026-03-30Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Jansson, NiclasKarp, MartinPáll, SzilárdMarkidis, StefanoSchlatter, Philipp

Search in DiVA

By author/editor
Jansson, NiclasKarp, MartinPáll, SzilárdMarkidis, StefanoSchlatter, Philipp
By organisation
Centre for High Performance Computing, PDCComputational Science and TechnologyFluid Mechanics
Computational MathematicsComputer SciencesFluid Mechanics

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 14 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf