kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
GROMACS on AMD GPU-Based HPC Platforms: Using SYCL for Performance and Portability
KTH, School of Engineering Sciences (SCI), Applied Physics, Biophysics.
KTH, Centres, Science for Life Laboratory, SciLifeLab. KTH, Centres, SeRC - Swedish e-Science Research Centre. KTH, School of Electrical Engineering and Computer Science (EECS), Centres, Centre for High Performance Computing, PDC.ORCID iD: 0000-0003-0603-5514
KTH, Centres, SeRC - Swedish e-Science Research Centre. KTH, Centres, Science for Life Laboratory, SciLifeLab. KTH, School of Engineering Sciences (SCI), Applied Physics, Biophysics.ORCID iD: 0000-0002-2734-2794
2024 (English)In: CUG2024 Proceedings, 2024Conference paper, Published paper (Refereed)
Abstract [en]

GROMACS is a widely-used molecular dynamics software package with a focus on performance, portability, and maintainability across a broad range of platforms. Thanks to its early algorithmic redesign and flexible heterogeneous parallelization, GROMACS has successfully harnessed GPU accelerators for more than a decade.With the diversification of accelerator platforms in HPC and no obvious choice for a well-suited multi-vendor programming model, the GROMACS project found itself at a crossroads. The performance and portability requirements, as well as a strong preference for a standards-based programming model, motivated our choice to use SYCL for production on both new HPC GPU platforms: AMD and Intel.Since the GROMACS 2022 release, the SYCL backend has been the primary means to target AMD GPUs in preparation for exascale HPC architectures like LUMI and Frontier.SYCL is a cross-platform, royalty-free, C++17-based standard for programming hardware accelerators, from embedded to HPC.It allows using the same code to target GPUs from all three major vendors with minimal specialization, which offers major portability benefits.While SYCL implementations build on native compilers and runtimes, whether such an approach is performant is not immediately evident.Biomolecular simulations have challenging performance characteristics: latency sensitivity, the need for strong scaling, and typical iteration times as short as hundreds of microseconds. Hence, obtaining good performance across the range of problem sizes and scaling regimes is particularly challenging.Here, we share the results of our work on readying GROMACS for AMD GPU platforms using SYCL,and demonstrate performance on Cray EX235a machines with MI250X accelerators. Our findings illustrate that portability is possible without major performance compromises.We provide a detailed analysis of node-level kernel and runtime performance with the aim of sharing best practices with the HPC community on using SYCL as a performance-portable GPU framework.

Place, publisher, year, edition, pages
2024.
Keywords [en]
molecular dynamics; heterogeneous parallelism; GPU; SYCL; GROMACS
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:kth:diva-351148OAI: oai:DiVA.org:kth-351148DiVA, id: diva2:1886363
Conference
Cray User Group 2024, Perth, Western Australia, on May 5-9, 2024
Note

qc 20240802

Available from: 2024-07-31 Created: 2024-07-31 Last updated: 2024-08-02Bibliographically approved

Open Access in DiVA

fulltext(415 kB)20 downloads
File information
File name FULLTEXT01.pdfFile size 415 kBChecksum SHA-512
65de24d0efbbd569005b86a16e7e2d570b5e3f46de3090d61b411f0c099492a2c55cf002b9fa1522ad4aef61f93f8d3e7539e341e034fb9f288a98ff6ccf5e5a
Type fulltextMimetype application/pdf

Authority records

Alekseenko, AndreyPall, SzilardLindahl, Erik

Search in DiVA

By author/editor
Alekseenko, AndreyPall, SzilardLindahl, Erik
By organisation
BiophysicsScience for Life Laboratory, SciLifeLabSeRC - Swedish e-Science Research CentreCentre for High Performance Computing, PDC
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 20 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 95 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf