kth.sePublications
Planned maintenance
A system upgrade is planned for 10/12-2024, at 12:00-13:00. During this time DiVA will be unavailable.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Spectral Element Simulations on the NEC SX-Aurora TSUBASA
KTH, School of Electrical Engineering and Computer Science (EECS), Centres, Centre for High Performance Computing, PDC.ORCID iD: 0000-0002-5020-1631
2021 (English)In: HPC Asia 2021: The International Conference on High Performance Computing in Asia-Pacific Region, Association for Computing Machinery (ACM) , 2021Conference paper, Published paper (Refereed)
Abstract [en]

Following the recent transition in the high performance computing landscape to more heterogeneous architectures, application developers are faced with the challenge of ensuring good performance across a diverse set of platforms. In this paper, we present our work on porting the spectral element code Nek5000 to the recent vector architecture SX-Aurora TSUBASA. Using Nek5000's mini-app Nekbone, we formulate suitable loop transformations in key kernels, allowing for better vectorization, increasing the baseline performance by a factor of six. Using the new transformations, we demonstrate that the main compute intensive matrix-vector and matrix-matrix multiplication kernels achieves close to half the peak performance of a SX-Aurora core. Our work also addresses the gather-scatter operations, a key kernel for efficient matrix-free spectral element formulation. We introduce a new implementation of Nek5000's gather-scatter library with mesh topology awareness for improved vectorization via exploitation of the SX-Aurora's hardware gather-scatter instructions, improving performance with up to 116%. A detailed description of the implementation is given together with a performance study, comparing both single node performance and strong scalability characteristics, running across multiple SX-Aurora cards.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM) , 2021.
National Category
Computer Sciences Computational Mathematics
Identifiers
URN: urn:nbn:se:kth:diva-286604DOI: 10.1145/3432261.3432265ISI: 000682970300004Scopus ID: 2-s2.0-85099878544OAI: oai:DiVA.org:kth-286604DiVA, id: diva2:1504203
Conference
The International Conference on High Performance Computing in Asia-Pacific Region (HPCAsia 2021)
Funder
EU, Horizon 2020, 823691EU, Horizon 2020, 801039
Note

Part of proceedings: ISBN 9781450388429,

QC 20230117

Available from: 2020-11-26 Created: 2020-11-26 Last updated: 2023-02-07Bibliographically approved

Open Access in DiVA

fulltext(645 kB)205 downloads
File information
File name FULLTEXT01.pdfFile size 645 kBChecksum SHA-512
0b02b4904948163aa8c1f6021e87759c4ee981daf90c77bc031f76fbba4c3ec530ab7f44e3d04fb8565a9f2ec930a1c4a9004405ff4e2c3225e1a7a8c9198cdd
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Jansson, Niclas

Search in DiVA

By author/editor
Jansson, Niclas
By organisation
Centre for High Performance Computing, PDC
Computer SciencesComputational Mathematics

Search outside of DiVA

GoogleGoogle Scholar
Total: 205 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 727 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf