kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Less for more: reducing intra-cgra connectivity for higher performance and efficiency in hpc
Center for Computational Science (R-CCS), RIKEN, Japan.
Center for Computational Science (R-CCS), RIKEN, Japan.
Center for Computational Science (R-CCS), RIKEN, Japan.
Center for Computational Science (R-CCS), RIKEN, Japan.
Show others and affiliations
2023 (English)In: 2023 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2023, Institute of Electrical and Electronics Engineers (IEEE) , 2023, p. 452-459Conference paper, Published paper (Refereed)
Abstract [en]

Coarse-Grained Reconfigurable Arrays (CGRAs) are a class of reconfigurable architectures that inherit the performance of Domain-specific accelerators and the reconfigurability aspects of Field-Programmable Gate Arrays (FPGAs). Historically, CGRAs have been successfully used to accelerate embedded applications and are now considered to accelerate High-Performance Computing (HPC) applications in future supercomputers. However, embedded systems and supercomputers are two vastly different domains with different applications and constraints, and it is today not fully understood what CGRA design decisions adequately cater to the HPC market. One such unknown design decision is regarding the interconnect that facilitates intra-CGRA communication. Our findings show that even the typical king-style mesh-like topology is often under-utilized with a typical HPC workload, leading to inefficiency. This research aims to explore the provisioning of intra-CGRA interconnect for HPC-oriented workloads and, ultimately, recoup the potential performance and efficiency lost by reducing the interconnect complexity. We proposed several reduced interconnect topologies based on the usage statistic. Then we evaluate the tradeoffs regarding hardware cost, routability of DFGs, and computational throughput.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE) , 2023. p. 452-459
Keywords [en]
CGRA, Design space exploration, HPC, Routing architecture, RTL simulation
National Category
Embedded Systems
Identifiers
URN: urn:nbn:se:kth:diva-336739DOI: 10.1109/IPDPSW59300.2023.00077ISI: 001055030700056Scopus ID: 2-s2.0-85169299919OAI: oai:DiVA.org:kth-336739DiVA, id: diva2:1798497
Conference
2023 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2023, St. Petersburg, United States of America, May 15 2023 - May 19 2023
Note

Part of ISBN 9798350311990 

QC 20230919

Available from: 2023-09-19 Created: 2023-09-19 Last updated: 2023-10-02Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Podobas, Artur

Search in DiVA

By author/editor
Podobas, Artur
By organisation
Computational Science and Technology (CST)
Embedded Systems

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 174 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf