Matrix Engines for High Performance Computing: A Paragon of Performance or Grasping at Straws?Show others and affiliations
2021 (English)In: 2021 IEEE 35TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), Institute of Electrical and Electronics Engineers (IEEE) , 2021, p. 1056-1065Conference paper, Published paper (Refereed)
Abstract [en]
Matrix engines or units, in different forms and affinities, are becoming a reality in modern processors; CPUs and otherwise. The current and dominant algorithmic approach to Deep Learning merits the commercial investments in these units, and deduced from the No.1 benchmark in supercomputing, namely High Performance Linpack, one would expect an awakened enthusiasm by the HPC community, too. Hence, our goal is to identify the practical added benefits for HPC and machine learning applications by having access to matrix engines. For this purpose, we perform an in-depth survey of software stacks, proxy applications and benchmarks, and historical batch job records. We provide a cost-benefit analysis of matrix engines, both asymptotically and in conjunction with state-of-the-art processors. While our empirical data will temper the enthusiasm, we also outline opportunities to misuse these dense matrix-multiplication engines if they come for free.
Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE) , 2021. p. 1056-1065
Series
International Parallel and Distributed Processing Symposium IPDPS, ISSN 1530-2075
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:kth:diva-303380DOI: 10.1109/IPDPS49936.2021.00114ISI: 000695273000106Scopus ID: 2-s2.0-85110825497OAI: oai:DiVA.org:kth-303380DiVA, id: diva2:1603382
Conference
35th IEEE International Parallel and Distributed Processing Symposium (IPDPS), MAY 17-21, 2021, ELECTR NETWORK
Note
Part of proceedings: ISBN 978-1-6654-4066-0, QC 20230117
2021-10-152021-10-152023-01-17Bibliographically approved