On the Rise of AMD Matrix Cores: Performance, Power Efficiency, and ProgrammabilityShow others and affiliations
2024 (English)In: Proceedings - 2024 IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2024, Institute of Electrical and Electronics Engineers (IEEE) , 2024, p. 132-143Conference paper, Published paper (Refereed)
Abstract [en]
Matrix multiplication is a core computational part of deep learning and scientific workloads. The emergence of Matrix Cores in high-end AMD GPUs, a building block of Exascale computers, opens new opportunities for optimizing the performance and power efficiency of compute-intensive applications. This work provides a timely, comprehensive characterization of the novel Matrix Cores in AMD GPUs. We develop low-level micro-benchmarks for leveraging Matrix Cores at different levels of parallelism, achieving up to 350, 88, and 69 TFLOPS for mixed, float, and double precision on one GPU. Using results obtained from the micro-benchmarks, we provide a performance model of Matrix Cores that can guide application developers in performance tuning. We also provide the first quantitative study and modeling of the power efficiency of Matrix Cores at different floating-point data types. Finally, we evaluate the high- level programmability of Matrix Cores through the rocBLAS library in a wide range of matrix sizes from 16 to 64K. Our results indicate that application developers can transparently leverage Matrix Cores to deliver more than 92% peak computing throughput by properly selecting data types and interfaces.
Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE) , 2024. p. 132-143
Keywords [en]
AI00, AMD GPU, AMD MI250, Matrix Core, Tensor Core
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:kth:diva-351749DOI: 10.1109/ISPASS61541.2024.00022Scopus ID: 2-s2.0-85199903360OAI: oai:DiVA.org:kth-351749DiVA, id: diva2:1888716
Conference
2024 IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2024, Indianapolis, United States of America, May 5 2024 - May 7 2024
Note
Part of ISBN [9798350376388]
QC 20240823
2024-08-132024-08-132024-08-23Bibliographically approved