Cloud abstractions for AI workloads
KAUST, Thuwal, Saudi Arabia. ORCID iD: 0000-0002-5051-4283
Carnegie Mellon University, Pittsburgh, USA. ORCID iD: 0000-0002-5855-8811
Microsoft Azure, Redmond, USA. ORCID iD: 0000-0001-5971-5084
Microsoft, Redmond, USA. ORCID iD: 0000-0003-2591-4012
2025 (English)
In: APSys '25: Proceedings of the 16th ACM SIGOPS Asia-Pacific Workshop on Systems, Association for Computing Machinery (ACM), 2025, p. 98-105
Conference paper, Published paper (Refereed)
Abstract [en]

AI workloads, often hosted in multi-tenant cloud environments, require vast computational resources but suffer inefficiencies due to limited tenant-provider coordination. Tenants lack infrastructure insights, while providers lack workload details to optimize tasks like partitioning, scheduling, and fault tolerance. We propose HarmonAIze to redefine cloud abstractions, enabling cooperative optimization for improved performance, efficiency, resiliency, and sustainability. We outline key opportunities and challenges this vision faces.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2025. p. 98-105
National Category
Computer Engineering
Identifiers
URN: urn:nbn:se:kth:diva-374907
DOI: 10.1145/3725783.3764395
OAI: oai:DiVA.org:kth-374907
DiVA, id: diva2:2025782
Conference
The 16th ACM SIGOPS Asia-Pacific Workshop on Systems (APSys 2025), Seoul, Republic of Korea, Oct 12-13, 2025
Funder
Knut and Alice Wallenberg Foundation
Note

Part of ISBN 979-8-4007-1572-3

QC 20260108

Available from: 2026-01-07. Created: 2026-01-07. Last updated: 2026-01-08. Bibliographically approved.

Open Access in DiVA

fulltext (471 kB), 23 downloads
File information
File name: FULLTEXT01.pdf
File size: 471 kB
Checksum (SHA-512): 3d3eca0843df88d2dfa285e25cd3a6024e5b4c52a1233334d8a89aba4477edc288500a2656368f5151cd1178672346ff03ecd584a389e364e50249da71257a1f
Type: fulltext
Mimetype: application/pdf

By author/editor
Canini, Marco; Benson, Theophilus A.; Bianchini, Ricardo; Goiri, Íñigo; Kostic, Dejan; Pietzuch, Peter; Peter, Simon
By organisation
Network Systems Laboratory (NS Lab)
Computer Engineering
