kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Diffusion-Based Time Series Data Imputation for Cloud Failure Prediction at Microsoft 365
Microsoft, China.
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL.ORCID iD: 0000-0002-7189-1336
Microsoft, China.
Microsoft, China.
Show others and affiliations
2023 (English)In: ESEC/FSE 2023 - Proceedings of the 31st ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Association for Computing Machinery (ACM) , 2023, p. 2050-2055Conference paper, Published paper (Refereed)
Abstract [en]

Ensuring reliability in large-scale cloud systems like Microsoft 365 is crucial. Cloud failures, such as disk and node failure, threaten service reliability, causing service interruptions and financial loss. Existing works focus on failure prediction and proactively taking action before failures happen. However, they suffer from poor data quality, like data missing in model training and prediction, which limits performance. In this paper, we focus on enhancing data quality through data imputation by the proposed Diffusion+, a sample-efficient diffusion model, to impute the missing data efficiently conditioned on the observed data. Experiments with industrial datasets and application practice show that our model contributes to improving the performance of downstream failure prediction.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM) , 2023. p. 2050-2055
Keywords [en]
Diffusion model, disk failure prediction, missing data imputation
National Category
Software Engineering Other Computer and Information Science
Identifiers
URN: urn:nbn:se:kth:diva-341954DOI: 10.1145/3611643.3613866ISI: 001148157800169Scopus ID: 2-s2.0-85180547809OAI: oai:DiVA.org:kth-341954DiVA, id: diva2:1824946
Conference
31st ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/FSE 2023, San Francisco, United States of America, Dec 3 2023 - Dec 9 2023
Note

Part of ISBN 9798400703270

QC 20240108

Available from: 2024-01-08 Created: 2024-01-08 Last updated: 2025-07-02Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Yin, WenjieBjörkman, Mårten

Search in DiVA

By author/editor
Yin, WenjieBjörkman, Mårten
By organisation
Robotics, Perception and Learning, RPL
Software EngineeringOther Computer and Information Science

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 63 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf