kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
CheckMate: Evaluating Checkpointing Protocols for Streaming Dataflows
Delft University of Technology, Delft University of Technology.
Delft University of Technology, Delft University of Technology.
Delft University of Technology, Delft University of Technology.
Delft University of Technology, Delft University of Technology.
Show others and affiliations
2024 (English)In: Proceedings - 2024 IEEE 40th International Conference on Data Engineering, ICDE 2024, Institute of Electrical and Electronics Engineers (IEEE) , 2024, p. 4030-4043Conference paper, Published paper (Refereed)
Abstract [en]

Stream processing in the last decade has seen broad adoption in both commercial and research settings. One key element for this success is the ability of modern stream processors to handle failures while ensuring exactly-once processing guarantees. At the moment of writing, virtually all stream processors that guarantee exactly-once processing implement a variant of Apache Flink's coordinated checkpoints -an extension of the original Chandy-Lamport checkpoints from 1985. However, the reasons behind this prevalence of the coordinated approach remain anecdotal, as reported by practitioners of the stream processing community. At the same time, common checkpointing approaches, such as the uncoordinated and the communication-induced ones, remain largely unexplored. This paper is the first to address this gap by i) shedding light on why practitioners have favored the coordinated approach and ii) investigating whether there are viable alternatives. To this end, we implement three checkpointing approaches that we surveyed and adapted for the distinct needs of streaming dataflows. Our analysis shows that the coordinated approach outperforms the uncoordinated and communication-induced protocols under uniformly distributed workloads. To our surprise, however, the uncoordinated approach is not only competitive to the coordinated one in uniformly distributed workloads, but it also outperforms the coordinated approach in skewed workloads. We conclude that rather than blindly employing coordinated checkpointing, research should focus on optimizing the very promising uncoordinated approach, as it can address issues with skew and support prevalent cyclic queries. We believe that our findings can trigger further research into checkpointing mechanisms.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE) , 2024. p. 4030-4043
Keywords [en]
Benchmarking, Checkpointing, Experimental evaluation, Fault tolerance, Stream processing
National Category
Computer Sciences Software Engineering
Identifiers
URN: urn:nbn:se:kth:diva-351951DOI: 10.1109/ICDE60146.2024.00309Scopus ID: 2-s2.0-85200460157OAI: oai:DiVA.org:kth-351951DiVA, id: diva2:1890167
Conference
40th IEEE International Conference on Data Engineering, ICDE 2024, Utrecht, Netherlands, Kingdom of the, May 13 2024 - May 17 2024
Note

 Part of ISBN 9798350317152

QC 20240827

Available from: 2024-08-19 Created: 2024-08-19 Last updated: 2024-08-27Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Carbone, Paris

Search in DiVA

By author/editor
Carbone, Paris
By organisation
Software and Computer systems, SCS
Computer SciencesSoftware Engineering

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 27 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf