Change search
ReferencesLink to record
Permanent link

Direct link
Fault Prediction in Distributed Systems Gone Wild
EPFL.ORCID iD: 0000-0002-1256-1070
2010 (English)In: Proceedings of The 4th ACM SIGOPS/SIGACT Workshop on Large Scale Distributed Systems and Middleware, Association for Computing Machinery (ACM), 2010, -11 p.Conference paper (Refereed)
Abstract [en]

We consider the problem of predicting faults in deployed, large-scale distributed systems that are heterogeneous and federated. Motivated by the importance of ensuring reliability of the services these systems provide, we argue that the key step in making these systems reliable is the need to automatically predict faults. For example, doing so is vital for avoiding Internet-wide outages that occur due to programming errors or misconfigurations.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2010. -11 p.
National Category
Computer Science
URN: urn:nbn:se:kth:diva-147099DOI: 10.1145/1859184.1859187ScopusID: 2-s2.0-78649392927ISBN: 978-1-4503-0406-1OAI: diva2:727668
The 4th ACM SIGOPS/SIGACT Workshop on Large Scale Distributed Systems and Middleware,Zurich, SwitzerlandJuly 25-28, 2010

QC 20140707

Available from: 2014-06-23 Created: 2014-06-23 Last updated: 2014-07-07Bibliographically approved

Open Access in DiVA

fulltext(177 kB)80 downloads
File information
File name FULLTEXT01.pdfFile size 177 kBChecksum SHA-512
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopusACM Digital libraryConference website

Search in DiVA

By author/editor
Kostic, Dejan
Computer Science

Search outside of DiVA

GoogleGoogle Scholar
Total: 80 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 53 hits
ReferencesLink to record
Permanent link

Direct link