Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Achieving Robust Self Management for Large Scale Distributed Applications using Management Elements
KTH, School of Information and Communication Technology (ICT).
2010 (English)Independent thesis Advanced level (degree of Master (Two Years)), 80 credits / 120 HE creditsStudent thesis
Abstract [en]

Abstract

Autonomic computing is an approach proposed by IBM that enables a system to self-con gure, self-heal, self-optimize, and self-protect itself, usually referred to as self-* or self-management. Humans should only specify higher level policies to guide the self-* behavior of the system.

Self-Management is achieved using control feedback loops that consist of four stages: monitor, analyze, plan, and execute. Management is more challenging in dynamic distributed environments where resources can join, leave, and fail. To address this problem a Distributed Component Management System (DCMS), a.k.a Niche, is being developed at KTH and SICS (Swedish Institute of Computer Science). DCMS provides abstractions that enable the construction of distributed control feedback loops. Each loop consists of a number of management elements (MEs) that do one or more of the four stages of a control loop mentioned above.

The current implementation of DCMS assumes that management elements (MEs) are deployed on stable nodes that do not fail. This assumption is dicult to guarantee in many environments and application scenarios. One solution to this limitation is to replicate MEs so that if one fails other MEs can continue working and restore the failed one. The problem is that MEs are stateful. We need to keep the state consistent among replicas. We also want to be sure that all events are processed (nothing is lost) and all actions are applied exactly once.

This report explains a proposal for the replication of stateful MEs under DCMS framework. For improved scalability, load-balancing and fault-tolerance, dierent breakthroughs in the eld of replicated state machine has been taken into account and discussed in this report. Chord has been used as an underlying structured overlay network (SON). This report also describes a prototype implementation of this proposal and discusses the results.

 

Place, publisher, year, edition, pages
2010. , 108 p.
Identifiers
URN: urn:nbn:se:kth:diva-24229OAI: oai:DiVA.org:kth-24229DiVA: diva2:345684
Uppsok
Technology
Examiners
Available from: 2010-08-26 Created: 2010-08-26 Last updated: 2010-08-26Bibliographically approved

Open Access in DiVA

fulltext(1624 kB)309 downloads
File information
File name FULLTEXT01.pdfFile size 1624 kBChecksum SHA-512
8e01c3252063ca6996ba0789d8d326a29f063eb344376f460badbd516e2cd264c1f2bc38fadf07490b42da618da95f6e730128151342b916be9f713a51927d67
Type fulltextMimetype application/pdf

By organisation
School of Information and Communication Technology (ICT)

Search outside of DiVA

GoogleGoogle Scholar
Total: 309 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 315 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf