Efficient and Cost-effective Workflow Based on Containers for Distributed Reproducible Experiments
KTH, School of Information and Communication Technology (ICT).
2016 (English). Independent thesis, advanced level (degree of Master), 20 credits / 30 HE credits. Student thesis (degree project)
Abstract [en]

Reproducing distributed experiments is a challenging task for many researchers, and several factors make the problem harder to solve. To reproduce distributed experiments, researchers need to perform complex deployments that involve many interdependent software stacks, numerous configurations, and manual orchestration.

Further, researchers need to allocate a large amount of money for clusters of machines and then spend valuable time performing those experiments. Some researchers also spend a lot of time validating a distributed scenario in a real environment, as most pseudo-distributed systems do not provide the characteristics of a real distributed system.

Karamel addresses the inconvenience of manual orchestration by providing a comprehensive orchestration platform for deploying and running distributed experiments. However, this solution may incur expenses similar to those of a manual distributed setup, since it uses virtual machines underneath. Further, it does not allow quick validation of a distributed setup with a short feedback loop, as terminating and provisioning new virtual machines takes considerable time.

Therefore, we provide a solution by integrating Docker that can co-exist seamlessly with the virtual machine based deployment model. Our solution encapsulates the container-based deployment model so that users can reproduce distributed experiments in a cost-effective and efficient manner.

In this project, we introduce a novel deployment model with containers that is not possible with the conventional virtual machine based deployment model. Further, we evaluate our solution with a real deployment of the Apache Hadoop Terasort experiment, a benchmark for the Apache Hadoop MapReduce platform, to explain how this model can be used to reduce cost and improve efficiency.

Place, publisher, year, edition, pages
2016. 48 p.
Series
TRITA-ICT-EX, 2016:125
Keywords [en]
docker, orchestration, container, workflow, cloud, reproducible-experiments
National Category
Computer Systems
Identifiers
URN: urn:nbn:se:kth:diva-194209
OAI: oai:DiVA.org:kth-194209
DiVA: diva2:1038806
Subject / course
Computer Science
Educational program
Master of Science - Distributed Systems
Supervisors
Examiners
Available from: 2016-11-07. Created: 2016-10-19. Last updated: 2017-04-24. Bibliographically approved

Open Access in DiVA

fulltext (9900 kB), 102 downloads
File information
File name: FULLTEXT01.pdf. File size: 9900 kB. Checksum: SHA-512
6fbb6baa18899addbf1243e4f3244dd73517af349813e6110bab3948efeeb300073e7fe4d1fb6c38dbd5a57e90f1635ca01d264cc5bd1392dea9bd144251c564
Type: fulltext. Mimetype: application/pdf
