Efficient and Cost-effective Workflow Based on Containers for Distributed Reproducible Experiments
KTH, School of Information and Communication Technology (ICT).
2016 (English). Independent thesis, Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits. Student thesis
Abstract [en]

Reproducing distributed experiments is a challenging task for many researchers. There are many factors which make this problem harder to solve. In order to reproduce distributed experiments, researchers need to perform complex deployments which involve many dependent software stacks with many configurations and manual orchestrations.

Further, researchers need to allocate a large amount of money for clusters of machines and then spend valuable time performing those experiments. Some researchers also spend a lot of time validating a distributed scenario in a real environment, since most pseudo-distributed systems do not provide the characteristics of a real distributed system.

Karamel addresses the inconvenience of manual orchestration by providing a comprehensive orchestration platform to deploy and run distributed experiments. Still, this solution may incur expenses similar to those of a manual distributed setup, since it uses virtual machines underneath. Further, it does not allow a distributed setup to be validated with a quick feedback loop, as terminating and provisioning new virtual machines takes considerable time.

Therefore, we provide a solution by integrating Docker, which can coexist seamlessly with the virtual-machine-based deployment model. Our solution encapsulates the container-based deployment model so that users can reproduce distributed experiments in a cost-effective and efficient manner.

In this project, we introduce a novel deployment model with containers that is not possible with the conventional virtual-machine-based deployment model. Further, we evaluate our solution with a real deployment of the Apache Hadoop TeraSort experiment, a benchmark for the Apache Hadoop MapReduce platform, to show how this model can be used to save cost and improve efficiency.

Place, publisher, year, edition, pages
2016. 48 p.
Series
TRITA-ICT-EX, 2016:125
Keywords [en]
docker, orchestration, container, workflow, cloud, reproducible-experiments
HSV category
Identifiers
URN: urn:nbn:se:kth:diva-194209
OAI: oai:DiVA.org:kth-194209
DiVA: diva2:1038806
Subject / course
Computer Science
Educational program
Master of Science - Distributed Computing
Supervisor
Examiner
Available from: 2016-11-07 Created: 2016-10-19 Last updated: 2017-04-24. Bibliographically approved

Open Access in DiVA

fulltext (9900 kB), 93 downloads
File information
File: FULLTEXT01.pdf
File size: 9900 kB
Checksum (SHA-512):
6fbb6baa18899addbf1243e4f3244dd73517af349813e6110bab3948efeeb300073e7fe4d1fb6c38dbd5a57e90f1635ca01d264cc5bd1392dea9bd144251c564
Type: fulltext
Mimetype: application/pdf

Total: 93 downloads
The number of downloads is the sum of all downloads of all full texts. It may include, for example, earlier versions that are no longer available.

Total: 208 hits