kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Accelerating Distributed Storage in Heterogeneous Settings
KTH, School of Electrical Engineering and Computer Science (EECS), Computer Science, Software and Computer systems, SCS.ORCID iD: 0000-0001-5890-9629
2022 (English)Doctoral thesis, monograph (Other academic)
Abstract [en]

Heterogeneity in cloud environments is a fact of life—from workload skews and network path changes, to the diversity of server hardware components, these are all factors that impact the performance of distributed storage. In this dissertation, we identify that heterogeneity can in fact be one of the primary causes of service degradation for storage systems. We then tackle this challenge by building next-generation distributed storage systems that can operate amidst heterogeneity while providing fast and predictable response times. First, we study skews in cloud workloads and propose scheduling strategies for key-value stores that seek to optimize latency. We then conduct a measurements study in one of the largest cloud provider networks to quantify variations in network latencies, and possible implications for storage services. Next, with fast non-volatile RAM (NVRAM) now becoming commercially available, we look into how storage systems can deal with the increasing diversity of storage technologies. We design and evaluate a distributed file system that can manage data across NVRAM and other types of storage, while providing low latency and high scalability. Lastly, we build a framework that transforms commodity Remote Direct Memory Access (RDMA) NICs into Turing machines—capable of performing arbitrary computations. This provides yet another compute resource on server machines, and we show how we can leverage it to accelerate common storage tasks as well as real storage applications.

Abstract [sv]

Heterogenitet i molnmiljöer är ett livsfaktum—allt från skevheter i arbetsbelastningen och nätverksvägsförändringar till diversitet av serverhårdvarukomponenter, dessa är faktorer som påverkar prestanda för distribuerad lagring. I denna avhandling identifierar vi att heterogenitet faktiskt kan vara en av de främsta orsakerna till tjänste-degradering för lagringssystem. Vi tacklar sedan denna utmaning genom att bygga nästa generations distribuerade lagringssystem som kan fungera mitt i heterogenitet medan de ger snabba och förutsägbara svarstider. Först studerar vi skevheter i molnarbetsbelastningar och föreslår schemaläggningsstrategier för nyckelvärdelagring som försöker optimera latens. Vi utför sedan en mätstudie i ett av de största molnleverantörsnätverken för att kvantifiera variationer i nätverkslatenser och möjliga implikationer för lagringstjänster. Därefter, i och med att snabbt non-volatile RAM (NVRAM) nu blir kommersiellt tillgängligt, undersöker vi hur lagringssystem kan hantera den ökande diversiteten av lagringsteknik. Vi designar och utvärderar ett distribuerat filsystem som kan hantera data tvärs över NVRAM och andra typer av lagring, medan det ger låg latens och hög skalbarhet. Slutligen bygger vi ett ramverk som transformerar lättillgänglig Remote Direct Memory Access (RDMA) NICs till Turingmaskiner—som kan utföra godtyckliga beräkningar. Detta ger ännu en beräkningsresurs på servrar, och vi visar hur vi kan utnyttja den för att accelerera gemensamma lagringsuppgifter såväl som verkliga lagringsapplikationer.

Place, publisher, year, edition, pages
Kista, Stockholm, Sweden: KTH Royal Institute of Technology, 2022. , p. 177
Series
TRITA-EECS-AVL ; 2022:32
Keywords [en]
Distributed storage, File systems, Hardware offload, Persistent memory
Keywords [sv]
Distribuerad lagring, Dilsystem, Maskinvaruavlastning, Beständigt minne
National Category
Computer Systems Communication Systems Computer Sciences
Research subject
Computer Science
Identifiers
URN: urn:nbn:se:kth:diva-311963ISBN: 978-91-8040-230-9 (print)OAI: oai:DiVA.org:kth-311963DiVA, id: diva2:1656395
Public defence
2022-05-30, https://kth-se.zoom.us/meeting/register/u5Iqd-yhqTorGNLFoosJXIJmTXZl3rsxS55J, Sal C, Electrum, Kungliga Tekniska Högskolan, Kistagången 16, Kista, Stockholm, 15:00 (English)
Opponent
Supervisors
Funder
EU, European Research Council, 770889
Note

This work was also supported by a fellowship from the Erasmus Mundus Joint Doctorate in Distributed Computing (EMJD-DC), funded by the European Commission (EACEA) (FPA 2012-0030). QC 20220509

Available from: 2022-05-09 Created: 2022-05-05 Last updated: 2022-06-25Bibliographically approved

Open Access in DiVA

fulltext(4546 kB)748 downloads
File information
File name FULLTEXT01.pdfFile size 4546 kBChecksum SHA-512
acbbe82ca3fa5665a6fc1df93df6642679501fae60f4ac23bd10eab878d897dbfa7798c69e8d2099883163add62c2bb5f56f8530f824c60f61f00ef33bdcc487
Type fulltextMimetype application/pdf

Authority records

Reda, Waleed

Search in DiVA

By author/editor
Reda, Waleed
By organisation
Software and Computer systems, SCS
Computer SystemsCommunication SystemsComputer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 751 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 1038 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf