Change search
ReferencesLink to record
Permanent link

Direct link
Contributions to Performance Modeling and Management of Data Centers
KTH, School of Electrical Engineering (EES), Communication Networks. KTH, School of Electrical Engineering (EES), Centres, ACCESS Linnaeus Centre.ORCID iD: 0000-0002-2680-9065
2013 (English)Licentiate thesis, monograph (Other academic)
Abstract [en]

Over the last decade, Internet-based services, such as electronic-mail, music-on-demand, and social-network services, have changed the ways we communicate and access information. Usually, the key functionality of such a service is in backend components, which are located in a data center, a facility for hosting computing systems and related equipment. This thesis focuses on two fundamental problems related to the management, dimensioning, and provisioning of such backend components.

The first problem centers around resource allocation for a large-scale cloud environment. Data centers have become very large; they often contain hundreds of thousands of machines and applications. In such a data center, resource allocation cannot be efficiently achieved through a traditional management system that is centralized in nature. Therefore, a more scalable solution is needed. To address this problem, we have developed and evaluated a scalable and generic protocol for resource allocation. The protocol is generic in the sense that it can be instantiated for different management objectives through objective functions. The protocol jointly allocates CPU, memory, and network resources to applications that are hosted by the cloud. We prove that the protocol converges to a solution, if an objective function satisfies a certain property. We perform a simulation study of the protocol for realistic scenarios. Simulation results suggest that the quality of the allocation is independent of the system size, up to 100,000 machines and applications, for the management objectives considered.

The second problem is related to performance modeling of a distributed key-value store. The specific distributed key-value store we focus on in this thesis is the Spotify storage system. Understanding the performance of the Spotify storage system is essential for achieving a key quality of service objective, namely that the playback latency of a song is sufficiently low. To address this problem, we have developed and evaluated models for predicting the performance of a distributed key-value store for a lightly loaded system. First, we developed a model that allows us to predict the response time distribution of requests. Second, we modeled the capacity of the distributed key-value store for two different object allocation policies. We evaluate the models by comparing model predictions with measurements from two different environments: our lab testbed and a Spotify operational environment. We found that the models are accurate in the sense that the prediction error, i.e., the difference between the model predictions and the measurements from the real systems, is at most 11%.

Place, publisher, year, edition, pages
Stockholm: KTH Royal Institute of Technology, 2013. , 100 p.
Trita-EE, ISSN 1653-5146 ; 2013:036
Keyword [en]
Cloud computing, distributed management, resource allocation, gossip protocols, management objectives, distributed object store, object allocation policy
National Category
Computer Systems
URN: urn:nbn:se:kth:diva-129296ISBN: 978-91-7501-882-9OAI: diva2:651374
2013-10-25, Q2, Osquldas väg, KTH, Stockholm, 10:00 (English)

QC 20131001

Available from: 2013-10-01 Created: 2013-09-25 Last updated: 2013-10-01Bibliographically approved

Open Access in DiVA

fulltext(2656 kB)545 downloads
File information
File name FULLTEXT01.pdfFile size 2656 kBChecksum SHA-512
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Yanggratoke, Rerngvit
By organisation
Communication NetworksACCESS Linnaeus Centre
Computer Systems

Search outside of DiVA

GoogleGoogle Scholar
Total: 545 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 323 hits
ReferencesLink to record
Permanent link

Direct link