Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Optimization of Data Warehouse Design and Architecture
KTH, School of Information and Communication Technology (ICT).
2011 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

A huge number of SCANIA trucks and busses are running on the roads. Unlike the trucks and buses of the past, they are hi-tech vehicles, carrying a lot of technical and operational information that pertains to different aspects like load statistics, driving time, engine-speed over time and much more.

This information is fed into an analysis system where it is organized to support analytical questions. Over the period of time this system has become overloaded and needs to be optimized. There are a number of areas identified that can be considered for improvement. However, it is not possible to analyze the whole system within the given constraints. A subset is picked which has been thought of to be sufficient for the purpose of the thesis.

The system takes a lot of time to load new data. Data loading is not incremental. There is a lot of redundancy in the storage structure. Query execution takes a lot of time in some parts of the database.

The methods chosen for this thesis includes data warehouse design and architecture analysis, end user queries review, and code analysis.

A potential solution is presented to reduce the storage space requirements and maintenance time taken by the databases. This is achieved by presenting a solution to reduce the number of databases maintained in parallel and contains duplicated data.

Some optimizations have been made in the storage structure and design to improve the query processing time for the end users.

An example incremental loading strategy is also implemented to demonstrate the working and idea. This helps in the reduction of loading time.

Moreover, An investigation has been made into a commercially available Data warehouse management System. The investigation is mostly based on hardware architecture and how it can contributes to better performance. This portion is only theoretical.

Based on the analysis recommendations are made regarding the architecture and design of the data warehouse.

Place, publisher, year, edition, pages
2011. , 45 p.
Series
Trita-ICT-EX, 126
Identifiers
URN: urn:nbn:se:kth:diva-37233OAI: oai:DiVA.org:kth-37233DiVA: diva2:432740
Subject / course
Information and Software Systems
Educational program
Master of Science - Software Engineering of Distributed Systems
Uppsok
Technology
Examiners
Available from: 2011-08-05 Created: 2011-08-05 Last updated: 2011-08-05Bibliographically approved

Open Access in DiVA

fulltext(714 kB)307 downloads
File information
File name FULLTEXT01.pdfFile size 714 kBChecksum SHA-512
dc5350c5a2e522f217fa3a9bb67dcf15078cf1071fe231d8ee2860bafa890ee3eb76f7dfb14a2b0ff3bbb851b0e935bdd64cfc89458f96cbcd7bec00c2d91432
Type fulltextMimetype application/pdf

By organisation
School of Information and Communication Technology (ICT)

Search outside of DiVA

GoogleGoogle Scholar
Total: 307 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 640 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf