Change search
ReferencesLink to record
Permanent link

Direct link
Design and Implementation of aHeterogeneous Multicore Architectureusing Field Programmable Technology
KTH, School of Information and Communication Technology (ICT).
2013 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

Latest trend in multi core architectures is to integrate heterogeneouscores on a single chip in order to achieve task and threadlevel parallelism, high performance and energy efficiency. Someexamples of heterogeneous multi cores processors include (Tegraby NVIDIA,Cell by IBM and Fusion by AMD).

The goal of this thesis work is to design a heterogeneous (2x2)network on chip which can run different tasks in parallel on allthe four cores in the network. Development steps of heterogeneousnetwork on chip include integration of Leon3 -a soft core processorby AeroFlex Gaisler which conforms with IEEE 1754 (SPARCV8) architecture- at one of the nodes of a homogeneous networkon chip incorporating four NiosII/s cores -soft core processor byAltera.This integration involves replacing a NiosII/s processor fromone of the four nodes of the homogeneous network by Leon3 processor.To translate the signals between the resource to networkinterface of the node and the Leon3 processor an AMBA bus1 toAvalon bus2 signal translation wrapper was designed. All processorsin the network on chip communicate by message passing interface.To exploit the potential of heterogeneous network on chipthree applications including sparse LU factorization, nqueens andFibonacci numbers calculation were run on it. These applicationwere run on Leon3 SPARC which generated a number of tasks thatcan run in parallel on all cores of the network simultaneously. Thisparallel execution of nqueens and fibonacci numbers calculationhas resulted in speed up as compared to the serial execution ofthese applications on Leon3 SPARC only. Because of the limitedsize of the on chip memory available for the Leon3 processor, itwas not possible to run sparse LU factorization for bigger matrixsizes and this constraint has resulted in no speed up in case ofsparse LU factorization.

Place, publisher, year, edition, pages
2013. , 164 p.
Trita-ICT-EX, 2013:67
Keyword [en]
National Category
Engineering and Technology
URN: urn:nbn:se:kth:diva-121697OAI: diva2:619396
Educational program
Master of Science - System-on-Chip Design
Available from: 2013-05-03 Created: 2013-05-03 Last updated: 2013-05-03Bibliographically approved

Open Access in DiVA

fulltext(699 kB)450 downloads
File information
File name FULLTEXT01.pdfFile size 699 kBChecksum SHA-512
Type fulltextMimetype application/pdf

By organisation
School of Information and Communication Technology (ICT)
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 450 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 264 hits
ReferencesLink to record
Permanent link

Direct link