Locality-aware Scheduling and Characterization of Task-based Programs
2014 (English) Licentiate thesis, comprehensive summary (Other academic)
Modern computer architectures expose an increasing number of parallel features supported by complex memory access and communication structures. Currently used task scheduling techniques perform poorly because they focus solely on balancing computation load across parallel features and remain oblivious to the locality properties of the support structures. We contribute locality-aware task scheduling mechanisms that improve execution time on average by 44% and 11%, respectively, on two locality-sensitive architectures: the Tilera TILEPro64 manycore processor and a four-socket SMP machine based on AMD Opteron 6172 processors.
Programmers need task performance metrics, such as the amount of task parallelism and task memory hierarchy utilization, to analyze the performance of task-based programs. However, existing tools report performance mainly through thread-centric metrics, so programmers resort to low-level, tedious thread-centric analysis methods to infer task performance. We contribute tools and methods that characterize task-based OpenMP programs at the level of tasks. With them, programmers can quickly understand important properties of the task graph, such as critical path and parallelism, as well as properties of individual tasks, such as instruction count and memory behavior.
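As an illustration of the task-graph properties named in the abstract, the following minimal sketch computes total work, critical path length, and average parallelism (work divided by critical path) for a small task graph. It is not the thesis's tooling: the function name `characterize`, the task names, and the per-task costs are hypothetical and chosen only for this example.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def characterize(task_cost, deps):
    """Return (work, critical_path, parallelism) for a task graph.

    task_cost: dict mapping task name -> cost (e.g. an instruction count)
    deps:      dict mapping task name -> set of tasks it must wait for
    Both inputs are hypothetical stand-ins for profiled data.
    """
    graph = {task: deps.get(task, ()) for task in task_cost}
    finish = {}
    # static_order() yields every task after all of its predecessors.
    for task in TopologicalSorter(graph).static_order():
        start = max((finish[p] for p in graph[task]), default=0)
        finish[task] = start + task_cost[task]
    work = sum(task_cost.values())        # total cost of all tasks
    critical_path = max(finish.values())  # longest dependency chain
    return work, critical_path, work / critical_path


# Diamond-shaped graph: "a" spawns "b" and "c", and "d" waits for both.
costs = {"a": 2, "b": 3, "c": 4, "d": 1}
deps = {"b": {"a"}, "c": {"a"}, "d": {"b", "c"}}
work, critical_path, parallelism = characterize(costs, deps)
print(work, critical_path, round(parallelism, 2))
```

In this example the critical path runs through "a", "c", and "d", so a parallelism value close to 1 would indicate that adding cores cannot shorten execution much.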
Place, publisher, year, edition, pages
Stockholm: KTH Royal Institute of Technology, 2014. vi, 29 p.
TRITA-ICT-ECS AVH, ISSN 1653-6363 ; 14:01
Locality-aware, Task scheduling, OpenMP
Research subject Information and Communication Technology
Identifiers URN: urn:nbn:se:kth:diva-141124; ISBN: 978-91-7501-994-9; OAI: oai:DiVA.org:kth-141124; DiVA: diva2:694751
2014-03-05, Sal/Hall E, Forum, KTH-ICT, Isafjordsgatan 39, Kista, 10:00 (English)
Kessler, Christoph, Professor
Brorsson, Mats, Professor
QC 20140212; 2014-02-12; 2014-02-07; 2014-02-12; Bibliographically approved
List of papers