Performance tuning of small scale shared memory multiprocessor applications using visualisation
1997 (English) Conference paper (Refereed)
Even though shared memory multiprocessors are becoming more and more common, it is still a formidable task to achieve high performance on parallel applications. One of the main reasons for this is the large amount of implicit communication that a poorly structured program generates. This article shows the importance of performance visualisation for spotting cache coherence bottlenecks and locating their source. This is exemplified by a performance analysis tool, SM-prof, that visualises accesses to shared data structures so that problematic access patterns are highlighted. SM-prof maintains links from the visualisation to the actual source code lines responsible for the accesses. In contrast to earlier approaches, SM-prof shows the inherent data sharing of the application that would occur in any shared memory architecture. We demonstrate the merits of SM-prof by means of two detailed case studies.
Place, publisher, year, edition, pages
1997. p. 155-162
Identifiers
URN: urn:nbn:se:kth:diva-109659
OAI: oai:DiVA.org:kth-109659
DiVA: diva2:584916
Proceedings of 10th International Conference on Parallel and Distributed Computing Systems
NR 20140805
2013-01-09, 2013-01-08, 2013-01-09
Bibliographically approved