Implementing O(N) N-body algorithms efficiently in data parallel languages
1996 (English). In: Scientific Programming, ISSN 1058-9244, E-ISSN 1875-919X, Vol. 5, no 4, 337-364 p. Article in journal (Refereed), Published
The optimization techniques for hierarchical O(N) N-body algorithms described here focus on managing the data distribution and the data references, both between the memories of different nodes and within the memory hierarchy of each node. We show how the techniques can be expressed in data-parallel languages, such as High Performance Fortran (HPF) and Connection Machine Fortran (CMF). The effectiveness of our techniques is demonstrated on an implementation of Anderson's hierarchical O(N) N-body method for the Connection Machine system CM-5/5E. Communication accounts for about 10-20% of the total execution time, the average efficiency for arithmetic operations is about 40%, and the total efficiency (including communication) is about 35%. For the CM-5E, a performance in excess of 60 Mflop/s per node (peak 160 Mflop/s per node) has been measured.
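The figures quoted in the abstract can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that communication time does not overlap with arithmetic, so that total efficiency is roughly the arithmetic efficiency scaled by the share of time spent computing. That model and all names in the code are assumptions made for this example, not taken from the paper.

```python
# Illustrative consistency check of the quoted figures.
# Assumption (not from the paper): communication and arithmetic do not overlap.

PEAK_MFLOPS_PER_NODE = 160.0      # quoted CM-5E peak rate per node
MEASURED_MFLOPS_PER_NODE = 60.0   # quoted lower bound on the measured rate
ARITHMETIC_EFFICIENCY = 0.40      # quoted average arithmetic efficiency


def fraction_of_peak(measured, peak):
    """Return the measured rate as a fraction of the peak rate."""
    return measured / peak


def total_efficiency(arithmetic_efficiency, comm_fraction):
    """Scale arithmetic efficiency by the share of time spent computing."""
    return arithmetic_efficiency * (1.0 - comm_fraction)


measured_fraction = fraction_of_peak(MEASURED_MFLOPS_PER_NODE, PEAK_MFLOPS_PER_NODE)
low = total_efficiency(ARITHMETIC_EFFICIENCY, 0.20)   # 20% communication
high = total_efficiency(ARITHMETIC_EFFICIENCY, 0.10)  # 10% communication

print(f"measured fraction of peak: {measured_fraction:.3f}")
print(f"modelled total efficiency: {low:.2f} to {high:.2f}")
```

Under this simple model the quoted total efficiency of about 35% falls inside the modelled range, and the measured per-node rate is of the same order as a fraction of peak.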
Place, publisher, year, edition, pages
1996. Vol. 5, no 4, 337-364 p.
Computer and Information Science
Identifiers: URN: urn:nbn:se:kth:diva-90979; OAI: oai:DiVA.org:kth-90979; DiVA: diva2:507629
NR 20140805; 2012-03-05; 2012-03-05; Bibliographically approved