Scalable Distributed Fast Multipole Methods

Qi Hu, Nail A. Gumerov, Ramani Duraiswami
2012 2012 IEEE 14th International Conference on High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems  
The Fast Multipole Method (FMM) allows O(N ) evaluation to any arbitrary precision of N -body interactions that arises in many scientific contexts. These methods have been parallelized, with a recent set of papers attempting to parallelize them on heterogeneous CPU/GPU architectures [1]. While impressive performance was reported, the algorithms did not demonstrate complete weak or strong scalability. Further, the algorithms were not demonstrated on nonuniform distributions of particles that
more » ... e in practice. In this paper, we develop an efficient scalable version of the FMM that can be scaled well on many heterogeneous nodes for nonuniform data. Key contributions of our work are data structures that allow uniform work distribution over multiple computing nodes, and that minimize the communication cost. These new data structures are computed using a parallel algorithm, and only require a small additional computation overhead. Numerical simulations on a heterogeneous cluster empirically demonstrate the performance of our algorithm.
doi:10.1109/hpcc.2012.44 dblp:conf/hpcc/HuGD12 fatcat:urstmdeybjbqdb7p77252f2npq