Fine-grained Locality-aware Parallel Scheme for Anisotropic Mesh Adaptation

Hoby Rakotoarivelo, Franck Ledoux, Franck Pommereau
2016 Procedia Engineering  
In this paper, we provide a fine-grained parallel scheme for anisotropic mesh adaptation on NUMA 1 architectures. Data dependencies are expressed by a graph for each kernel, and concurrency is extracted through fine-grained graph coloring. Tasks are structured into bulk-synchronous steps to avoid data races and to aggregate shared-data accesses. To ensure performance prediction, time cost and load imbalance are theoretically characterized. The devised scheme was evaluated on a 4 NUMA node
more » ... ket) machine, and a mean efficiency of 70% was reached on 32 cores for 3 kernels out of 4. The impact of irregular degree distribution and data layout on scalability is highlighted. c 2016 The Authors. Published by Elsevier Ltd. Peer-review under responsibility of organizing committee of the 25 th International Meshing Roundtable (IMR25).
doi:10.1016/j.proeng.2016.11.035 fatcat:zom63bc53bc43mn42dbsnebvuq