【24h】

Scalable Distributed Fast Multipole Methods

机译:可扩展的分布式快速多极方法

获取原文

摘要

The Fast Multipole Method (FMM) allows $O(N)$ evaluation to any arbitrary precision of $N$-body interactions that arises in many scientific contexts. These methods have been parallelized, with a recent set of papers attempting to parallelize them on heterogeneous CPU/GPU architectures cite{Qi11:SC11}. While impressive performance was reported, the algorithms did not demonstrate complete weak or strong scalability. Further, the algorithms were not demonstrated on nonuniform distributions of particles that arise in practice. In this paper, we develop an efficient scalable version of the FMM that can be scaled well on many heterogeneous nodes for nonuniform data. Key contributions of our work are data structures that allow uniform work distribution over multiple computing nodes, and that minimize the communication cost. These new data structures are computed using a parallel algorithm, and only require a small additional computation overhead. Numerical simulations on a heterogeneous cluster empirically demonstrate the performance of our algorithm.
机译:快速的多极方法(FMM)允许$ O(n)$评估在许多科学环境中出现的N $ -body互动的任何任意精度。这些方法已经并行化,其中一组尝试在异构CPU / GPU架构中并将其平行化它们Cite {QI11:SC11}。虽然报告了令人印象深刻的表现,但该算法没有表现出完全弱或强大的可扩展性。此外,在实践中产生的颗粒的非均匀分布上未证明该算法。在本文中,我们开发了一个有效的可扩展版本,可以在许多异构节点上进行缩放,用于非均匀数据。我们的工作的主要贡献是允许在多个计算节点上均匀的工作分布,并最大限度地减少通信成本的数据结构。使用并行算法计算这些新数据结构,并且仅需要小额额外的计算开销。异构集群的数值模拟经验证明了算法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号