首页> 外文期刊>Comptes rendus. Mecanique >The fast multipole method on parallel clusters, multicore processors, and graphics processing units
【24h】

The fast multipole method on parallel clusters, multicore processors, and graphics processing units

机译:并行集群,多核处理器和图形处理单元上的快速多极方法

获取原文
获取原文并翻译 | 示例
           

摘要

In this article, we discuss how the fast multipole method (FMM) can be implemented on modern parallel computers, ranging from computer clusters to multicore processors and graphics cards (GPU). The FMM is a somewhat difficult application for parallel computing because of its tree structure and the fact that it requires many complex operations which are not regularly structured. Computational linear algebra with dense matrices for example allows many optimizations that leverage the regular computation pattern. FMM can be similarly optimized but we will see that the complexity of the optimization steps is greater. The discussion will start with a general presentation of FMMs. We briefly discuss parallel methods for the FMM, such as building the FMM tree in parallel, and reducing communication during the FMM procedure. Finally, we will focus on porting and optimizing the FMM on GPUs.
机译:在本文中,我们讨论了如何在现代并行计算机上实现快速多极方法(FMM),该并行计算机的范围从计算机群集到多核处理器和图形卡(GPU)。 FMM由于其树状结构以及需要许多没有规则结构的复杂操作的事实,因此对于并行计算而言,它有些困难。例如,具有密集矩阵的计算线性代数允许进行许多利用常规计算模式的优化。可以类似地优化FMM,但是我们会看到优化步骤的复杂性更大。讨论将从FMM的一般介绍开始。我们简要讨论了FMM的并行方法,例如并行构建FMM树,以及在FMM过程中减少通信。最后,我们将重点介绍在GPU上移植和优化FMM。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号