首页> 外文会议>European PVM/MPI(Parallel Virtual Machine and Message Passing Interface) Users' Group Meeting; 20040919-22; Budapest(HU) >On the Parallelization of a Cache-Optimal Iterative Solver for PDEs Based on Hierarchical Data Structures and Space-Filling Curves
【24h】

On the Parallelization of a Cache-Optimal Iterative Solver for PDEs Based on Hierarchical Data Structures and Space-Filling Curves

机译:基于分层数据结构和空间填充曲线的PDE缓存优化迭代求解器的并行化

获取原文
获取原文并翻译 | 示例

摘要

Competitive numerical simulation codes solving partial differential equations have to tap the full potential of both modern numerical methods - like multi-grid and adaptive grid refinement - and available computing resources. In general, these two are rival tasks. Typically, hierarchical data structures resulting from multigrid and adaptive grid refinement impede efficient usage of modern memory architectures on the one hand and complicate the efficient parallelization on the other hand due to scattered data for coaxse-level-points and unbalanced data trees. In our previous work, we managed to bring together high performance aspects in numerics as well as in hardware usage in a very satisfying way. The key to this success was to integrate space-filling curves consequently not only in the programs flow control but also in the construction of data structures which are processed linearly even for hierarchical multilevel data. In this paper, we present first results on the second challenge, namely the efficient parallelization of algorithms working on hierarchical data. It shows that with the same algorithms as desribed above, the two main demands on good parellel programs can be fulfilled in a natural way, too: The balanced data partitioning can be done quite easily and cheaply by cutting the queue of data linearized along the space-filling curve into equal pieces. Furtheron, this partitioning is quasi-optimal regarding the amount of communication. Thus, we will end up with a code that overcomes the quandary between hierarchical data and efficient memory usage and parallelization in a very natural way by a very deep integration of space-filling-curves in the underlying algorithm.
机译:解决偏微分方程的竞争性数值模拟代码必须充分利用现代数值方法(如多网格和自适应网格细化)的全部潜力以及可用的计算资源。通常,这两个是相互竞争的任务。通常,多网格和自适应网格细化所产生的分层数据结构一方面会阻碍现代内存体系结构的有效使用,另一方面会由于同轴级别点和不平衡数据树的分散数据而使高效并行化复杂化。在我们以前的工作中,我们设法以非常令人满意的方式将数字和硬件使用方面的高性能方面结合在一起。因此,成功的关键是不仅要在程序流控制中集成空间填充曲线,而且还要在构建数据结构时集成空间填充曲线,即使对于分层的多级数据也可以对它们进行线性处理。在本文中,我们提出了第二个挑战的第一个结果,即处理分层数据的算法的高效并行化。它表明,使用与上述相同的算法,也可以自然地满足对好的并行程序的两个主要要求:通过削减沿空间线性化的数据队列,可以非常轻松且经济地完成平衡的数据分区。 -将曲线填充为相等的片段。此外,关于通信量,该划分是准最优的。因此,我们将获得一个代码,该代码通过在底层算法中非常深入地集成空间填充曲线,以一种非常自然的方式克服了分层数据与有效内存使用和并行化之间的难题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号