首页> 外文OA文献 >Renumbering unstructured grids to improve the performance of codes on hierarchical memory machines
【2h】

Renumbering unstructured grids to improve the performance of codes on hierarchical memory machines

机译:重新编号非结构化网格以提高分层存储计算机上代码的性能

摘要

The performance of unstructured grid codes on workstations and distributed memory parallel computers is substantially affected by the efficiency of the memory hierarchy. This efficiency essentially depends on the order of computation and numbering of the grid. Most grid generators do not take into account the effect of the memory hierarchy when producing grids so application programmers must renumber grids to improve the performance of their codes. To design a good renumbering scheme a detailed runtime analysis of the data movement in an application code is needed. Thus, a memory hierarchy simulator has been developed to analyse the effect of existing renumbering schemes such as bandwidth reduction, the Greedy method, colouring, random numbering and the original numbering produced by the grid generator. The renumbering is applied to either vertices, edges, faces or cells and two algorithms are proposed to consistently renumber the other entities used in the solver. The simulated and actual timings show that bandwidth reduction and Greedy methods give the best performance on IBM RS/6000, SGI Indy, SGI Indigo and SGI Power Challenge machines for three-dimensional Poissons's, Maxwell's and the Euler equations solvers. The improvement in performance is over a factor of two for applications with large grids and a high ratio of memory-accesses to computation. This factor is even higher for memory hierarchies with small caches. © 1997 Elsevier Science Limited. All rights reserved.
机译:工作站和分布式内存并行计算机上的非结构化网格代码的性能在很大程度上受到内存层次结构效率的影响。该效率主要取决于网格的计算顺序和编号。大多数网格生成器在生成网格时并未考虑内存层次结构的影响,因此应用程序程序员必须对网格重新编号以提高其代码的性能。为了设计好的重编号方案,需要对应用程序代码中的数据移动进行详细的运行时分析。因此,已经开发了存储器层次结构模拟器来分析现有重编号方案的效果,例如带宽减少,贪婪方法,着色,随机编号和网格生成器产生的原始编号。将重编号应用于顶点,边,面或像元,并提出了两种算法来一致地重编号求解器中使用的其他实体。仿真和实际时序显示,带宽减少和贪婪方法在IBM RS / 6000,SGI Indy,SGI Indigo和SGI Power Challenge机器上为三维Poissons,Maxwell和Euler方程求解器提供了最佳性能。对于具有较大网格和内存访问与计算的比率很高的应用程序,性能的提高超过了两倍。对于具有小缓存的内存层次结构,此因素甚至更高。 ©1997爱思唯尔科学有限公司。版权所有。

著录项

  • 作者

    Burgess DA; Giles MB;

  • 作者单位
  • 年度 1997
  • 总页数
  • 原文格式 PDF
  • 正文语种
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号