首页> 外文期刊>Journal of supercomputing >Optimizing inter-nest data locality in imperfect stencils based on loop blocking
【24h】

Optimizing inter-nest data locality in imperfect stencils based on loop blocking

机译:基于循环阻塞优化不完善模具中的嵌套数据局部性

获取原文
获取原文并翻译 | 示例
       

摘要

With the interesting growth in high-performance computing, the performance of data-driven programs is becoming more and more dependent on fast memory access, which can be improved by data locality. Data locality between a pair of loop nest is called inter-nest data locality. A very important class of loop nests that shows a significant inter-nest data locality is stencils. In this paper, we have proposed a method to optimize inter-nest data locality in stencils and named it EALB. In the proposed method, two “compute” and “copy” loop nests within the time loop nest of stencils are partitioned into blocks and these blocks are executed interleaved. Determining the optimum block size in the proposed method is based on an evolutionary algorithm which uses cache miss rate and cache eviction rate. The experimental results show that the EALB is significantly effective compared to the original programs and has better results compared to the results of the state-of-the-art approach, Pluto.
机译:随着高性能计算的迅猛增长,数据驱动程序的性能越来越依赖于快速内存访问,可以通过数据局部性来改善这种访问。一对循环嵌套之间的数据局部性称为嵌套数据局部性。模版是一类非常重要的循环嵌套,它显示了重要的嵌套数据局部性。在本文中,我们提出了一种优化模板中嵌套数据位置的方法,并将其命名为EALB。在提出的方法中,模板的时间循环嵌套中的两个“计算”和“复制”循环嵌套被划分为多个块,并且这些块交错执行。在提出的方法中确定最佳块大小是基于使用高速缓存未命中率和高速缓存逐出率的进化算法。实验结果表明,与原始程序相比,EALB效果显着,并且与最新方法Pluto的结果相比,EALB具有更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号