首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >A unified framework for optimizing locality, parallelism, and communication in out-of-core computations
【24h】

A unified framework for optimizing locality, parallelism, and communication in out-of-core computations

机译:一个用于优化核外计算中的局部性,并行性和通信的统一框架

获取原文
获取原文并翻译 | 示例

摘要

This paper presents a unified framework that optimizes out-of-core programs by exploiting locality and parallelism, and reducing communication overhead. For out-of-core problems where the data set sizes far exceed the size of the available in-core memory, it is particularly important to exploit the memory hierarchy by optimizing the I/O accesses. We present algorithms that consider both iteration space (loop) and data space (file layout) transformations in a unified framework. We show that the performance of an out-of-core loop nest containing references to out-of-core arrays can be improved by using a suitable combination of file layout choices and loop restructuring transformations. Our approach considers array references one-by-one and attempts to optimize each reference for parallelism and locality. When there are references for which parallelism optimizations do not work, communication is vectorized so that data transfer can be performed before the innermost loop. Results from hand-compiles on IBM SP-2 and Inter Paragon distributed-memory message-passing architectures show that this approach reduces the execution times and improves the overall speedups. In addition, we extend the base algorithm to work with file layout constraints and show how it is useful for optimizing programs that consist of multiple loop nests.
机译:本文提出了一个统一的框架,该框架通过利用局部性和并行性并减少通信开销来优化内核外程序。对于数据集大小远远超过可用内核内存的内核外问题,通过优化I / O访问来利用内存层次结构尤为重要。我们提出了在统一框架中同时考虑迭代空间(循环)和数据空间(文件布局)转换的算法。我们表明,通过使用文件布局选择和循环重构转换的适当组合,可以改善包含对核心阵列的引用的核心循环嵌套的性能。我们的方法一对一考虑数组引用,并尝试针对并行性和局部性优化每个引用。如果存在无法进行并行性优化的参考,则会对通信进行矢量化处理,以便可以在最内层循环之前执行数据传输。在IBM SP-2和Inter Paragon分布式内存消息传递体系结构上进行手工编译的结果表明,这种方法减少了执行时间并提高了整体速度。此外,我们扩展了基本算法以处理文件布局约束,并展示了它对于优化包含多个循环嵌套的程序的有用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号