...
首页> 外文期刊>Journal of Parallel and Distributed Computing >Empirical performance model-driven data layout optimization and library call selection for tensor contraction expressions
【24h】

Empirical performance model-driven data layout optimization and library call selection for tensor contraction expressions

机译:基于经验性能模型的数据布局优化和张量收缩表达式的库调用选择

获取原文
获取原文并翻译 | 示例
           

摘要

Empirical optimizers like ATLAS have been very effective in optimizing computational kernels in libraries. The best choice of parameters such as tile size and degree of loop unrolling is determined in ATLAS by executing different versions of the computation. In contrast, optimizing compilers use a modeldriven approach to program transformation. While the model-driven approach of optimizing compilers is generally orders of magnitude faster than ATLAS-like library generators, its effectiveness can be limited by the accuracy of the performance models used. In this paper, we describe an approach where a class of computations is modeled in terms of constituent operations that are empirically measured, thereby allowing modeling of the overall execution time. The performance model with empirically determined cost components is used to select library calls and choose data layout transformations in the context of the Tensor Contraction Engine, a compiler for a high-level domain-specific language for expressing computational models in quantum chemistry. The effectiveness of the approach is demonstrated through experimental measurements on representative computations from quantum chemistry.
机译:像ATLAS这样的经验优化器在优化库中的计算内核方面非常有效。通过执行不同版本的计算,可以在ATLAS中确定参数的最佳选择,例如瓦片大小和循环展开程度。相反,优化编译器使用模型驱动的方法进行程序转换。尽管由模型驱动的优化编译器方法通常比类似ATLAS的库生成器快几个数量级,但其有效性可能会受到所使用性能模型准确性的限制。在本文中,我们描述了一种方法,其中根据经验测量的组成操作对一类计算进行建模,从而允许对总体执行时间进行建模。具有经验确定的成本成分的性能模型用于选择库调用并在Tensor Contracting Engine的上下文中选择数据布局转换,Tensor Contracting Engine是用于在量子化学中表达计算模型的高级领域特定语言的编译器。通过对来自量子化学的代表性计算的实验测量证明了该方法的有效性。

著录项

  • 来源
    《Journal of Parallel and Distributed Computing》 |2012年第3期|p.338-352|共15页
  • 作者单位

    Department of Computer Science and Engineering, The Ohio State University, Columbus, OH 43210, USA,Software and Services Croup, Intel Corporation, 2111 NE 25th Ave., Hillsboro, OR 97124, USA.;

    Department of Computer Science and Engineering, The Ohio State University, Columbus, OH 43210, USA,IBM Silicon Valley Lab., 555 Bailey Ave., San Jose, CA 95141, USA.;

    Department of Computer Science and Engineering, The Ohio State University, Columbus, OH 43210, USA,Computational Sciences and Mathematics Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA;

    Department of Computer Science, Louisiana State University, Baton Rouge, LA 70803, USA;

    Department of Electrical and Computer Engineering, Louisiana State University, Baton Rouge, LA 70803, USA;

    Department of Computer Science and Engineering, The Ohio State University, Columbus, OH 43210, USA;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    data layout optimization; library call selection; compiler optimization; tensor contractions;

    机译:数据布局优化;库调用选择;编译器优化;张量收缩;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号