首页> 外文学位 >On the role of search in generating high-performance BLAS libraries.
【24h】

On the role of search in generating high-performance BLAS libraries.

机译:关于搜索在生成高性能BLAS库中的作用。

获取原文
获取原文并翻译 | 示例

摘要

A key step in program optimization is the estimation of optimal values for parameters such as tile sizes and loop unrolling factors. Traditional compilers use simple analytical models to compute these values. In contrast, library generators like ATLAS use global search over the space of parameter values by generating programs with many different combinations of parameter values, and running them on the actual hardware to determine which values give the best performance. It is widely believed that traditional model-driven optimization cannot compete with search-based empirical optimization because tractable analytical models cannot capture all the complexities of modern high-performance architectures. This thesis disproves this belief.; In this work we replaced the global search engine in ATLAS with a model-driven optimization engine, and measured the relative performance of the code produced by the two systems on a variety of architectures. Our experiments show that model-driven optimization can be surprisingly effective., and can generate code with performance comparable to that of code generated by ATLAS using global search. For improving the code further, we advocate complementing modelling with local search and model refinement.; Model-driven optimization needs accurate values of hardware parameters. In this thesis we also describe X-Ray, a robust framework of micro-benchmark for measuring such hardware parameters. X-Ray is designed to be extensible to make it easy to implement new micro-benchmarks. We have developed novel algorithms for measuring hardware parameters commonly used in optimizing software performance, and we have implemented them in X-Ray. We evaluate X-Ray experimentally on traditional workstations and servers as well as on embedded architectures, and show that it produces more accurate and complete results than existing tools.
机译:程序优化中的关键步骤是估计参数的最佳值,例如图块大小和循环展开因子。传统的编译器使用简单的分析模型来计算这些值。相反,像ATLAS这样的库生成器通过生成具有许多参数值组合的程序,并在实际硬件上运行它们以确定哪些值提供最佳性能,从而对参数值空间进行全局搜索。人们普遍认为,传统的模型驱动优化无法与基于搜索的经验优化竞争,因为易处理的分析模型无法捕获现代高性能体系结构的所有复杂性。本论文证明了这一信念。在这项工作中,我们用模型驱动的优化引擎替换了ATLAS中的全局搜索引擎,并测量了两种系统在各种体系结构上生成的代码的相对性能。我们的实验表明,模型驱动的优化可能出奇地有效,并且可以生成性能与ATLAS使用全局搜索生成的代码相当的代码。为了进一步改进代码,我们主张通过局部搜索和模型优化来补充建模。模型驱动的优化需要准确的硬件参数值。在本文中,我们还描述了X射线,这是一种用于测量此类硬件参数的强大的微基准测试框架。 X射线被设计为可扩展的,以使其易于实现新的微基准。我们已经开发了用于测量通常用于优化软件性能的硬件参数的新颖算法,并且已在X射线中实现它们。我们在传统的工作站和服务器以及嵌入式体系结构上对X射线进行了实验评估,结果表明,与现有工具相比,X射线产生的结果更加准确和完整。

著录项

  • 作者

    Yotov, Kamen Yotov.;

  • 作者单位

    Cornell University.;

  • 授予单位 Cornell University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2006
  • 页码 211 p.
  • 总页数 211
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号