首页> 外文会议>Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture >Portable compiler optimisation across embedded programs and microarchitectures using machine learning
【24h】

Portable compiler optimisation across embedded programs and microarchitectures using machine learning

机译:使用机器学习跨嵌入式程序和微体系结构进行可移植编译器优化

获取原文

摘要

Building an optimising compiler is a difficult and time consuming task which must be repeated for each generation of a microprocessor. As the underlying microarchitecture changes from one generation to the next, the compiler must be retuned to optimise specifically for that new system. It may take several releases of the compiler to effectively exploit a processor's performance potential, by which time a new generation has appeared and the process starts again. We address this challenge by developing a portable optimising compiler. Our approach employs machine learning to automatically learn the best optimisations to apply for any new program on a new microarchitectural configuration. It achieves this by learning a model off-line which maps a microarchitecture description plus the hardware counters from a single run of the program to the best compiler optimisation passes. Our compiler gains 67% of the maximum speedup obtainable by an iterative compiler search using 1000 evaluations. We obtain, on average, a 1.16x speedup over the highest default optimisation level across an entire microarchitecture configuration space, achieving a 4.3x speedup in the best case. We demonstrate the robustness of this technique by applying it to an extended microarchitectural space where we achieve comparable performance.
机译:构建优化的编译器是一项艰巨且耗时的工作,每代微处理器都必须重复进行。随着底层微体系结构从一代到下一代的变化,必须重新编译编译器,以针对该新系统进行专门的优化。要有效地利用处理器的性能潜力,可能需要花费几个发行版本的编译器,届时,将出现新一代的处理器,并且该过程将再次开始。我们通过开发可移植的优化编译器来应对这一挑战。我们的方法利用机器学习来自动学习最佳优化,以便在新的微体系结构配置上申请任何新程序。它是通过离线学习模型来实现的,该模型将微体系结构描述以及硬件计数器从程序的一次运行映射到最佳的编译器优化过程。通过使用1000个求值进行迭代编译器搜索,我们的编译器获得了最大加速的67%。在整个微体系结构配置空间中,我们平均在最高默认优化级别上平均获得了1.16倍的加速,在最佳情况下达到了4.3倍的加速。我们通过将其应用于扩展的微体系结构空间来证明该技术的鲁棒性,在该空间中我们可以实现可比的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号