Conference: Annual ACM/IEEE International Symposium on Microarchitecture

Profile-driven instruction level parallel scheduling with application to super blocks


Abstract

Code scheduling to exploit instruction-level parallelism (ILP) is a critical problem in compiler optimization research in light of the increased use of long-instruction-word machines. Unfortunately, optimal scheduling is computationally intractable, and one must resort to carefully crafted heuristics in practice. If the scope of a scheduling heuristic is limited to basic blocks, considerable performance loss may be incurred at block boundaries. To overcome this obstacle, basic blocks can be coalesced across branches to form larger regions such as super blocks. In the literature, these regions are typically scheduled using algorithms that are either oblivious to profile information (under the assumption that the process of forming the region has fully utilized the profile information) or use the profile information as an addendum to classical scheduling techniques. We believe that even for the simple case of linear code regions such as super blocks, additional performance improvement can be gained by utilizing the profile information in scheduling as well. We propose a general paradigm for converting any profile-insensitive list scheduler into a profile-sensitive scheduler. Our technique is developed via a theoretical analysis of a simplified abstract model of the general problem of profile-driven scheduling over any acyclic code region, yielding a scoring measure for ranking branch instructions.
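The abstract does not state the paper's scoring measure, so the following Python sketch is only an illustration of the general idea, not the authors' algorithm. It assumes a toy super block in which each side exit carries a profiled exit probability, and it turns a critical-path list scheduler into a profile-aware one by ranking operations first by the total exit probability they feed and then by the original priority. All names (`Op`, `exit_weight`, `make_profile_sensitive`, `demo_superblock`) and the weighting rule itself are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Op:
    name: str
    latency: int = 1
    preds: list = field(default_factory=list)  # names of ops this op depends on
    exit_prob: float = 0.0                     # > 0 only for side-exit branches


def successors(ops):
    succs = {name: [] for name in ops}
    for op in ops.values():
        for p in op.preds:
            succs[p].append(op.name)
    return succs


def critical_path_height(ops):
    """Profile-insensitive priority: longest latency-weighted path to a sink."""
    succs, memo = successors(ops), {}

    def height(n):
        if n not in memo:
            memo[n] = ops[n].latency + max((height(s) for s in succs[n]), default=0)
        return memo[n]

    return {n: height(n) for n in ops}


def exit_weight(ops):
    """Assumed score: total probability of the exits that depend on each op."""
    succs, memo = successors(ops), {}

    def exits(n):
        if n not in memo:
            found = {n} if ops[n].exit_prob > 0 else set()
            for s in succs[n]:
                found |= exits(s)
            memo[n] = found
        return memo[n]

    return {n: sum(ops[b].exit_prob for b in exits(n)) for n in ops}


def make_profile_sensitive(ops, base_priority):
    """Rank by exit weight first; fall back to the base priority on ties."""
    weight = exit_weight(ops)
    return {n: (weight[n], base_priority[n]) for n in ops}


def list_schedule(ops, priority, width=1):
    """Greedy cycle-by-cycle list scheduling; returns the issue cycle per op."""
    issue, cycle = {}, 0
    while len(issue) < len(ops):
        ready = [n for n in ops if n not in issue and all(
            p in issue and issue[p] + ops[p].latency <= cycle for p in ops[n].preds)]
        ready.sort(key=lambda n: priority[n], reverse=True)  # stable: ties keep program order
        for n in ready[:width]:
            issue[n] = cycle
        cycle += 1
    return issue


def expected_exit_cycle(ops, issue):
    """Probability-weighted completion cycle over all exits."""
    return sum(op.exit_prob * (issue[op.name] + op.latency)
               for op in ops.values() if op.exit_prob > 0)


def demo_superblock():
    """Hot early exit (br1) behind a short chain; cold final exit (br2) behind a long one."""
    ops = [Op("a"), Op("br1", preds=["a"], exit_prob=0.9),
           Op("b"), Op("c", preds=["b"]), Op("d", preds=["c"]),
           Op("br2", preds=["d", "br1"], exit_prob=0.1)]
    return {op.name: op for op in ops}


if __name__ == "__main__":
    ops = demo_superblock()
    base = critical_path_height(ops)
    for label, prio in [("critical path", base),
                        ("profile-aware", make_profile_sensitive(ops, base))]:
        sched = list_schedule(ops, prio)
        print(label, sched, round(expected_exit_cycle(ops, sched), 2))
```

On this single-issue toy example, the critical-path scheduler favors the long chain that leads to the rarely taken final exit and delays the hot side exit, while the profile-aware ranking issues the hot exit first and lowers the expected exit cycle. The example is constructed to show that effect and says nothing about the gains reported in the paper.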
