【24h】

Cluster Assignment of Global Values for Clustered VLIW Processors

机译:群集VLIW处理器的全局值的群集分配

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In this paper high-level language (HLL) variables that are alive in a whole HLL function, across multiple scheduling units, are termed as global values. Due to their long live ranges and, hence, large impact on the schedule, the global values require different compiler optimizations than local values, which span across only one scheduling unit. The instruction scheduler for a clustered ILP processor, which is responsible for cluster assignment of operations and variables, faces a difficult problem of assigning global values to clusters. Our study shows that trivial assignments (e.g. mapping all global values into one cluster) may result in a severe cycle count overhead relative to the unicluster of up to 26.3% for a four cluster VLIW machine. This paper presents three advanced algorithms for assigning global values to clusters based on multi-pass scheduling and affinity of variables. Furthermore, we measure performance of these algorithms on optimized multimedia C applications and assess quality of our algorithms by comparing them to a practical higher performance bound derived from a vast random search. Our algorithms reduce the execution time overhead of the best simple algorithm round-robin from 10.5% to 5.9% for the two cluster VLIW machine and from 17.3% to 14.12% for the four cluster VLIW machine.
机译:在本文中,跨多个调度单元存在于整个HLL函数中的高级语言(HLL)变量被称为全局值。由于它们的有效期很长,因此对调度的影响很大,因此全局值需要的编译器优化与局部值的编译器优化不同,局部值仅跨一个调度单元。负责操作和变量的群集分配的群集ILP处理器的指令调度程序面临将全局值分配给群集的难题。我们的研究表明,琐碎的分配(例如将所有全局值映射到一个群集中)可能会导致严重的周期计数开销,相对于四群集VLIW机器的高达26.3%的非分散性而言。本文提出了三种基于多次遍历调度和变量亲和力将全局值分配给群集的高级算法。此外,我们在优化的多媒体C应用程序上测量这些算法的性能,并通过将它们与从大量随机搜索中得出的实际更高的性能范围进行比较,来评估算法的质量。我们的算法将最佳简单算法循环的执行时间开销从两群集VLIW机器的10.5%降低到5.9%,将四群集VLIW机器的执行时间从17.3%降低到14.12%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号