首页> 外文会议>International symposium on Systems synthesis >Phase coupled operation assignment for VLIW processors with distributed register files
【24h】

Phase coupled operation assignment for VLIW processors with distributed register files

机译:具有分布式寄存器文件的VLIW处理器的相位耦合操作分配

获取原文

摘要

The ever increasing complexity of signal processing applications and the desire to reduce the time to market demands efficient compilation techniques for programmable Digital Signal Processors (DSPs). Because the instruction sets of VLIW processors are more regular and orthogonal then the instruction sets of the traditional DSPs, they tend to be more compiler friendly. However in order to improve the maximal clock frequency, the power efficiency and the code density, several register files are used in the newer generation of VLIW processors instead of just one large register file. The use of several register files and partial connected networks leads, for example, to the problem that a result stored in a register file can not be accessed by all the functional units in the processor. This makes the assignment of operations to functional units a difficult task for the compiler.This paper describes the constraint analysis based operation assignment techniques intended to deal with processors with distributed register files and partially connected networks. The assignment techniques have been implemented in our code generation tool FACTS [8]. This tool is intended for the generation of an operation assignment, a register binding and a schedule of folded loops that satisfy the specified timing constraints. Our approach is based on satisfaction of constraints which makes it different from the optimisation based operation assignment techniques [4][2][3] which are known from literature. The operation assignment technique is based on the modeling of the assignment search space in a conflict graph. Pruning of this conflict graph prevents decisions that inevitably lead to solutions that do not satisfy the timing constraints. If after pruning infeasibility is detected, backtracking of assignment decisions is performed. In order to obtain a tight coupling between the assignment phase and the schedule phase information is derived from the conflict graph which is used to prune the schedule search space and information from the schedule search space is incorporated in the conflict graph. Automatic insertion of copy operations for moving intermediate values from one register file to another register files is not supported. However the use of a shared global bus in the processor guarantees that at least one direct communication path from a producing functional unit to a consuming functional unit exists and therefore the use of copy operations is not necessary.
机译:信号处理应用程序的复杂性和减少市场时间的愿望需要可编程数字信号处理器(DSP)的有效编译技术。因为VLIW处理器的指令集更加常规和正交,那么传统DSP的指令集,它们往往更加编译。然而,为了提高最大时钟频率,功率效率和代码密度,在较新的VLIW处理器中使用了几个寄存器文件而不是仅仅是一个大寄存器文件。例如,使用多个寄存器文件和部分连接的网络引导到存储在寄存器文件中的结果无法被处理器中的所有功能单元访问的问题。这使得运作单位的作业分配给编译器的困难任务。本文介绍了基于的约束分析旨在处理具有分布式寄存器文件和部分连接的网络的处理器的操作分配技术。在我们的代码生成工具事实中实现了分配技术[8]。该工具用于生成操作分配,寄存器绑定和满足指定定时约束的折叠循环的计划。我们的方法基于对限制的满意度,这使得与文献中已知的基于优化的操作分配技术[4] [2] [3]不同。操作分配技术基于冲突图中的分配搜索空间的建模。这种冲突图的修剪可以防止不可避免地导致不满足时序约束的解决方案的决定。如果在检测到灌注后,则执行分配决策的回溯。为了在分配阶段和调度阶段信息之间获得紧密的耦合,从用于将时间表搜索空间和来自日程搜索空间的信息汇总的冲突图,在冲突图中结合在冲突图中。不支持自动插入复制操作,以将中间值从一个寄存器文件移动到另一个寄存器文件。然而,在处理器中使用共享全局总线保证从生成功能单元到消费功能单元的至少一个直接通信路径存在,因此不需要使用复制操作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号