首页> 外文学位 >Interconnect-aware scheduling and resource allocation for high-level synthesis.
【24h】

Interconnect-aware scheduling and resource allocation for high-level synthesis.

机译:用于高层综合的互连感知调度和资源分配。

获取原文
获取原文并翻译 | 示例

摘要

A high-level architectural synthesis can be described as the process of transforming a behavioral description into a structural description. The scheduling, processor allocation, and register binding are the most important tasks in the high-level synthesis. In the past, it has been possible to focus simply on the delays of the processing units in a high-level synthesis and neglect the wire delays, since the overall delay of a digital system was dominated by the delay of the logic gates. However, with the process technology being scaled down to deep-submicron region, the global interconnect delays can no longer be neglected in VLSI designs. It is, therefore, imperative to include in high-level synthesis the delays on wires and buses used to communicate data between the processing units i.e., inter-processor communication delays. Furthermore, the way the process of register binding is performed also has an impact on the complexity of the interconnect paths required to transfer data between the processing units. Hence, the register binding can no longer ignore its effect on the wiring complexity of resulting designs. The objective of this thesis is to develop techniques for an interconnect-aware high-level synthesis. Under this common theme, this thesis has two distinct focuses. The first focus of this thesis is on developing a new high-level synthesis framework while taking the inter-processor communication delay into consideration. The second focus of this thesis is on the developing of a technique to carry out the register binding and a scheme to reduce the number of registers while taking the complexity of the interconnects into consideration.;A technique for register binding that results in a reduced number of registers and interconnects is developed by appropriately dividing the lifetime of a token into multiple segments and then binding those having the same source and/or destination into a single register. A node regeneration scheme, in which the idle processing units are utilized to generate multiple copies of the nodes in a given DFG, is devised to reduce the number of registers and interconnects even further.;The techniques and schemes developed in this thesis are applied to the synthesis of architectures for a number of benchmark DSP problems and compared with various other commonly used synthesis methods in order to assess their effectiveness. It is shown that the proposed techniques provide superior performance in terms of the iteration period, placement area, and the numbers of the processing units, registers and interconnects in the synthesized architecture.;A novel scheduling and processor allocation technique taking into consideration the inter-processor communication delay is presented. In the proposed technique, the communication delay between a pair of nodes of different types is treated as a non-computing node, whereas that between a pair of nodes of the same type is taken into account by re-adjusting the firing times of the appropriate nodes of the data flow graph (DFG). Another technique for the integration of the placement process into the scheduling and processor allocation in order to determine the actual positions of the processing units in the placement space is developed. The proposed technique makes use of a hybrid library of functional units, which includes both operation-specific and reconfigurable multiple-operation functional units, to maximize the local data transfer.
机译:可以将高级体系结构综合描述为将行为描述转换为结构描述的过程。调度,处理器分配和寄存器绑定是高级综合中最重要的任务。过去,由于数字系统的总体延迟主要由逻辑门的延迟决定,因此有可能仅在高级综合中仅关注处理单元的延迟而忽略布线延迟。但是,随着制程技术被缩小到深亚微米区域,在VLSI设计中不能再忽略全局互连延迟。因此,必须在高级综合中包括用于在处理单元之间通信数据的电线和总线上的延迟,即处理器间通信延迟。此外,执行寄存器绑定过程的方式也对在处理单元之间传输数据所需的互连路径的复杂性有影响。因此,寄存器绑定不能再忽略其对最终设计布线复杂性的影响。本文的目的是开发一种用于互连的高级综合技术。在这个共同主题下,本论文有两个不同的重点。本文的首要重点是在考虑处理器间通信延迟的同时,开发一种新的高级综合框架。本论文的第二个重点是开发一种进行寄存器绑定的技术和一种减少寄存器数量同时又考虑到互连的复杂性的方案。通过适当地将令牌的生存期分为多个段,然后将具有相同源和/或目标的段绑定到单个寄存器中,可以开发寄存器和互连的功能。设计了一种节点再生方案,其中利用空闲处理单元在给定的DFG中生成节点的多个副本,以进一步减少寄存器和互连的数量。针对许多基准DSP问题的架构综合,并与其他各种常用的综合方法进行了比较,以评估其有效性。结果表明,所提出的技术在迭代周期,放置区域以及综合体系结构中的处理单元,寄存器和互连的数量方面提供了优异的性能。介绍了处理器通信延迟。在提出的技术中,将不同类型的一对节点之间的通信延迟视为非计算节点,而通过重新调整适当的触发时间来考虑相同类型的一对节点之间的通信延迟。数据流图(DFG)的节点。为了确定处理单元在放置空间中的实际位置,开发了另一种用于将放置过程集成到调度和处理器分配中的技术。所提出的技术利用功能单元的混合库,其中包括特定于操作的和可重新配置的多操作功能单元,以最大化本地数据传输。

著录项

  • 作者

    Itradat, Awni.;

  • 作者单位

    Concordia University (Canada).;

  • 授予单位 Concordia University (Canada).;
  • 学科 Engineering Computer.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 173 p.
  • 总页数 173
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号