【24h】

Communication scheduling

机译:通讯调度

获取原文
获取外文期刊封面目录资料

摘要

The high arithmetic rates of media processing applications require architectures with tens to hundreds of functional units, multiple register files, and explicit interconnect between functional units and register files. Communication scheduling enables scheduling to these emerging architectures, including those that use shared buses and register file ports. Scheduling to these shared interconnect architectures is difficult because it requires simultaneously allocating functional units to operations and buses and register file ports to the communications between operations. Prior VLIW scheduling algorithms are limited to clustered register file architectures with no shared buses or register file ports. Communication scheduling extends the range of target architectures by making each communication explicit and decomposing it into three components: a write stub, zero or more copy operations, and a read stub. Communication scheduling allows media processing kernels to achieve 98% of the performance ofa central register file architecture on a distributed register file architecture with only 9% of the area, 6% of the power consumption, and 37% of the access delay, and 120% of the performance of a clustered register file architecture on a distributed register file architecture with 56% of the area and 50% of the power consumption.
机译:媒体处理应用程序的高运算率要求架构具有数十到数百个功能单元,多个寄存器文件以及功能单元和寄存器文件之间的显式互连。通过通信调度,可以调度到这些新兴的体系结构,包括那些使用共享总线和注册文件端口的体系结构。调度到这些共享的互连体系结构很困难,因为它需要同时为操作和总线分配功能单元,并为操作之间的通信注册文件端口。先前的VLIW调度算法仅限于没有共享总线或寄存器文件端口的群集寄存器文件体系结构。通信调度通过使每个通信明确并将其分解为三个组件来扩展目标体系结构的范围:写存根,零个或多个复制操作以及读存根。通过通信调度,媒体处理内核可以在分布式寄存器文件体系结构上实现中央寄存器文件体系结构98%的性能,而其面积仅为9%,功耗为6%,访问延迟为37%和120%集群寄存器文件体系结构在分布式寄存器文件体系结构上的性能(占面积的56%和功耗的50%)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号