首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >A High-Utilization Scheduling Schemeof Stream Programs on ClusteredVLIW Stream Architectures
【24h】

A High-Utilization Scheduling Schemeof Stream Programs on ClusteredVLIW Stream Architectures

机译:ClusteredVLIW流架构上的流程序高效调度方案

获取原文
获取原文并翻译 | 示例

摘要

Stream architectures have emerged as a mainstream solution for computation-intensive applications due to their rich arithmetic units. This paper proposes a multithreading technique based on a scheduling scheme of stream programs on clustered VLIW stream architecture, which aims at optimal arithmetic unit utilization without increasing energy consumption. Its principle is to exploit more kernel-level parallelism for further optimal compilation by constructing homogeneous multiple threads on stream programs. Three phases are proposed in the scheduling scheme. First, threads in stream programs are replicated for constructing homogeneous multiple threads. Second, time step assignment for homogeneous multithreaded stream programs is utilized to obtain efficient kernel combination. Third, stream segmentation is presented to optimize both memory transfers and startup overheads of kernels. A set of benchmarks are exploited to evaluate the effectiveness of the proposed technique. Experimental results show that, with exploiting kernel-level software pipeline, the proposed technique improves the performance by 20.9 percent averagely with the energy decreasing by 7.6 percent. Utilizations of adders and multipliers are up to average 77.4 and 75.8 percent, increasing 17.0 and 13.3 percent, respectively. Moreover, the proposed technique performs an average of 12.5 percent improvement over CSMT4 with the energy decreasing by 12.0 percent.
机译:流体系结构具有丰富的算术单元,已成为计算密集型应用程序的主流解决方案。本文提出了一种基于VLIW流架构的流程序调度方案的多线程技术,其目的是在不增加能耗的情况下优化算法单元的利用率。它的原理是通过在流程序上构造同质的多个线程来利用更多的内核级并行性来进一步优化编译。在调度方案中提出了三个阶段。首先,复制流程序中的线程以构造同构的多个线程。第二,利用同质多线程流程序的时间步分配来获得有效的内核组合。第三,提出了流分段以优化内存传输和内核的启动开销。利用一组基准来评估所提出技术的有效性。实验结果表明,利用内核级软件流水线,该技术平均可将性能提高20.9%,能耗降低7.6%。加法器和乘法器的利用率平均达到77.4%和75.8%,分别增长17.0%和13.3%。此外,所提出的技术比CSMT4平均提高了12.5%,能量降低了12.0%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号