首页> 外文会议>Languages and compilers for parallel computing >Parallelizing Compiler Framework and API for Power Reduction and Software Productivity of Real-Time Heterogeneous Multicores
【24h】

Parallelizing Compiler Framework and API for Power Reduction and Software Productivity of Real-Time Heterogeneous Multicores

机译:并行异构编译器框架和API,可降低实时异构多核的功耗并提高其软件生产率

获取原文
获取原文并翻译 | 示例

摘要

Heterogeneous multicores have been attracting much attention to attain high performance keeping power consumption low in wide spread of areas. However, heterogeneous multicores force programmers very difficult programming. The long application program development period lowers product competitiveness. In order to overcome such a situation, this paper proposes a compilation framework which bridges a gap between programmers and heterogeneous multicores. In particular, this paper describes the compilation framework based on OSCAR compiler. It realizes coarse grain task parallel processing, data transfer using a DMA controller, power reduction control from user programs with DVFS and clock gating on various heterogeneous multicores from different vendors. This paper also evaluates processing performance and the power reduction by the proposed framework on a newly developed 15 core heterogeneous multicore chip named RP-X integrating 8 general purpose processor cores and 3 types of accelerator cores which was developed by Renesas Electronics, Hitachi, Tokyo Institute of Technology and Waseda University. The framework attains speedups up to 32x for an optical flow program with eight general purpose processor cores and four DRP(Dynamically Reconfigurable Processor) accelerator cores against sequential execution by a single processor core and 80% of power reduction for the real-time AAC encoding.
机译:异构多核已吸引了很多关注,以实现高性能,从而在广泛的区域中保持较低的功耗。但是,异构多核迫使程序员非常困难的编程。较长的应用程序开发周期降低了产品竞争力。为了克服这种情况,本文提出了一种编译框架,该框架弥合了程序员和异构多核之间的鸿沟。特别是,本文描述了基于OSCAR编译器的编译框架。它实现了粗粒度任务并行处理,使用DMA控制器的数据传输,带有DVFS的用户程序的功耗降低控制以及来自不同供应商的各种异构多核上的时钟门控。本文还通过瑞萨电子,日立,东京研究所开发的,集成了8个通用处理器内核和3种类型的加速器内核的新开发的15核异构多核芯片RP-X评估了所提出框架的处理性能和功耗降低早稻田大学技术学院。该框架针对具有八个通用处理器内核和四个DRP(动态可重配置处理器)加速器内核的光流程序,最高可将速度提高32倍,而单个处理器内核无法依次执行,并且实时AAC编码可降低80%的功耗。

著录项

  • 来源
  • 会议地点 Houston TX(US);Houston TX(US)
  • 作者单位

    Department of Computer Science and Engineering, Waseda University, 3-4-1 Okubo, Shinjuku-ku, Tokyo, Japan;

    Department of Computer Science and Engineering, Waseda University, 3-4-1 Okubo, Shinjuku-ku, Tokyo, Japan;

    Department of Computer Science and Engineering, Waseda University, 3-4-1 Okubo, Shinjuku-ku, Tokyo, Japan;

    Department of Computer Science and Engineering, Waseda University, 3-4-1 Okubo, Shinjuku-ku, Tokyo, Japan;

    Department of Computer Science and Engineering, Waseda University, 3-4-1 Okubo, Shinjuku-ku, Tokyo, Japan;

    Department of Computer Science and Engineering, Waseda University, 3-4-1 Okubo, Shinjuku-ku, Tokyo, Japan;

    Department of Computer Science and Engineering, Waseda University, 3-4-1 Okubo, Shinjuku-ku, Tokyo, Japan;

    Department of Computer Science and Engineering, Waseda University, 3-4-1 Okubo, Shinjuku-ku, Tokyo, Japan;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 计算技术、计算机技术;
  • 关键词

    heterogeneous multicore; parallelizing compiler; api;

    机译:异构多核并行化编译器; api;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号