首页> 外文期刊>IEEE transactions on very large scale integration (VLSI) systems >Architecture and Compiler Optimizations for Data Bandwidth Improvement in Configurable Processors
【24h】

Architecture and Compiler Optimizations for Data Bandwidth Improvement in Configurable Processors

机译:可配置处理器中用于数据带宽改善的体系结构和编译器优化

获取原文
获取原文并翻译 | 示例

摘要

Many commercially available embedded processors are capable of extending their base instruction set for a specific domain of applications. While steady progress has been made in the tools and methodologies of automatic instruction set extension for configurable processors, the limited data bandwidth available in the core processor (e.g., the number of simultaneous accesses to the register file) becomes a potential performance bottleneck. In this paper, we first present a quantitative analysis of the data bandwidth limitation in configurable processors, and then propose a novel low-cost architectural extension and associated compilation techniques to address the problem. Specifically, we embed a single control bit in the instruction op-codes to selectively copy the execution results to a set of hash-mapped shadow registers in the write-back stage. This can efficiently reduce the communication overhead due to data transfers between the core processor and the custom logic. We also present a novel simultaneous global shadow register binding with a hash function generation algorithm to take full advantage of the extension. The application of our approach leads to a nearly optimal performance speedup
机译:许多商用嵌入式处理器都能够针对特定的应用领域扩展其基本指令集。虽然可配置处理器的自动指令集扩展的工具和方法已经取得了稳步的进展,但是核心处理器中可用的有限数据带宽(例如,同时访问寄存器文件的数量)成为潜在的性能瓶颈。在本文中,我们首先对可配置处理器中数据带宽限制进行了定量分析,然后提出了一种新颖的低成本体系结构扩展以及相关的编译技术来解决该问题。具体来说,我们在指令操作码中嵌入了一个控制位,以便在回写阶段将执行结果选择性地复制到一组哈希映射的影子寄存器中。由于核心处理器和定制逻辑之间的数据传输,这可以有效地减少通信开销。我们还提出了一种新颖的同时具有阴影函数生成算法的全局影子寄存器绑定,以充分利用扩展。我们方法的应用导致近乎最佳的性能加速

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号