...
首页> 外文期刊>IEICE Electronics Express >Decoupled iteration mapping: improving dependency-loop performance on SIMD processors
【24h】

Decoupled iteration mapping: improving dependency-loop performance on SIMD processors

机译:解耦的迭代映射:提高SIMD处理器上的依赖循环性能

获取原文
   

获取外文期刊封面封底 >>

       

摘要

References(7) Wide Single Instruction Multiple Data (SIMD) architectures are very important in the compute-intensive applications, but less efficient for applications with cross-iteration dependency loops which are difficult to parallelize and vectorize. This paper introduces Decoupled Iteration Mapping (DIM), a technique that dynamically maps a cross-iteration dependency loop onto the improved SIMD architecture which achieved multicore-like thread-parallel performance. The minor modification on the baseline architecture is composed of a Prefetch Unit & Instruction Buffer Array (PU&IBA), a Loop Control Unit & Instruction Dispatch Unit (LCU&IDU), and a Data Buffer Chain (DBC). Experimental results show that, the proposed DIM scheme can achieve average 3.04x performance speedup with a cost of only 6.44% area overhead.
机译:参考文献(7)宽单指令多数据(SIMD)架构在计算密集型应用程序中非常重要,但对于具有交叉迭代依赖性循环却难以并行化和矢量化的应用程序,效率较低。本文介绍了去耦迭代映射(DIM),该技术可将交叉迭代依赖性循环动态映射到改进的SIMD体系结构上,从而实现了类似多核的线程并行性能。对基准体系结构的次要修改由预取单元和指令缓冲区阵列(PU&IBA),循环控制单元和指令分配单元(LCU&IDU)以及数据缓冲区链(DBC)组成。实验结果表明,所提出的DIM方案可以实现平均3.04倍的性能提升,而面积开销仅为6.44%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号