首页> 外文会议>International Workshop on Embedded Multicore Systems >In-place Irregular Computation for Message-passing Chip-multiprocessors
【24h】

In-place Irregular Computation for Message-passing Chip-multiprocessors

机译:用于消息传递芯片 - 多处理器的地理不规则计算

获取原文

摘要

With the increase of CMP (Chip-Multiprocessor) scale, moving data to computation on chip becomes more expensive. Accordingly, moving computation to data has potential to improve efficiency. We propose an in-place computation co-design of many-simple-core CMP for irregular applications. The computing paradigm is that an application's critical irregular data (or part of them) is partitioned into on-chip memory-slices and each slice is delegated by an adjacent core. From the hardware aspect, it divides cores into two groups with load balancing; each group is responsible for accessing off-chip data or irregular data respectively. Moreover, L2 caches are replaced with scratchpads and intra-core message-passing is supported by hardware. Accordingly, algorithms of some typical irregular application kernels are presented, including Breadth-First Search, hash-map, Sparse Matrix-Vector Multiplication and data-walk. Simulations show that, compared with conventional implementations based on cache-coherence (CC), it can improve the performance and energy-efficiency significantly. The limitation is also discussed.
机译:随着CMP(芯片 - 多处理器)的增加,将数据移动到芯片的计算变得更加昂贵。因此,向数据移动计算具有提高效率。我们提出了一种用于不规则应用的许多简单核心CMP的原位计算共同设计。计算范例是应用程序的临界不规则数据(或其中一部分)被划分为片上存储器切片,并且每个切片由相邻的核心委派。从硬件方面,它将核心划分为具有负载平衡的两组;每个组负责分别访问芯片数据或不规则数据。此外,L2高速缓存被缩小板替换,硬件支持核心内部消息传递。因此,呈现了一些典型的不规则应用程序内核的算法,包括广度优先搜索,散列图,稀疏矩阵矢量乘法和数据步行。模拟表明,与基于缓存相干(CC)的传统实施相比,它可以显着提高性能和能效。还讨论了限制。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号