In-place Irregular Computation for Message-passing Chip-multiprocessors

机译：用于消息传递芯片 - 多处理器的地理不规则计算

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the increase of CMP (Chip-Multiprocessor) scale, moving data to computation on chip becomes more expensive. Accordingly, moving computation to data has potential to improve efficiency. We propose an in-place computation co-design of many-simple-core CMP for irregular applications. The computing paradigm is that an application's critical irregular data (or part of them) is partitioned into on-chip memory-slices and each slice is delegated by an adjacent core. From the hardware aspect, it divides cores into two groups with load balancing; each group is responsible for accessing off-chip data or irregular data respectively. Moreover, L2 caches are replaced with scratchpads and intra-core message-passing is supported by hardware. Accordingly, algorithms of some typical irregular application kernels are presented, including Breadth-First Search, hash-map, Sparse Matrix-Vector Multiplication and data-walk. Simulations show that, compared with conventional implementations based on cache-coherence (CC), it can improve the performance and energy-efficiency significantly. The limitation is also discussed.

机译：随着CMP（芯片 - 多处理器）的增加，将数据移动到芯片的计算变得更加昂贵。因此，向数据移动计算具有提高效率。我们提出了一种用于不规则应用的许多简单核心CMP的原位计算共同设计。计算范例是应用程序的临界不规则数据（或其中一部分）被划分为片上存储器切片，并且每个切片由相邻的核心委派。从硬件方面，它将核心划分为具有负载平衡的两组;每个组负责分别访问芯片数据或不规则数据。此外，L2高速缓存被缩小板替换，硬件支持核心内部消息传递。因此，呈现了一些典型的不规则应用程序内核的算法，包括广度优先搜索，散列图，稀疏矩阵矢量乘法和数据步行。模拟表明，与基于缓存相干（CC）的传统实施相比，它可以显着提高性能和能效。还讨论了限制。

著录项

来源
《International Workshop on Embedded Multicore Systems》|2017年|320p|共8页
会议地点
作者
Zhang Youhui; Zhang Youyang; Li Yanhua; Fei Xiang; Zheng Weimin;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.133.2-53;
关键词
In-place computation; Chip-multiprocessor; Irregular applications; Message passing;

机译：就地计算;芯片 - 多处理器;不规则应用;消息传递;

相似文献

外文文献
中文文献
专利

1. Near-Optimal and Scalable Intrasignal In-Place Optimization for Non-Overlapping and Irregular Access Schemes [J] . ANGELIKI KRITIKAKOU, FRANCKY CATTHOOR, VASILIOS KELEFOURAS, ACM Transactions on Design Automation of Electronic Systems . 2014,第1期

机译：非重叠和不规则访问方案的接近最佳和可伸缩的信号内就地优化
2. Relaxed barrier synchronization for the BSP model of computation on message-passing architectures [J] . Jin-Soo Kim, Soonhoi Ha, Chu Shik John Information Processing Letters . 1998,第5期

机译：消息传递体系结构上的BSP计算模型的轻松屏障同步
3. Performance evaluation of scheduling precedence-constrained computations on message-passing systems [J] . Al-Mouhamed M., Al-Maasarani A. IEEE Transactions on Parallel and Distributed Systems . 1994,第12期

机译：消息传递系统中调度优先约束计算的性能评估
4. In-place Irregular Computation for Message-passing Chip-multiprocessors [C] . Zhang Youhui, Zhang Youyang, Li Yanhua, International Workshop on Embedded Multicore Systems . 2017

机译：用于消息传递芯片 - 多处理器的地理不规则计算
5. Metascalable hybrid message-passing and multithreading algorithms for n-tuple computation. [D] . Kunaseth, Manaschai. 2013

机译：用于n元组计算的metascalable混合消息传递和多线程算法。
6. G-Computation and Hierarchical Models for Estimating Multiple Causal Effects From Observational Disease Registries With Irregular Visits [O] . Zach Shahn, Ying Li, Zhaonan Sun, 2019

机译：G计算和分层模型用于从不定期访问的观察性疾病登记处估计多种因果效应
7. A Comparison of Different Message-Passing Paradigms for the Parallelization of Two Irregular Applications [O] . Seungjo Bae, Sanjay Ranka 1994

机译：两种不规则应用并行化的不同消息传递范式比较
8. Characterizing Computation-Communication Overlap in Message-Passing Systems [R] . 2008

机译：在消息传递系统中表征计算 - 通信重叠

In-place Irregular Computation for Message-passing Chip-multiprocessors

摘要

著录项

相似文献

相关主题

期刊订阅