Cache-Integrated Network Interfaces: Flexible On-Chip Communication and Synchronization for Large-Scale CMPs

Stamatis Kavadias; Manolis Katevenis; Michail Zampetakis; Dimitrios S. Nikolopoulos

首页> 外文期刊>International journal of parallel programming >Cache-Integrated Network Interfaces: Flexible On-Chip Communication and Synchronization for Large-Scale CMPs

【24h】

Cache-Integrated Network Interfaces: Flexible On-Chip Communication and Synchronization for Large-Scale CMPs

机译：高速缓存集成的网络接口：大规模CMP的灵活片上通信和同步

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Per-core scratchpad memories (or local stores) allow direct inter-core communication, with latency and energy advantages over coherent cache-based communication, especially as CMP architectures become more distributed. We have designed cache-integrated network interfaces, appropriate for scalable multicores, that combine the best of two worlds - the flexibility of caches and the efficiency of scratchpad memories: on-chip SRAM is configurably shared among caching, scratchpad, and virtualized network interface (NI) functions. This paper presents our architecture, which provides local and remote scratchpad access, to either individual words or multiword blocks through RDMA copy. Furthermore, we introduce event responses, as a technique that enables software configurable communication and synchronization primitives. We present three event response mechanisms that expose NI functionality to software, for multiword transfer initiation, completion notifications for software selected sets of arbitrary size transfers, and multi-party synchronization queues. We implemented these mechanisms in a four-core FPGA prototype, and measure the logic overhead over a cache-only design for basic NI functionality to be less than 20%. We also evaluate the on-chip communication performance on the prototype, as well as the performance of synchronization functions with simulation of CMPs with up to 128 cores. We demonstrate efficient synchronization, low-overhead communication, and amortized-overhead bulk transfers, which allow parallelization gains for fine-grain tasks, and efficient exploitation of the hardware bandwidth.

机译：每核暂存器存储器（或本地存储）允许直接进行核间通信，与基于一致性的基于缓存的通信相比，具有延迟和能源优势，尤其是在CMP体系结构变得更加分散时。我们设计了适用于可扩展多核的，集成了缓存的网络接口，结合了两个方面的优势-缓存的灵活性和暂存器的效率：片内SRAM可配置地在缓存，暂存器和虚拟化网络接口之间共享（ NI）功能。本文介绍了我们的体系结构，该体系结构通过RDMA复制提供对单个单词或多单词块的本地和远程暂存器访问。此外，我们介绍了事件响应，作为一种启用软件可配置的通信和同步原语的技术。我们提供了三种事件响应机制，这些机制将NI功能暴露给软件，用于多字传输启动，针对软件选择的任意大小传输集的完成通知以及多方同步队列。我们在四核FPGA原型中实现了这些机制，并测量了仅用于高速缓存的设计的逻辑开销，以使NI的基本功能小于20％。我们还评估了原型上的片上通信性能，以及通过模拟具有多达128个内核的CMP的同步功能的性能。我们展示了高效的同步，低开销的通信和摊销的开销的批量传输，这些实现了细粒度任务的并行化收益以及对硬件带宽的有效利用。

著录项

来源
《International journal of parallel programming》 |2012年第6期|583-604|共22页
作者
Stamatis Kavadias; Manolis Katevenis; Michail Zampetakis; Dimitrios S. Nikolopoulos;
展开▼
作者单位

Foundation for Research & Technology - Hellas, Institute of Computer Science (FORTH-ICS), Heraklion, Crete, Greece;

Foundation for Research & Technology - Hellas, Institute of Computer Science (FORTH-ICS), Heraklion, Crete, Greece;

Foundation for Research & Technology - Hellas, Institute of Computer Science (FORTH-ICS), Heraklion, Crete, Greece;

Foundation for Research & Technology - Hellas, Institute of Computer Science (FORTH-ICS), Heraklion, Crete, Greece;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
cache; network interface; explicit communication; synchronization;

机译：缓存网络接口;明确的沟通;同步化;

相似文献

外文文献
中文文献
专利

1. Hybrid circuit-switched network for on-chip communication in large-scale chip-multiprocessors [J] . Hongyin Luo, Shaojun Wei, Deming Chen, Journal of Parallel and Distributed Computing . 2014,第9期

机译：用于大规模芯片多处理器中片上通信的混合电路交换网络
2. An energy consumption characterization of on-chip interconnection networks for tiled CMP architectures [J] . Antonio Flores, Juan L. Aragon, Manuel E. Acacio Journal of supercomputing . 2008,第3期

机译：平铺CMP架构的片上互连网络的能耗表征
3. Chaotic Synchronization in Star Coupled Networks of Three-Dimensional Cellular Neural Networks and Its Application in Communications [J] . H. Serrano-Guerrero, C. Cruz-Hernandez, R. M. Lopez-Gutierrez, International journal of nonlinear sciences and numerical simulation . 2010,第8期

机译：三维细胞神经网络星形耦合网络中的混沌同步及其在通信中的应用
4. On-chip Communication and Synchronization Mechanisms with Cache-Integrated Network Interfaces [C] . Stamatis Kavadias, Manolis Katevenis, Michail Zampetakis, 7th ACM computing frontiers conference 2010 . 2010

机译：具有高速缓存集成网络接口的片上通信和同步机制
5. Robust and systemwide fault location in large-scale power networks via optimal deployment of synchronized measurements. [D] . Korkali, Mert. 2013

机译：通过优化部署同步测量，可以在大型电力网络中实现可靠的系统级故障定位。
6. On-chip constructive cell-Network study (I): Contribution of cardiac fibroblasts to cardiomyocyte beating synchronization and community effect [O] . Tomoyuki Kaneko, Fumimasa Nomura, Kenji Yasuda 2011

机译：片上建设性细胞-网络研究（I）：心脏成纤维细胞对心肌细胞搏动同步和社区效应的贡献
7. Cache-Integrated Network Interfaces: Flexible On-Chip Communication and Synchronization for Large-Scale CMPs [O] . Kavadias, Stamatis, Katevenis, Manolis, Zampetakis, Michail, 2012

机译：高速缓存集成的网络接口：大规模CMP的灵活片上通信和同步

Cache-Integrated Network Interfaces: Flexible On-Chip Communication and Synchronization for Large-Scale CMPs

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅