Efficient and Predictable Group Communication for Manycore NoCs



Abstract

Massive manycore embedded processors with network-on-chip (NoC) architectures are becoming common. These architectures provide higher processing capability due to an abundance of cores, and their native core-to-core communication can be exploited via message passing to provide system scalability. Despite these advantages, manycores pose predictability challenges that can affect both performance and real-time capabilities. In this work, we develop efficient and predictable group communication using message passing, designed specifically for large core counts in 2D mesh NoC architectures. We have implemented the most commonly used collectives so that they achieve low latency and high timing predictability, making them suitable for balanced parallelization of scalable high-performance and embedded/real-time systems alike. Experimental results on a single-die 64-core hardware platform show that our collectives reduce communication times by up to 95% for single-packet messages and by up to 98% for longer messages. Depending on the group primitive, the performance advantage holds either for all message sizes or only for small ones. In addition, our communication primitives have significantly lower variance than prior approaches, thereby providing more balanced parallel execution progress and better real-time predictability.


Bibliographic information

Source
International conference on high performance computing | 2016 | pp. 383-403 | 21 pages
Authors
Karthik Yagna; Onkar Patil; Frank Mueller
Original format: PDF

Similar literature


1. An energy-efficient design of microkernel-based on-chip OS for NOC-based manycore system [J]. Hu Wei, Guo Hong, Zhang Kai. Journal of supercomputing. 2017, No. 8
2. On-Chip Communication Network for Efficient Training of Deep Convolutional Networks on Heterogeneous Manycore Systems [J]. Choi Wonje, Duraisamy Karthi, Kim Ryan Gary. Fortschritte der Physik. 2018, No. 5
3. Learning-Based Application-Agnostic 3D NoC Design for Heterogeneous Manycore Systems [J]. Joardar Biresh Kumar, Kim Ryan Gary, Doppa Janardhan Rao. IEEE Transactions on Computers. 2019, No. 6
4. Efficient and Predictable Group Communication for Manycore NoCs [C]. Karthik Yagna, Onkar Patil, Frank Mueller. ISC High Performance Conference. 2016
5. RELAX: Cross-Layer Resource Management for Reliable NoC-Based 2D and 3D Manycore Architectures in The Dark Silicon Era [D]. Raparti, Venkata Yaswanth. 2019
6. ParRouting: An Efficient Area Partition-Based Congestion-Aware Routing Algorithm for NoCs [O]. Juan Fang, Di Zhang, Xiaqing Li. 2020
7. On-Chip Communication Network for Efficient Training of Deep Convolutional Networks on Heterogeneous Manycore Systems [O]. Choi, Wonje, Duraisamy, Karthi, Kim, Ryan Gary. 2017
