IEEE International Symposium on High Performance Computer Architecture

Communication Lower Bound in Convolution Accelerators



Abstract

In current convolutional neural network (CNN) accelerators, communication (i.e., memory access) dominates the energy consumption. This work provides a comprehensive analysis and methodologies for minimizing the communication of CNN accelerators. For off-chip communication, we derive the theoretical lower bound for any convolutional layer and propose a dataflow that reaches this bound; this fundamental problem has not been solved by prior studies. On-chip communication is minimized through an elaborate workload and storage mapping scheme. In addition, we design a communication-optimal CNN accelerator architecture. Evaluations based on 65 nm technology demonstrate that the proposed architecture nearly reaches the theoretical minimum communication in a three-level memory hierarchy and is computation-dominant. The gap between the energy efficiency of our accelerator and the theoretical best value is only 37-87%.
