LOW-COMPLEXITY DEEP LEARNING ACCELERATION HARDWARE DATA PROCESSING DEVICE

Abstract

Provided is deep learning accelerator hardware whose structure reduces the number of accesses to an external memory and allows data requests to be predicted, so that data reuse is maximized and peak bandwidth is reduced. A deep learning accelerator according to an embodiment of the present invention comprises: a deep learning accelerator for performing computation on input data; an encoder for compressing the output data of the deep learning accelerator; and a WDMA for writing the output data compressed by the encoder to an external memory, wherein the encoder selectively applies different compression schemes, compressing the output data on the basis of the context of the output data. The present invention can therefore reduce the number of times the deep learning accelerator accesses the external large-capacity memory when it repeatedly processes data for the same channel or weight.
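
The selective compression step described above can be illustrated with a minimal software sketch. The abstract does not say which compression schemes the encoder uses or what "context" means, so everything here is an assumption made for illustration: the context is taken to be the fraction of zeros in a block of output values, and the two schemes are raw storage and zero run-length encoding. The names `RAW`, `ZRLE`, `encode`, `decode` and `zero_threshold` are hypothetical and do not come from the patent.

```python
from typing import List, Tuple

RAW = 0   # hypothetical scheme tag: values stored unchanged
ZRLE = 1  # hypothetical scheme tag: zero run-length encoding


def zrle_encode(values: List[int]) -> List[Tuple[int, int]]:
    """Encode as (zeros_before, nonzero_value) pairs.

    Trailing zeros are not emitted; the decoder restores them
    from the block length stored alongside the payload.
    """
    pairs = []
    run = 0
    for v in values:
        if v == 0:
            run += 1
        else:
            pairs.append((run, v))
            run = 0
    return pairs


def zrle_decode(pairs: List[Tuple[int, int]], length: int) -> List[int]:
    """Invert zrle_encode, padding trailing zeros up to `length`."""
    out: List[int] = []
    for run, v in pairs:
        out.extend([0] * run)
        out.append(v)
    out.extend([0] * (length - len(out)))
    return out


def encode(values: List[int], zero_threshold: float = 0.5):
    """Pick a scheme from the block's context (assumed: zero fraction).

    Returns (scheme_tag, original_length, payload).
    """
    zero_fraction = values.count(0) / len(values) if values else 0.0
    if zero_fraction >= zero_threshold:
        return (ZRLE, len(values), zrle_encode(values))
    return (RAW, len(values), list(values))


def decode(block) -> List[int]:
    """Recover the original values from an encoded block."""
    scheme, length, payload = block
    if scheme == ZRLE:
        return zrle_decode(payload, length)
    return list(payload)


if __name__ == "__main__":
    sparse = [0, 0, 3, 0, 0, 0, 7, 0]   # e.g. a sparse activation block
    dense = [1, 2, 3, 4]
    print(encode(sparse))
    print(encode(dense))
```

In this sketch the scheme tag travels with the payload so that a reader of the external memory knows how to decode each block. Real hardware would pack the tag and payload into a bitstream before handing it to the WDMA; that packing is omitted here.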
