首页> 外国专利> NEURAL NETWORK PROCESSOR USING COMPRESSION AND DECOMPRESSION OF ACTIVATION DATA TO REDUCE MEMORY BANDWIDTH UTILIZATION

NEURAL NETWORK PROCESSOR USING COMPRESSION AND DECOMPRESSION OF ACTIVATION DATA TO REDUCE MEMORY BANDWIDTH UTILIZATION

机译:神经网络处理器使用压缩和解压缩激活数据来减少内存带宽利用率

摘要

The performance of a neural network (NN) can be limited by the number of operations being performed. Using a line buffer that is directed to shift a memory block by a selected shift stride for cooperating neurons, data that is operatively residing memory and which would require multiple write cycles into a cooperating line buffer can be processed as in a single line buffer write cycle thereby enhancing the performance of a NN/DNN. A controller and/or iterator can generate one or more instructions having the memory block shifting values for communication to the line buffer. The shifting values can be calculated using various characteristics of the input data as well as the NN/DNN inclusive of the data dimensions. The line buffer can read data for processing, shift the data of the memory block and write the data in the line buffer for subsequent processing.
机译:神经网络(NN)的性能可以受到正在执行的操作的数量的限制。使用针对用于协作神经元的所选移位级别来使用线缓冲器,用于协作神经元,可以在单行缓冲器写周期中处理可操作地驻留存储器并且需要将多个写入周期更需要多个写入周期的数据因此增强了NN / DNN的性能。控制器和/或迭代器可以生成具有存储器块移位值的一个或多个指令,用于与行缓冲器进行通信。可以使用输入数据的各种特性以及包括数据维度的NN / DNN来计算转换值。行缓冲区可以读取用于处理的数据,移动存储器块的数据并在线缓冲器中的数据写入后续处理。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号