首页> 外国专利> Neural network processor using compression and decompression of activation data to reduce memory bandwidth utilization

Neural network processor using compression and decompression of activation data to reduce memory bandwidth utilization

机译：神经网络处理器使用激活数据的压缩和解压缩来减少内存带宽利用率

页面导航

摘要
著录项
相似文献

摘要

The deep neural network (DNN) module may compress and decompress neurally generated activation data to reduce the utilization of memory bus bandwidth. The compression unit may receive an uncompressed chunk of data generated by neurons in the DNN module. The compression unit produces a mask portion and a data portion of the compressed output chunk. The mask portion encodes the presence and position of zero and nonzero bytes in the uncompressed chunk of data. The data portion stores truncated nonzero bytes from the uncompressed chunks of data. The decompression unit may receive a compressed chunk of data from the memory of the DNN processor or the memory of the application host. The decompression unit decompresses the compressed chunks of data using the mask portion and the data portion. This may reduce memory bus utilization, allow the DNN module to complete processing operations more quickly, and reduce power consumption.

机译：深度神经网络（DNN）模块可以压缩和解压缩神经生成的激活数据，以减少内存总线带宽的利用。压缩单元可以接收由DNN模块中的神经元生成的未压缩数据块。压缩单元产生压缩的输出组块的掩码部分和数据部分。掩码部分对未压缩数据块中零字节和非零字节的存在和位置进行编码。数据部分存储来自未压缩数据块的截断的非零字节。解压缩单元可以从DNN处理器的存储器或应用程序主机的存储器接收压缩的数据块。解压缩单元使用掩码部分和数据部分对压缩的数据块进行解压缩。这可以降低内存总线利用率，允许DNN模块更快地完成处理操作，并降低功耗。

著录项

公开/公告号KR20190141694A

专利类型
公开/公告日2019-12-24

原文格式PDF
申请/专利权人 마이크로소프트 테크놀로지 라이센싱 엘엘씨;
展开▼

申请/专利号KR20197033456
发明设计人 코커리 조셉 레온;룬델 벤자민 엘리엇;월 래리 마빈;맥브라이드 차드 발링;암바르데카르 아몰 아쇽;페트르 조지;세도라 켄트 디.;밥로브 보리스;
展开▼

申请日2018-04-16
分类号G06N3/063;G06F1/32;G06F12/02;G06F13/16;G06N3/04;
国家 KR
入库时间 2022-08-21 11:08:35

相似文献

专利
外文文献
中文文献