LOW DISPLACEMENT RANK BASED DEEP NEURAL NETWORK COMPRESSION

Abstract

A method and an apparatus for performing deep neural network compression use an approximation training set, along with information such as matrices representing weights, biases and non-linearities, to iteratively compress a pre-trained deep neural network by low displacement rank based approximation of the network layer weight matrices. The low displacement rank approximation allows the original layer weight matrices of the pre-trained deep neural network to be replaced by the sum of a small number of structured matrices, enabling compression and low inference complexity.
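
The abstract does not say which displacement operator is used or how the iterations run, so the following is only a minimal sketch of the general idea, not the patented procedure. It assumes a square weight matrix and the Toeplitz-like (Stein-type) displacement `W - Z W Z^T`, where `Z` is the lower shift matrix, and it omits the iterative refinement against an approximation training set. The function names `shift_matrix`, `displacement` and `low_displacement_rank_approx` are hypothetical.

```python
import numpy as np

def shift_matrix(n):
    """Lower shift matrix Z: ones on the first subdiagonal (assumed operator)."""
    return np.eye(n, k=-1)

def displacement(W):
    """Stein-type displacement W - Z W Z^T of a square matrix W."""
    Z = shift_matrix(W.shape[0])
    return W - Z @ W @ Z.T

def low_displacement_rank_approx(W, r):
    """Approximate W by a matrix whose displacement has rank at most r.

    Returns the approximation W_r and the generators G, H (each n x r),
    which are all that would need to be stored: 2*n*r numbers instead of n*n.
    """
    n = W.shape[0]
    # Truncate the SVD of the displacement to its r leading components.
    U, s, Vt = np.linalg.svd(displacement(W))
    G = U[:, :r] * s[:r]
    H = Vt[:r, :].T
    # Invert the displacement: W_r = sum_{k=0}^{n-1} Z^k (G H^T) (Z^T)^k.
    # Each rank-one piece g_i h_i^T contributes one structured matrix
    # (lower-triangular Toeplitz times upper-triangular Toeplitz).
    Z = shift_matrix(n)
    term = G @ H.T
    W_r = np.zeros((n, n))
    for _ in range(n):
        W_r += term
        term = Z @ term @ Z.T
    return W_r, G, H
```

Under this operator a Toeplitz matrix has displacement rank at most 2, so `low_displacement_rank_approx(T, 2)` reproduces a Toeplitz `T` up to rounding error. A general trained weight matrix is only approximated, which is presumably why the method described above refines the result iteratively.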