首页> 外国专利> LOW DISPLACEMENT RANK BASED DEEP NEURAL NETWORK COMPRESSION

LOW DISPLACEMENT RANK BASED DEEP NEURAL NETWORK COMPRESSION

机译:基于低位移秩的深度神经网络压缩

摘要

A method and an apparatus for performing deep neural network compression use an approximation training set along with information, such as in matrices representing weights, biases and non-linearities, to iteratively compress a pre-trained deep neural network by low displacement rank based approximation of the network layer weight matrices. The low displacement rank approximation allows for replacement of an original layer weight matrices of the pre-trained deep neural network as the sum of a small number of structured matrices, allowing compression and low inference complexity.
机译:一种用于执行深度神经网络压缩的方法和设备,使用近似训练集和信息,例如在表示权重、偏差和非线性的矩阵中,通过基于网络层权重矩阵的低位移秩近似来迭代压缩预训练的深度神经网络。低位移秩近似允许将预训练深度神经网络的原始层权重矩阵替换为少量结构化矩阵的总和,从而实现压缩和低推理复杂度。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号