首页>
外国专利>
LINEAR NEURAL RECONSTRUCTION FOR DEEP NEURAL NETWORK COMPRESSION
LINEAR NEURAL RECONSTRUCTION FOR DEEP NEURAL NETWORK COMPRESSION
展开▼
机译:用于深度神经网络压缩的线性神经重构
展开▼
页面导航
摘要
著录项
相似文献
摘要
A method and apparatus for performing deep neural network compression of convolutional and fully connected layers using a linear approximation of their outputs with information, such as in matrices representing weights, biases and non-linearities, to iteratively compress a pre-trained deep neural network by low displacement rank based approximation of the network layer weight matrices. Extension of the technique enables consecutive layers to be compressed jointly, allowing compression and speeding inference by reducing the number of channels/hidden neurons in the network.
展开▼