首页>
外国专利>
NEURAL NETWORK MODEL COMPRESSION WITH SELECTIVE STRUCTURED WEIGHT UNIFICATION
NEURAL NETWORK MODEL COMPRESSION WITH SELECTIVE STRUCTURED WEIGHT UNIFICATION
展开▼
机译:具有选择性结构化重量统一的神经网络模型压缩
展开▼
页面导航
摘要
著录项
相似文献
摘要
A method, computer program, or computer system is provided for compressing a neural network model. One or more blocks are identified from among a superblock corresponding to a multi-dimensional tensor associated with a neural network. A set of weight coefficients associated with the superblock is unified. A model of the neural network is compressed based on the unified set of weight coefficients.
展开▼