首页>
外国专利>
CLUSTER COMPRESSION FOR COMPRESSING WEIGHTS IN NEURAL NETWORKS
CLUSTER COMPRESSION FOR COMPRESSING WEIGHTS IN NEURAL NETWORKS
展开▼
机译:压缩神经网络中的权重
展开▼
页面导航
摘要
著录项
相似文献
摘要
A method for instantiating a convolutional neural network on a computing system. The convolutional neural network includes a plurality of layers, and instantiating the convolutional neural network includes training the convolutional neural network using a first loss function until a first classification accuracy is reached, clustering a set of F x K kernels of the first layer into a set of C clusters, training the convolutional neural network using a second loss function until a second classification accuracy is reached, creating a dictionary which maps each of a number of centroids to a corresponding centroid identifier, quantizing and compressing F filters of the first layer, storing F quantized and compressed filters of the first layer in a memory of the computing system, storing F biases of the first layer in the memory, and classifying data received by the convolutional neural network.
展开▼
机译:一种在计算系统上实例化卷积神经网络的方法。卷积神经网络包括多个层,实例化卷积神经网络包括使用第一损失函数训练卷积神经网络,直到达到第一分类精度,将一组 F I> x K I>个内核分成一组 C I>个簇,使用第二个损失函数训练卷积神经网络,直到达到第二个分类精度,然后创建一个字典来映射每个多个质心到相应的质心标识符,量化和压缩第一层的 F I>过滤器,将第一层的 F I>量化和压缩的过滤器存储在计算的内存中系统,将第一层的 F I>偏差存储在内存中,并对卷积神经网络接收的数据进行分类。
展开▼