首页> 外国专利> DIRECT COMPUTATION WITH COMPRESSED WEIGHT IN TRAINING DEEP NEURAL NETWORK

DIRECT COMPUTATION WITH COMPRESSED WEIGHT IN TRAINING DEEP NEURAL NETWORK

机译:训练深层神经网络的压缩直接计算

摘要

A distributed training system including a parameter server is configured to compress the weight metrices according to a clustering algorithm, with the compressed representation of the weight matrix may thereafter distributed to training workers. The compressed representation may comprise a centroid index matrix and a centroid table, wherein each element of the centroid index matrix corresponds to an element of the corresponding weight matrix and comprises an index into the centroid table, and wherein each element of the centroid table comprises a centroid value. In a further example aspect, a training worker may compute an activation result directly from the compressed representation of a weight matrix and a training data matrix by performing gather-reduce-add operations that accumulate all the elements of the training data matrix that correspond to the same centroid value to generate partial sums, multiplying each partial sum by its corresponding centroid value, and summing the resulting products.
机译:包括参数服务器的分布式训练系统被配置为根据聚类算法压缩权重度量,权重矩阵的压缩表示随后可以分配给训练工作者。压缩表示可以包括质心索引矩阵和质心表,其中质心索引矩阵的每个元素对应于相应权重矩阵的元素,并且包括质心表的索引,并且其中质心表的每个元素包括重心值。在进一步的示例方面,训练工作者可以通过执行聚集-减少-加法运算来直接根据权重矩阵和训练数据矩阵的压缩表示来计算激活结果,该运算累积与训练数据矩阵相对应的所有元素。相同的质心值生成部分和,将每个部分和与相应的质心值相乘,然后求和。

著录项

  • 公开/公告号US2020342288A1

    专利类型

  • 公开/公告日2020-10-29

    原文格式PDF

  • 申请/专利权人 MICROSOFT TECHNOLOGY LICENSING LLC;

    申请/专利号US201916584711

  • 发明设计人 JINWEN XI;BHARADWAJ PUDIPEDDI;

    申请日2019-09-26

  • 分类号G06N3/04;G06K9/62;G06F17/16;

  • 国家 US

  • 入库时间 2022-08-21 11:22:51

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号