Native tensor processor, and partitioning of tensor contractions

Abstract

A native tensor processor calculates tensor contractions using a sum of outer products. In one implementation, the native tensor processor is preferably implemented as a single integrated circuit and includes an input buffer and a contraction engine. The input buffer buffers tensor elements retrieved from off-chip and transmits the elements to the contraction engine as needed. The contraction engine calculates the tensor contraction by executing the calculations of an equivalent matrix multiplication, as if the tensors were unfolded into matrices, but avoiding the overhead of expressly unfolding the tensors. The contraction engine includes a plurality of outer product units that calculate matrix multiplications by a sum of outer products. By using outer products, the equivalent matrix multiplications can be partitioned into smaller matrix multiplications, each of which is localized with respect to which tensor elements are required.
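
The arithmetic described in the abstract can be illustrated in software. The following is a minimal pure-Python sketch, not the patented hardware design: the function names (`matmul_outer`, `matmul_partitioned`, `contract_last_index`) and the `tile` parameter are hypothetical, chosen only for illustration. It shows a matrix product accumulated as a sum of outer products, the same product split into independent output tiles that each need only a subset of the inputs, and a tensor contraction computed directly on the tensor, without building the unfolded matrix.

```python
def matmul_outer(A, B):
    """Compute C = A @ B as a sum of outer products.

    For each contraction index k, column k of A times row k of B
    is a rank-1 (outer product) update accumulated into C.
    """
    m, n, p = len(A), len(B), len(B[0])
    C = [[0] * p for _ in range(m)]
    for k in range(n):
        col = [A[i][k] for i in range(m)]  # column k of A
        row = B[k]                         # row k of B
        for i in range(m):
            for j in range(p):
                C[i][j] += col[i] * row[j]
    return C


def matmul_partitioned(A, B, tile):
    """Same product, split into independent output tiles.

    Each tile needs only the matching rows of A and columns of B,
    so the smaller products are local in the elements they read.
    `tile` is an illustrative parameter, not a value from the source.
    """
    m, p = len(A), len(B[0])
    C = [[0] * p for _ in range(m)]
    for i0 in range(0, m, tile):
        for j0 in range(0, p, tile):
            A_blk = A[i0:i0 + tile]                    # rows i0..i0+tile-1
            B_blk = [r[j0:j0 + tile] for r in B]       # cols j0..j0+tile-1
            blk = matmul_outer(A_blk, B_blk)
            for di, blk_row in enumerate(blk):
                for dj, val in enumerate(blk_row):
                    C[i0 + di][j0 + dj] = val
    return C


def contract_last_index(T, W):
    """Contract T[i][j][k] with W[k][l] over k, reading T in place.

    Equivalent to unfolding T into an (I*J) x K matrix and multiplying
    by W, but the unfolded matrix is never materialised.
    """
    I, J, K, L = len(T), len(T[0]), len(W), len(W[0])
    out = [[[0] * L for _ in range(J)] for _ in range(I)]
    for k in range(K):  # one outer-product update per contraction index
        for i in range(I):
            for j in range(J):
                t = T[i][j][k]
                for l in range(L):
                    out[i][j][l] += t * W[k][l]
    return out


A = [[1, 2], [3, 4], [5, 6]]
B = [[7, 8, 9], [10, 11, 12]]
C_full = matmul_outer(A, B)
C_tiled = matmul_partitioned(A, B, tile=2)

T = [[[1, 2], [3, 4]], [[5, 6], [7, 8]]]
W = [[1, 0, 2], [0, 1, 3]]
contracted = contract_last_index(T, W)
# Explicit unfolding, used here only to check the result.
unfolded = [T[i][j] for i in range(2) for j in range(2)]
via_unfold = matmul_outer(unfolded, W)
```

Here `C_full` and `C_tiled` agree, and flattening the first two indices of `contracted` gives the same rows as `via_unfold`, which is the equivalence between a contraction and its unfolded matrix multiplication that the abstract relies on.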

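Bibliographic data

  • Publication number: US10073816B1

  • Publication date: 2018-09-11

  • Original format: PDF

  • Applicant/assignee: NOVUMIND LIMITED

  • Application number: US201715655814

  • Inventors: CHIEN-PING LU; YU-SHUEN TANG

  • Filing date: 2017-07-20

  • Classification: G06F17/16; G06F17/14

  • Country: US

  • Database entry date: 2022-08-21 13:06:15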
