首页> 外国专利> Systems and methods for memory efficient parallel tensor decompositions

Systems and methods for memory efficient parallel tensor decompositions

机译:存储器有效平行张量分解的系统和方法

摘要

In a system for improving performance of tensor-based computations and for minimizing the associated memory usage, computations associated with different non-zero tensor values are performed while exploiting an overlap between the respective index tuples of those non-zero values. While performing computations associated with a selected mode, when an index corresponding to a particular mode in a current index tuple is the same as the corresponding index from another, previously processed index tuple, the value already stored in a buffer corresponding to that particular mode is reused either wholly or in part, minimizing the processor usage and improving performance. Certain matrix operations may be iterated more than once so as to avoid the need to store a large partial result obtained from those operations. The performance overhead of the repeated operations is not significant, but the reduction in memory usage is.
机译:在用于提高基于卷的计算性能的系统中并用于最小化相关的存储器使用,在利用这些非零值的相应索引元件之间的重叠之间进行与不同的非零卷重值相关联的计算。 虽然执行与所选模式相关联的计算,但是,当当前索引元组中对应于特定模式的索引与来自另一个先前处理的索引元组的相应索引相同时,已经存储在对应于该特定模式的缓冲器中的值 全部或部分重复使用,最大限度地减少处理器使用和提高性能。 某些矩阵操作可以迭代多次,以避免需要存储从这些操作获得的大的部分结果。 重复操作的性能开销不显着,但内存使用率的降低是。

著录项

  • 公开/公告号US11086968B1

    专利类型

  • 公开/公告日2021-08-10

    原文格式PDF

  • 申请/专利权人 RESERVOIR LABS INC.;

    申请/专利号US201816000486

  • 发明设计人 MUTHU MANIKANDAN BASKARAN;

    申请日2018-06-05

  • 分类号G06F17/16;G06F16/174;

  • 国家 US

  • 入库时间 2022-08-24 20:29:00

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号