首页> 外文期刊>Journal of computational science >A multi-dimensional Morton-ordered block storage for mode-oblivious tensor computations
【24h】

A multi-dimensional Morton-ordered block storage for mode-oblivious tensor computations

机译:用于模式令人沮丧的张量计算的多维升量排序块存储

获取原文
获取原文并翻译 | 示例
       

摘要

Computation on tensors, treated as multidimensional arrays, revolve around generalized basic linear algebra subroutines (BLAS). We propose a novel data structure in which tensors are blocked and blocks are stored in an order determined by Morton order. This is not only proposed for efficiency reasons, but also to induce efficient performance regardless of which mode a generalized BLAS call is invoked for; we coin the term mode-oblivious to describe data structures and algorithms that induce such behavior. Experiments on one of the most bandwidth-bound generalized BLAS kernel, the tensor-vector multiplication, not only demonstrate superior performance over two state-of-the-art variants by up to 18%, but additionally show that the proposed data structure induces a 71% less sample standard deviation for tensor-vector multiplication across d modes, where d varies from 2 to 10. Finally, we show our data structure naturally expands to other tensor kernels and demonstrate up to 38% higher performance for the higher-order power method. (C) 2019 Elsevier B.V. All rights reserved.
机译:对张量的计算,被视为多维数阵列,围绕着广义基本线性代数子程序(BLA)旋转。我们提出了一种新的数据结构,其中张阻被阻塞并且块以由Morton order确定的顺序存储。这不仅提出了效率的原因,而且还要引起有效性能,无论哪种模式如何调用广义的BLAS呼叫;我们投入了术语模式,以描述诱导此类行为的数据结构和算法。关于最具带宽广义的BLAS内核之一,张量矢量乘法的实验,不仅可以展示多达18%的最新变体的优越性,但另外表明所提出的数据结构诱导Decor-Vector乘法的样品标准偏差减少71%,其中D从2到10之间变化。最后,我们显示我们的数据结构自然地扩展到其他张核,并展示了高阶功率的高度高度的性能。方法。 (c)2019 Elsevier B.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号