SIAM Journal on Scientific Computing

COMPUTING THE GRADIENT IN OPTIMIZATION ALGORITHMS FOR THE CP DECOMPOSITION IN CONSTANT MEMORY THROUGH TENSOR BLOCKING

Abstract

The construction of the gradient of the objective function in gradient-based optimization algorithms for computing an r-term CANDECOMP/PARAFAC (CP) decomposition of an unstructured dense tensor is a key computational kernel. The best technique for efficiently implementing this operation has a memory consumption that scales linearly with the number of terms r and sublinearly with the number of elements of the tensor. We consider a blockwise computation of the CP gradient, reducing the memory requirements to a constant. This reduction is achieved by a novel technique that we call implicit block unfoldings, which combines the benefits of the block tensor unfoldings of [Ragnarsson and Van Loan, SIAM J. Matrix Anal. Appl., 33 (2012), pp. 149-169] and the implicit unfoldings of [Phan, Tichavsky, and Cichocki, IEEE Trans. Signal Process., 61 (2013), pp. 4834-4846]. A heuristic algorithm for automatically choosing the division into subtensors is part of the proposed algorithm. The throughput that can be attained is essentially determined by the performance of a matrix product of two small matrices of constant size. Numerical experiments illustrate that the proposed method can outperform the current state-of-the-art by up to two orders of magnitude for large dense tensors in terms of memory consumption, while the increase of the execution time is no more than 5%. The proposed algorithm attained upward of 90% of the theoretical peak performance of the computer system, using no more than 50 MB of memory, irrespective of the size of the tensor and the number of terms r.
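The dominant cost in this gradient is the matricized-tensor-times-Khatri-Rao-product (MTTKRP). The following is a minimal NumPy sketch of the blockwise idea for a third-order tensor only; it is not the paper's implicit block unfoldings (which handle general-order tensors and choose the subtensor division heuristically). The function name `cp_gradient_mode1_blocked` and the block sizes `bj`, `bk` are illustrative assumptions. The point is that each subtensor is paired with a small Khatri-Rao block, so no intermediate of size JK x r is ever materialized.

```python
import numpy as np

def cp_gradient_mode1_blocked(X, A, B, C, bj=64, bk=64):
    """Blockwise gradient of f(A, B, C) = 0.5 * ||X - [[A, B, C]]||^2 w.r.t. A.

    Illustrative sketch only: the subtensor X[:, js, ks] is multiplied with a
    Khatri-Rao block of shape (len(js) * len(ks), r), so the working memory is
    bounded by the block sizes bj, bk rather than growing with J * K.
    """
    I, J, K = X.shape
    r = A.shape[1]
    M = np.zeros((I, r))                      # accumulates the MTTKRP term blockwise
    for j0 in range(0, J, bj):
        js = slice(j0, min(j0 + bj, J))
        Bj = B[js]                            # block of the mode-2 factor
        for k0 in range(0, K, bk):
            ks = slice(k0, min(k0 + bk, K))
            Ck = C[ks]                        # block of the mode-3 factor
            # Khatri-Rao block: row (j, k) = Bj[j, :] * Ck[k, :], with k varying
            # fastest, matching the C-order reshape of the subtensor below.
            KR = (Bj[:, None, :] * Ck[None, :, :]).reshape(-1, r)
            M += X[:, js, ks].reshape(I, -1) @ KR
    # r-by-r normal-equation part; cheap compared with the MTTKRP accumulation.
    G = (B.T @ B) * (C.T @ C)
    return A @ G - M
```

For small tensors the accumulated MTTKRP term can be checked against a dense reference such as `np.einsum('ijk,jr,kr->ir', X, B, C)`; the blockwise result should agree up to floating-point round-off.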