...
首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >SGD__Tucker: A Novel Stochastic Optimization Strategy for Parallel Sparse Tucker Decomposition
【24h】

SGD__Tucker: A Novel Stochastic Optimization Strategy for Parallel Sparse Tucker Decomposition

机译:SGD__Tucker:一种新的平行稀疏Tucker分解的随机优化策略

获取原文
获取原文并翻译 | 示例
           

摘要

Sparse Tucker Decomposition (STD) algorithms learn a core tensor and a group of factor matrices to obtain an optimal low-rank representation feature for the High-Order, High-Dimension, and Sparse Tensor (HOHDST). However, existing STD algorithms face the problem of intermediate variables explosion which results from the fact that the formation of those variables, i.e., matrices Khatri-Rao product, Kronecker product, and matrix-matrix multiplication, follows the whole elements in sparse tensor. The above problems prevent deep fusion of efficient computation and big data platforms. To overcome the bottleneck, a novel stochastic optimization strategy (SGD_Tucker) is proposed for STD which can automatically divide the high-dimension intermediate variables into small batches of intermediate matrices. Specifically, SGD_Tucker only follows the randomly selected small samples rather than the whole elements, while maintaining the overall accuracy and convergence rate. In practice, SGD_Tucker features the two distinct advancements over the state of the art. First, SGD_Tucker can prune the communication overhead for the core tensor in distributed settings. Second, the low data-dependence of SGD_Tucker enables fine-grained parallelization, which makes SGD_Tucker obtaining lower computational overheads with the same accuracy. Experimental results show that SGD_Tucker runs at least 2X faster than the state of the art.
机译:稀疏的Tucker分解(STD)算法学习核心张量和一组因子矩阵,以获得高阶,高维和稀疏张量(HOHDST)的最佳低秩表示特征。然而,现有的STD算法面临中间变量爆炸的问题,这导致了这些变量的形成,即矩阵khatri-rao产品,kronecker产品和矩阵矩阵乘法,遵循稀疏张量的整个元素。上述问题防止了高效计算和大数据平台的深度融合。为了克服瓶颈,提出了一种新的随机优化策略(SGD_TUCKER)的STD,它可以自动将高维中间变量分成小批次的中间矩阵。具体而言,SGD_TUCKER仅遵循随机选择的小型样本而不是整个元素,同时保持整体精度和收敛速率。在实践中,SGD_TUCKER具有对最先进的两个不同的进步。首先,SGD_TUCKER可以在分布式设置中修剪核心张量的通信开销。其次,SGD_TUCKER的低数据依赖性使得能够进行细粒度并行化,这使得SGD_TUCKER以相同的准确度获得更低的计算开销。实验结果表明,SGD_TUCKER比最先进的速度快2倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号