首页> 外文会议>IEEE International Conference on Data Mining >SCED: A General Framework for Sparse Tensor Decomposition with Constraints and Elementwise Dynamic Learning
【24h】

SCED: A General Framework for Sparse Tensor Decomposition with Constraints and Elementwise Dynamic Learning

机译:SCED:具有约束和元素动态学习的稀疏张量分解的通用框架

获取原文

摘要

CANDECOMP/PARAFAC Decomposition (CPD) is one of the most popular tensor decomposition methods that has been extensively studied and widely applied. In recent years, sparse tensors that contain a huge portion of zeros but a limited number of non-zeros have attracted increasing interest. Existing techniques are not directly applicable to sparse tensors, since they mainly target dense ones and usually have poor efficiency. Additionally, specific issues also arise for sparse tensors, depending on different data sources and applications: the role of zero entries can be different; incorporating constraints like non-negativity and sparseness might be necessary; the ability to learn on-the-fly is a must for dynamic scenarios that new data keeps arriving at high velocity. However, state-of-art algorithms only partially address the above issues. To fill this gap, we propose a general framework for finding the CPD of sparse tensors. Modeling the sparse tensor decomposition problem by a generalized weighted CPD formulation and solving it efficiently, our proposed method is also flexible to handle constraints and dynamic data streams. Through experiments on both synthetic and real-world datasets, for the static case, our method demonstrates significant improvements in terms of effectiveness, efficiency and scalability. Moreover, under the dynamic setting, our method speeds up current technology by hundreds to thousands times, without sacrificing decomposition quality.
机译:CANDECOMP / PARAFAC分解(CPD)是已被广泛研究和广泛应用的最流行的张量分解方法之一。近年来,包含很大一部分零但数量有限的非零的稀疏张量引起了越来越多的兴趣。现有技术不能直接应用于稀疏张量,因为它们主要针对密集张量并且通常效率很低。此外,对于稀疏张量,还会出现特定的问题,具体取决于不同的数据源和应用程序:零项的作用可能有所不同;零项的作用可能有所不同;零项的作用可能会有所不同。合并非负性和稀疏性等约束可能是必要的;动态学习的能力是动态场景的必备条件,因为新数据不断以高速度到达。但是,现有技术仅部分解决了上述问题。为了填补这一空白,我们提出了一个通用框架来查找稀疏张量的CPD。通过广义加权CPD公式对稀疏张量分解问题建模并有效解决,我们提出的方法还可以灵活地处理约束和动态数据流。通过在合成数据集和实际数据集上进行的实验,对于静态情况,我们的方法证明了有效性,效率和可伸缩性方面的显着改进。此外,在动态设置下,我们的方法可将现有技术加速数百倍至数千倍,而不会牺牲分解质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号