首页> 外文会议>International Conference on Data Engineering >DisMASTD: An Efficient Distributed Multi-Aspect Streaming Tensor Decomposition
【24h】

DisMASTD: An Efficient Distributed Multi-Aspect Streaming Tensor Decomposition

机译:拆除:高效的分布式多方面流式传输张量分解

获取原文

摘要

Tensor decomposition is a fundamental multidimensional data analysis tool for many data-driven applications, such as social computing, computer vision, and bioinformatics, to name but a few. However, the rapidly increasing streaming data nowadays introduces new challenges to traditional static tensor decomposition. It requires an efficient distributed dynamic tensor decomposition without re-computing the whole tensor from scratch. In this paper, we propose DisMASTD, an efficient distributed multi-aspect streaming tensor decomposition. First, we prove the optimal tensor partitioning problem is NP-hard. Second, we present two heuristic tensor partitioning approaches to ensure the load balancing. Third, we develop a distributed multi-aspect streaming tensor decomposition computation method, which avoids repetitive computation and reduces network communication by maintaining and reusing the intermediate results. Last but not least, we perform extensive experiments with both real and synthetic datasets to demonstrate the efficiency and scalability of DisMASTD.
机译:张量分解是一种基本的多维数据分析工具,用于许多数据驱动的应用程序,例如社交计算,计算机视觉和生物信息学,以姓名但是几个。然而,现在迅速增加的流动数据对传统的静态张量分解引入了新的挑战。它需要有效的分布式动态张量分解,而无需重新计算从头开始计算整个张量。在本文中,我们提出了拆解,一种有效的分布式多方面流张传统分解。首先,我们证明了最佳张量分区问题是NP-HARD。其次,我们展示了两个启发式张量分区方法,以确保负载平衡。第三,我们开发了一种分布式多方面流传输张量分解计算方法,其避免重复计算并通过维护和重用中间结果来降低网络通信。最后但并非最不重要的是,我们用真实的合成数据集进行广泛的实验,以展示拆除的效率和可扩展性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号