【24h】

Scalable Nonnegative Matrix Factorization with Block-wise Updates

机译:可扩展的非负矩阵因子与块明智的更新

获取原文

摘要

Nonnegative Matrix Factorization (NMF) has been applied with great success to many applications. As NMF is applied to massive datasets such as web-scale dyadic data, it is desirable to leverage a cluster of machines to speed up the factorization. However, it is challenging to efficiently implement NMF in a distributed environment. In this paper, we show that by leveraging a new form of update functions, we can perform local aggregation and fully explore parallelism. Moreover, under the new form of update functions, we can perform frequent updates, which aim to use the most recently updated data whenever possible. As a result, frequent updates are more efficient than their traditional concurrent counterparts. Through a series of experiments on a local cluster as well as the Amazon EC2 cloud, we demonstrate that our implementation with frequent updates is up to two orders of magnitude faster than the existing implementation with the traditional form of update functions.
机译:非负矩阵分解(NMF)已经应用于许多应用成功。由于NMF应用于诸如网级二元数据的大规模数据集,因此希望利用一组机器来加速分解。然而,在分布式环境中有效地实现NMF是挑战性的。在本文中,我们表明,通过利用新的更新功能,我们可以执行本地聚合和完全探索并行性。此外,在新的更新功能的情况下,我们可以执行频繁的更新,该旨在随时使用最近更新的数据。因此,频繁的更新比传统的并发对应物更有效。通过在本地群集以及亚马逊EC2云上的一系列实验,我们证明我们的实施具有频繁更新的速度高于现有更新功能的速度快两个数量级。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号