Conference: IEEE International Conference on Data Engineering

Rima: An RDMA-Accelerated Model-Parallelized Solution to Large-Scale Matrix Factorization

Abstract

Matrix factorization (MF) is a fundamental technique in machine learning and data mining and is widely applied in many fields. When the matrix becomes large, MF can no longer be processed on a single machine. For this reason, many distributed SGD algorithms (e.g., DSGD) have been developed to solve large-scale MF on multiple machines in a model-parallel way. Existing distributed algorithms are primarily implemented on Map/Reduce or parameter server (PS)-based architectures, which incur significant communication overhead. In addition, existing solutions cannot fully exploit the benefits of RDMA/RoCE transport and suffer from scalability problems. To address these drawbacks, we propose Rima, which uses ring-based model parallelism to solve large-scale MF with higher communication efficiency. Compared with PS-based SGD algorithms, Rima also consumes fewer queue pairs (QPs) and can thus better leverage RDMA/RoCE to accelerate training. Our experiments show that, compared with PS-based DSGD on a 1M × 1M MF problem, Rima achieves comparable convergence after an equal number of iterations, but reduces the training time by 68.7% and 85.4% over TCP and RDMA, respectively.
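
The abstract does not spell out Rima's update schedule, so the following is only a minimal single-process sketch of what ring-based model parallelism for MF could look like, assuming a DSGD-style block rotation. Each simulated worker keeps a fixed block of row factors and, after every step, receives a block of column factors from its ring neighbor. The names `ring_sgd_mf`, `make_low_rank_ratings`, and `rmse`, the modulo partitioning, and all hyperparameters are hypothetical choices for illustration. No RDMA or network transport is involved.

```python
import math
import random


def rmse(ratings, P, Q):
    """Root-mean-square error of P @ Q^T on the observed entries."""
    se = 0.0
    for i, j, r in ratings:
        pred = sum(p * q for p, q in zip(P[i], Q[j]))
        se += (r - pred) ** 2
    return math.sqrt(se / len(ratings))


def make_low_rank_ratings(n_rows, n_cols, rank=2, density=0.5, seed=1):
    """Synthetic (row, col, value) triples sampled from a random low-rank matrix."""
    rng = random.Random(seed)
    A = [[rng.random() for _ in range(rank)] for _ in range(n_rows)]
    B = [[rng.random() for _ in range(rank)] for _ in range(n_cols)]
    return [
        (i, j, sum(a * b for a, b in zip(A[i], B[j])))
        for i in range(n_rows)
        for j in range(n_cols)
        if rng.random() < density
    ]


def ring_sgd_mf(ratings, n_rows, n_cols, rank=4, workers=4,
                epochs=30, lr=0.05, reg=0.02, seed=0):
    """Simulate ring-style block-rotation SGD for MF in one process (assumed scheme)."""
    rng = random.Random(seed)
    P = [[rng.uniform(0.0, 0.5) for _ in range(rank)] for _ in range(n_rows)]
    Q = [[rng.uniform(0.0, 0.5) for _ in range(rank)] for _ in range(n_cols)]

    # blocks[w][b]: ratings whose row belongs to worker w and whose column is in block b.
    blocks = [[[] for _ in range(workers)] for _ in range(workers)]
    for i, j, r in ratings:
        blocks[i % workers][j % workers].append((i, j, r))

    held = list(range(workers))  # held[w] = id of the column block currently at worker w
    history = [rmse(ratings, P, Q)]
    for _ in range(epochs):
        for _ in range(workers):
            # In one step the workers touch disjoint rows and columns, so a real
            # cluster could run this loop over w in parallel without conflicts.
            for w in range(workers):
                for i, j, r in blocks[w][held[w]]:
                    p_i, q_j = P[i], Q[j]
                    err = r - sum(p * q for p, q in zip(p_i, q_j))
                    for k in range(rank):
                        p_old = p_i[k]
                        p_i[k] += lr * (err * q_j[k] - reg * p_old)
                        q_j[k] += lr * (err * p_old - reg * q_j[k])
            # Ring step: worker w receives the block held by worker (w + 1) % workers.
            held = held[1:] + held[:1]
        history.append(rmse(ratings, P, Q))
    return P, Q, history


if __name__ == "__main__":
    data = make_low_rank_ratings(40, 40)
    _, _, hist = ring_sgd_mf(data, 40, 40)
    print(f"RMSE before: {hist[0]:.3f}, after: {hist[-1]:.3f}")
```

In this layout each worker exchanges factor blocks only with its ring neighbors rather than with a central server. That fits the abstract's claim that a ring design needs fewer queue pairs, though the actual connection setup in Rima may differ.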
