首页> 外文会议>IEEE International Symposium on Information Theory >Straggler-Proofing Massive-Scale Distributed Matrix Multiplication with D-Dimensional Product Codes
【24h】

Straggler-Proofing Massive-Scale Distributed Matrix Multiplication with D-Dimensional Product Codes

机译:用D维产品代码塑造施用巨型分布式矩阵乘法

获取原文

摘要

Distributed computing allows for large-scale computation and machine learning tasks by enabling parallel computing at massive scale. A critical challenge to speeding up distributed computing comes from stragglers, a crippling bottleneck to system performance [1]. Recently, coding theory has offered an attractive paradigm dubbed as coded computation [2] for addressing this challenge through the judicious introduction of redundant computing to combat stragglers. However, most existing approaches have limited applicability if the system scales to hundreds or thousands of workers, as is the trend in computing platforms. At these scales, previously proposed algorithms based on Maximum Distance Separable (MDS) codes are too expensive due to their hidden cost, i.e., computing and communication costs associated with the encoding/decoding procedures. Motivated by this limitation, we present a novel coded matrix-matrix multiplication scheme based on d-dimensional product codes. We show that our scheme allows for order-optimal computation/communication costs for the encoding/decoding procedures while achieving near-optimal compute time.
机译:分布式计算允许通过在大规模规模上启用并行计算来实现大规模计算和机器学习任务。加速分布式计算的危急挑战来自陷阱,一个围绕系统性能的瓶颈[1]。最近,编码理论提供了一个有吸引力的范例称为编码计算[2]通过明智地引入冗余计算来解决战斗障碍者来解决这一挑战。然而,如果系统缩放到数百个工人,大多数现有方法都具有有限的适用性,这是计算平台的趋势。在这些比较中,由于其隐藏的成本,即与编码/解码过程相关联的计算和通信成本,先前提出了基于最大距离可分离(MDS)代码的算法太昂贵。通过这种限制,我们提出了一种基于D维产品代码的新型编码矩阵矩阵乘法方案。我们表明我们的方案允许编码/解码过程的订单最佳计算/通信成本,同时实现近最佳计算时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号