首页> 外文会议>International workshop on machine learning, optimization, and big data >A Quantitative Analysis on Required Network Bandwidth for Large-Scale Parallel Machine Learning
【24h】

A Quantitative Analysis on Required Network Bandwidth for Large-Scale Parallel Machine Learning

机译:大规模并行机器学习所需网络带宽的定量分析

获取原文

摘要

Parallelization is essential for machine learning systems that deals with large-scale dataset. Data parallel machine leaning systems that are composed of multiple machine learning modules, exchange the parameter to synchronize the models in the modules through network. We investigate the network bandwidth requirements for various parameter exchange method using a cluster simulator called SimGrid. We have confirmed that (1) direct exchange methods are substantially more efficient than parameter server based methods, and (2) with proper exchange methods, the bisection-bandwidth of network does not affect the efficiency, which implies smaller investment on network facility will be sufficient.
机译:并行化对于处理大规模数据集的机器学习系统至关重要。由多个机器学习模块组成的数据并行机器学习系统交换参数以通过网络同步模块中的模型。我们使用称为SimGrid的群集模拟器调查各种参数交换方法的网络带宽需求。我们已经确认,(1)直接交换方法比基于参数服务器的方法有效得多,并且(2)使用适当的交换方法,网络的二等分带宽不会影响效率,这意味着对网络设施的投资会减少足够。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号