首页> 外文期刊>Cluster computing >A data transmission algorithm for distributed computing system based on maximum flow
【24h】

A data transmission algorithm for distributed computing system based on maximum flow

机译:基于最大流量的分布式计算系统数据传输算法

获取原文
获取原文并翻译 | 示例
       

摘要

Data skew can lead to load imbalance and longer computation time in the distributed computing system. To avoid data skew and reduce the data computation time, it is necessary to transmit the data to appropriate machines, this may however take too much network resources. How to balance the computational resources and the network resources is a problem. In this paper, we introduce a computation model called distributed two-phase model, in which the process of a task can be divided into two independent phases: data transmission and data computation. Suppose an upper bound of relative computation time is given, we show how to schedule data transmission with minimum resources, such as data transmission time and occupied bandwidth, to meet the demand. In this paper, we present a novel algorithm to minimize data transmission time and network bandwidth usage in the data transmission phase, with the conditions that an upper bound of relative computation time of data computation phase is given. Moreover, the number of nodes that participate in data computation phase is also reduced, in this way, the computational resources are saved. The simulation results show that the occupied bandwidth can be reduced effectively (about 70 %) in the situation of large-scale data sets and large number of nodes. Our algorithm is also shown to be available in replication situation.
机译:数据偏斜会导致分布式计算系统中的负载不平衡和更长的计算时间。为了避免数据偏斜并减少数据计算时间,有必要将数据传输到适当的机器,但是这可能会占用过多的网络资源。如何平衡计算资源和网络资源是一个问题。在本文中,我们介绍了一种称为分布式两阶段模型的计算模型,其中任务的过程可以分为两个独立的阶段:数据传输和数据计算。假设给定了相对计算时间的上限,我们展示了如何以最少的资源(例如数据传输时间和占用的带宽)来调度数据传输以满足需求。在本文中,我们提出了一种新的算法,该算法在给定数据计算阶段的相对计算时间上限的前提下,将数据传输阶段的数据传输时间和网络带宽占用最小化。而且,也减少了参与数据计算阶段的节点数量,从而节省了计算资源。仿真结果表明,在大规模数据集和大量节点的情况下,可以有效减少占用带宽(约70%)。我们的算法也显示在复制情况下可用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号