...
首页> 外文期刊>Parallel and Distributed Systems, IEEE Transactions on >Prediction of Optimal Parallelism Level in Wide Area Data Transfers
【24h】

Prediction of Optimal Parallelism Level in Wide Area Data Transfers

机译:广域数据传输中最佳并行度的预测

获取原文
获取原文并翻译 | 示例

摘要

Wide area data transfer may be a major bottleneck for the end-to-end performance of distributed applications. A practical way of increasing the wide area throughput at the application layer is using multiple parallel streams. Although increased number of parallel streams may yield much better performance than using a single stream, overwhelming the network by opening too many streams may have an inverse effect. The congestion created by excess number of streams may cause a drop down in the throughput achieved. Hence, it is important to decide on the optimal number of streams without congesting the network. Predicting this "optimumȁD; number is not straightforward, since it depends on many parameters specific to each individual transfer. Generic models that try to predict this number either rely too much on historical information or fail to achieve accurate predictions. In this paper, we present a set of new models which aim to approximate the optimal number with least history information and lowest prediction overhead. An algorithm is introduced to select the best combination of historic information to do the prediction for evaluation purposes as well as optimizing prediction by reducing error rate. We measure the feasibility and accuracy of the proposed prediction models by comparing to actual GridFTP data transfer by using little historical information and have seen that we could predict the throughput of parallel streams accurately and find a very close approximation of the optimal stream number.
机译:广域数据传输可能是分布式应用程序端到端性能的主要瓶颈。在应用层增加广域吞吐量的一种实用方法是使用多个并行流。尽管增加数量的并行流可能会比使用单个流产生更好的性能,但是通过打开太多流淹没网络可能会产生相反的效果。过多的流造成的拥塞可能导致所达到的吞吐量下降。因此,重要的是在不使网络拥塞的情况下确定最佳的流数量。预测此“最佳数量”并不容易,因为它取决于每个单独传输的许多参数。试图预测该数量的通用模型要么过于依赖历史信息,要么无法实现准确的预测。在本文中,我们提出一套旨在以最少的历史信息和最低的预测开销来逼近最佳数量的新模型,引入了一种算法,以选择历史信息的最佳组合来进行预测以用于评估目的,并通过降低错误率来优化预测。通过使用很少的历史信息,通过与实际的GridFTP数据传输进行比较,我们测量了所提出的预测模型的可行性和准确性,并且发现我们可以准确地预测并行流的吞吐量,并找到最佳流数的非常近似的值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号