Prediction of Optimal Parallelism Level in Wide Area Data Transfers

Yildirim Esma; Yin Dengpan; Kosar Tevfik

Abstract

Wide area data transfer can be a major bottleneck for the end-to-end performance of distributed applications. A practical way to increase wide area throughput at the application layer is to use multiple parallel streams. Although a larger number of parallel streams may yield much better performance than a single stream, overwhelming the network by opening too many streams may have the opposite effect: the congestion created by the excess streams may cause the achieved throughput to drop. Hence, it is important to determine the optimal number of streams without congesting the network. Predicting this "optimum" number is not straightforward, since it depends on many parameters specific to each individual transfer. Generic models that try to predict this number either rely too heavily on historical information or fail to achieve accurate predictions. In this paper, we present a set of new models that aim to approximate the optimal number with the least history information and the lowest prediction overhead. An algorithm is introduced to select the best combination of historical information for the prediction, both for evaluation purposes and to optimize the prediction by reducing the error rate. We measure the feasibility and accuracy of the proposed prediction models by comparing them to actual GridFTP data transfers. Using little historical information, we were able to predict the throughput of parallel streams accurately and find a very close approximation of the optimal stream number.


Bibliographic details

Source: Parallel and Distributed Systems, IEEE Transactions on | 2011, Issue 12 | pp. 2033-2045 | 13 pages
Authors: Yildirim Esma; Yin Dengpan; Kosar Tevfik
Affiliation: The State University of New York at Buffalo, Buffalo
Original format: PDF
Language: English
Keywords: Distributed applications; modeling and prediction; network protocols; parallelism and concurrency

Similar literature

1. Application-Level Optimization of Big Data Transfers through Pipelining, Parallelism and Concurrency [J]. Esma Yildirim, Engin Arslan, Jangyoung Kim. Cloud Computing, IEEE Transactions on. 2016, Issue 1
2. Reductions in alanine aminotransferase levels with liraglutide treatment are greatest in those with raised baseline levels and are independent of weight loss: real-world outcome data from the ABCD Nationwide Liraglutide Audit [J]. Crabtree Thomas S. J., Rowles Susannah, Tarpey Stephanie. Nature reviews Cancer. 2019, Issue 2
3. Optimally Maximizing Iteration-Level Loop Parallelism [J]. Liu Duo, Wang Yi, Shao Zili. Parallel and Distributed Systems, IEEE Transactions on. 2012, Issue 3
4. Dynamic Adaptation of Parallelism Level in Data Transfer Scheduling [C]. Balman M., Kosar T. Complex, Intelligent and Software Intensive Systems, 2009. CISIS '09. 2009
5. Design and Evaluation of a Bitcoin Miner SystemC Model with Thread and Data-Level Parallelism [D]. Cheng, Zhongqi. 2017
6. Exploiting Thread-Level and Instruction-Level Parallelism to Cluster Mass Spectrometry Data using Multicore Architectures [O]. Fahad Saeed, Jason D. Hoffert, Trairak Pisitkun
7. Dynamically tuning level of parallelism in wide area data transfers [O]. Esma Yildirim, Mehmet Balman, Tevfik Kosar. 2008
