...
首页> 外文期刊>Computer communication review >Leveraging Endpoint Flexibility in Data-Intensive Clusters
【24h】

Leveraging Endpoint Flexibility in Data-Intensive Clusters

机译:在数据密集型集群中利用端点灵活性

获取原文
获取原文并翻译 | 示例

摘要

Many applications do not constrain the destinations of their network transfers. New opportunities emerge when such transfers contribute a large amount of network bytes. By choosing the endpoints to avoid congested links, completion times of these transfers as well as that of others without similar flexibility can be improved. In this paper, we focus on leveraging the flexibility in replica placement during writes to cluster file systems (CFSes), which account for almost half of all cross-rack traffic in data-intensive clusters. The replicas of a CFS write can be placed in any subset of machines as long as they are in multiple fault domains and ensure a balanced use of storage throughout the cluster. We study CFS interactions with the cluster network, analyze optimizations for replica placement, and propose Sinbad - a system that identifies imbalance and adapts replica destinations to navigate around congested links. Experiments on EC2 and trace-driven simulations show that block writes complete 1.3 × (respectively, 1.58 ×) faster as the network becomes more balanced. As a collateral benefit, end-to-end completion times of data-intensive jobs improve as well. Sinbad does so with little impact on the long-term storage balance.
机译:许多应用程序并不限制其网络传输的目的地。当此类传输占用大量网络字节时,就会出现新的机会。通过选择端点以避免链路拥塞,可以改善这些传输以及没有类似灵活性的其他传输的完成时间。在本文中,我们着重于在写入群集文件系统(CFS)期间利用副本放置的灵活性,该文件几乎占数据密集型群集中所有跨机架流量的一半。 CFS写入的副本可以放置在计算机的任何子集中,只要它们位于多个故障域中,并确保在整个群集中平衡使用存储即可。我们研究了CFS与群集网络的交互,分析了副本放置的优化,并提出了Sinbad-一个识别不平衡并适应副本目标以在拥塞的链接周围导航的系统。在EC2上进行的实验和跟踪驱动的仿真表明,随着网络变得更加平衡,块写入完成的速度提高了1.3倍(分别为1.58倍)。作为附带的好处,数据密集型工作的端到端完成时间也缩短了。 Sinbad这样做对长期存储平衡影响很小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号