Venue: IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing

EC-Shuffle: Dynamic Erasure Coding Optimization for Efficient and Reliable Shuffle in Spark



Abstract

Fault-tolerance capabilities attract increasing attention in existing data processing frameworks such as Apache Spark. To avoid replaying costly distributed computations such as shuffle, two approaches are popular: local checkpointing and remote replication. Both incur significant runtime overhead, such as extra storage cost or network traffic. Erasure coding is another emerging technology that also enables data resilience; thanks to its high storage efficiency, it is perceived as capable of replacing checkpoint and replication mechanisms. However, it incurs heavy network traffic because data partitions must be distributed to different locations. In this paper, we propose EC-Shuffle with two encoding schemes and optimize the shuffle-based operations in Spark and MapReduce-like frameworks. Specifically, our encoding schemes concentrate on reducing the data traffic incurred during the execution of shuffle operations: they transfer only the parity chunks generated via erasure coding, instead of a whole copy of all data chunks. EC-Shuffle also provides a strategy that dynamically selects the per-shuffle biased encoding scheme according to the number of senders and receivers in each shuffle. Our analyses indicate that this dynamic encoding selection minimizes the total size of parity chunks. Extensive experimental results using BigDataBench with hundreds of mappers and reducers show that this optimization can reduce network traffic by up to 50% and improve performance by up to 38%.
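The parity-only transfer idea in the abstract can be illustrated with the simplest possible erasure code: a single XOR parity chunk over k data chunks, which lets any one lost chunk be rebuilt from the survivors. This sketch is purely illustrative and is not the paper's encoding scheme (EC-Shuffle uses more general erasure codes); the function names are hypothetical:

```python
from functools import reduce

def xor_parity(chunks):
    """Compute one parity chunk as the bytewise XOR of all data chunks.

    This is the simplest (k, k+1) erasure code: any single missing data
    chunk can be rebuilt from the surviving chunks plus the parity,
    so only the small parity chunk needs to travel for resilience."""
    return bytes(reduce(lambda a, b: a ^ b, col) for col in zip(*chunks))

def recover(surviving_chunks, parity):
    """Rebuild the single missing data chunk: XOR of survivors and parity."""
    return xor_parity(surviving_chunks + [parity])

data = [b"AAAA", b"BBBB", b"CCCC"]
parity = xor_parity(data)          # 4 bytes of parity, not a full replica
# Suppose chunk 1 is lost; rebuild it from the other chunks plus the parity.
rebuilt = recover([data[0], data[2]], parity)
assert rebuilt == data[1]
```

Compared with full replication, which would ship a complete copy of every data chunk, this scheme ships one parity chunk per stripe, which is the kind of traffic saving the abstract's parity-only transfer targets.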
