【24h】

Reliable Data-Center Scale Computations

机译:可靠的数据中心规模计算

获取原文

摘要

Neither of the two broad classes of fault models considered by traditional fault tolerance techniques - crash and Byzantine faults - suit the environment of systems that run in today's data centers. On the one hand, assuming Byzantine faults is considered overkill due to the assumption of a worst-case adversarial behavior, and the use of other techniques to guard against malicious attacks. On the other hand, the crash fault model is insufficient since it does not capture non-crash faults that may result from a variety of unexpected conditions that are commonplace in this setting. In this paper, we present the case for a more practical approach at handling non-crash (but non-adversarial) faults in data-center scale computations. In this context, we discuss how such problem can be tackled for an important class of data-center scale systems: systems for large-scale processing of data, with a particular focus on the Pig programming framework. Such an approach not only covers a significant fraction of the processing jobs that run in today's data centers, but is potentially applicable to a broader class of applications.
机译:传统的容错技术考虑的两种广泛的故障模型 - 崩溃和拜占庭故障 - 适应在当今数据中心运行的系统环境。一方面,假设拜占庭故障被认为是矫枉过正的,因为假设最坏情况的对抗性行为,以及使用其他技术来防范恶意攻击。另一方面,崩溃故障模型不足,因为它不会捕获由于此设置中常见的各种意想不到的条件可能导致的非崩溃故障。在本文中,我们在处理数据中心规模计算中处理非崩溃(但非对抗性)故障时更实用的方法。在这种情况下,我们讨论了如何为一类重要的数据中心规模系统解决这些问题:用于数据的大规模处理的系统,特别关注猪编程框架。这样的方法不仅涵盖了在今天的数据中心运行的加工作业的大部分,而且可能适用于更广泛的应用程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号