首页> 外文会议>2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops amp; PhD Forum >Distributed Virtual Diskless Checkpointing: A Highly Fault Tolerant Scheme for Virtualized Clusters
【24h】

Distributed Virtual Diskless Checkpointing: A Highly Fault Tolerant Scheme for Virtualized Clusters

机译:分布式虚拟无盘检查点:虚拟化群集的高度容错方案

获取原文
获取原文并翻译 | 示例

摘要

Today's high-end computing systems are facing a crisis of high failure rates due to increased numbers of components. Recent studies have shown that traditional fault tolerant techniques incur overheads that more than double execution times on these highly parallel machines. Thus, future high-end computing must be able to provide adequate fault tolerance at an acceptable cost or the burdens of fault management will severely affect the viability of such systems. Cluster virtualization offers a potentially unique solution for fault management, but brings significant overhead, especially for I/O. In this paper, we propose a novel diskless check pointing technique on clusters of virtual machines. Our technique splits Virtual Machines into sets of orthogonal RAID systems and distributes parity evenly across the cluster, similar to a RAID-5 configuration, but using VM images as data elements. Our theoretical analysis shows that our technique significantly reduces the overhead associated with check pointing by removing the disk I/O bottleneck.
机译:由于组件数量的增加,当今的高端计算系统面临着高故障率的危机。最近的研究表明,传统的容错技术在这些高度并行的机器上产生的开销超过了两倍的执行时间。因此,未来的高端计算必须能够以可接受的成本提供足够的容错能力,否则故障管理的负担将严重影响此类系统的生存能力。群集虚拟化为故障管理提供了潜在的独特解决方案,但带来了巨大的开销,尤其是对于I / O。在本文中,我们提出了一种在虚拟机群集上的新型无盘检查指向技术。我们的技术将虚拟机分为正交RAID系统的集合,并在整个群集中均匀分配奇偶校验,类似于RAID-5配置,但使用VM映像作为数据元素。我们的理论分析表明,通过消除磁盘I / O瓶颈,我们的技术大大减少了与检查点相关的开销。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号