Conference: International Conference on Parallel and Distributed Computing Systems

FT-OpenVZ: A Virtualized Approach to Fault-Tolerance in Distributed Systems

Abstract

We present FT-OpenVZ, a complete checkpointing and fault-tolerance solution for distributed computing on virtual private servers (VPS). FT-OpenVZ extends OpenVZ's VPS checkpointing to cover full VPS checkpointing for MPI applications, including incremental file system checkpointing and user-assisted restart from checkpoints. With our solution, we extend the state of virtual machine/VPS fault-tolerance to any MPI-based distributed solution. By checkpointing all child processes, threads, and files within a distributed system, we provide a framework for future fault-tolerance work. For added resiliency to node failure, we include checkpoint replication and show that its use dramatically decreases the burden of checkpointing on network storage and centralized storage solutions. Using replication, FT-OpenVZ eliminates any need for network storage or centralized servers, reducing the impact of checkpointing on non-participating cluster nodes and users. Further, we show that with replication our solution is scalable, whereas network storage and centralized server-based solutions are not. Our analysis is based on the NAS Parallel Benchmarks with cluster sizes of up to 64 nodes. Using these benchmarks, we examine the overhead of checkpointing with replication and demonstrate low overhead for virtualized checkpointing.
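
The scalability claim for replication can be illustrated with a small sketch. The abstract does not describe how FT-OpenVZ places replicas, so the ring placement below is purely an assumption chosen for illustration, and the names `replica_targets` and `incoming_load` are hypothetical. The sketch only shows why peer-to-peer replication spreads checkpoint traffic evenly, whereas a central server would receive one copy from every node.

```python
def replica_targets(rank: int, num_nodes: int, replicas: int) -> list[int]:
    """Peers that would hold copies of `rank`'s checkpoint.

    Assumption: ring placement (next `replicas` nodes), not necessarily
    the scheme used by FT-OpenVZ.
    """
    if not 0 <= rank < num_nodes:
        raise ValueError("rank out of range")
    if not 0 <= replicas < num_nodes:
        raise ValueError("replicas must be in [0, num_nodes)")
    return [(rank + offset) % num_nodes for offset in range(1, replicas + 1)]


def incoming_load(num_nodes: int, replicas: int) -> dict[int, int]:
    """Number of checkpoint copies each node receives in one checkpoint round."""
    load = {node: 0 for node in range(num_nodes)}
    for rank in range(num_nodes):
        for target in replica_targets(rank, num_nodes, replicas):
            load[target] += 1
    return load


if __name__ == "__main__":
    load = incoming_load(num_nodes=64, replicas=2)
    # Under ring placement every node receives exactly `replicas` copies,
    # independent of cluster size; a single central server would receive 64.
    print(max(load.values()), min(load.values()))
```

Under this assumed placement the per-node receive load stays constant as the cluster grows, which is the property the abstract attributes to replication in general terms.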
