首页> 外文会议>49th Annual IEEE International Carnahan Conference on Security Technology >Virtual machines of high availability using hardware-assisted failure detection
【24h】

Virtual machines of high availability using hardware-assisted failure detection

机译:使用硬件辅助故障检测的高可用性虚拟机

获取原文
获取原文并翻译 | 示例

摘要

The virtualization technology has been widely used in today's doud computing datacenters. With the virtualization technology, each physical machine in a datacenter can be logically divided into several virtual machines, on which different types of software services can host. However, many reasons may decrease the availability of the whole system. For example, a failed physical machine automatically fails all virtual machines on the physical machine, and consequently fails every software service on the virtual machines. It is difficult to detect failures efficiently in a general-purpose computer architecture because the hardware cannot provide enough information for fast failure detection. On the contrary, the ATCA (Advanced Telecommunications Computing Architecture) physical machines provide high hardware availability, and support IPMI (Intelligent Platform Management Interface) that can quickly detect the hardware status. In this paper, we developed a novel failure model and designed a symmetric fault-tolerant mechanism using ATCA physical machines and KVM to provide a solution for high system availability. The proposed fault-tolerant mechanism divides ATCA physical machines into pairs, such that each machine of a pair supports fault tolerance for each other. Once a failure is detected in the physical machine layer or the virtualization layer, the failed virtual machines are then recovered on the other physical machine. We have compared the proposed fault-tolerance mechanism with another prior VM-based fault-tolerance tool. The results show that the proposed mechanism significantly reduces the service downtime. That is, it provides better system availability for software services running on the virtual machines.
机译:虚拟化技术已广泛应用于当今的计算计算数据中心。借助虚拟化技术,可以将数据中心中的每台物理计算机逻辑上划分为多个虚拟机,在这些虚拟机上可以承载不同类型的软件服务。但是,许多原因可能会降低整个系统的可用性。例如,发生故障的物理机会自动使物理机上的所有虚拟机发生故障,从而使虚拟机上的每个软件服务都发生故障。在通用计算机体系结构中,很难有效地检测故障,因为硬件无法提供足够的信息来进行快速故障检测。相反,ATCA(高级电信计算体系结构)物理机提供了较高的硬件可用性,并支持可以快速检测硬件状态的IPMI(智能平台管理接口)。在本文中,我们开发了一种新颖的故障模型,并使用ATCA物理机和KVM设计了对称的容错机制,以提供高系统可用性的解决方案。所提出的容错机制将ATCA物理机分为几对,从而使一对中的每台机器彼此支持容错。一旦在物理机层或虚拟化层中检测到故障,就会在另一台物理机上恢复发生故障的虚拟机。我们已经将提出的容错机制与另一种基于VM的现有容错工具进行了比较。结果表明,该机制显着减少了服务停机时间。也就是说,它为虚拟机上运行的软件服务提供了更好的系统可用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号