Overcoming Virtualization Overheads for Large-vCPU Virtual Machines

Abstract

Virtual machines (VMs) frequently run parallel applications in cloud environments and high-performance computing platforms. It is well known that configuring a VM with too many virtual processors (vCPUs) worsens application performance due to scheduling cross-talk between the hypervisor and the guest OS. Specifically, when the number of vCPUs assigned to a VM exceeds the number of available physical CPUs, parallel applications in the VM perform worse, even when the number of application threads remains fixed. In this paper, we first trace the root cause of this performance loss to inefficient hypervisor-level emulation of inter-vCPU synchronization events. We then present three techniques to minimize hypervisor-induced overheads on parallel workloads in large-vCPU VMs. The first technique pins application threads to dedicated vCPUs to eliminate inter-vCPU thread migrations, reducing the overhead of emulating inter-processor interrupts (IPIs). The second technique para-virtualizes inter-vCPU TLB flush operations. The third technique enables faster reactivation of idle vCPUs by prioritizing the delivery of rescheduling IPIs. Unlike existing solutions that rely on heavyweight and slow vCPU hotplug mechanisms, our techniques are lightweight and provide more flexibility in migrating large-vCPU VMs. Using several parallel benchmarks, we demonstrate the effectiveness of our prototype implementation in the Linux KVM/QEMU virtualization platform. Specifically, we demonstrate that with our techniques, parallel applications can maintain their performance even when 255 vCPUs are assigned to a VM running on only 6 physical cores.
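
The first technique, pinning each application thread to a dedicated vCPU, can be approximated from inside a Linux guest with the standard CPU-affinity interface. The sketch below is only an illustration under stated assumptions: the abstract does not say how the authors implement pinning, the helper names `assign_cpus` and `run_pinned` are hypothetical, and the code relies on the Linux-only `os.sched_setaffinity` call.

```python
import os
import threading


def assign_cpus(n_threads, cpus):
    """Map thread index -> CPU id, round-robin over the allowed CPUs.

    Hypothetical helper for illustration; not taken from the paper.
    """
    ordered = sorted(cpus)
    return [ordered[i % len(ordered)] for i in range(n_threads)]


def run_pinned(n_threads, work):
    """Run work(i) in n_threads threads, pinning each thread to one CPU first.

    Returns the affinity set each thread observed after pinning itself.
    """
    # CPUs this process may run on; inside a guest these are vCPUs.
    allowed = os.sched_getaffinity(0)
    plan = assign_cpus(n_threads, allowed)
    seen = [None] * n_threads

    def worker(i):
        # On Linux, pid 0 means the calling thread, so only this thread is pinned.
        os.sched_setaffinity(0, {plan[i]})
        seen[i] = os.sched_getaffinity(0)
        work(i)

    threads = [threading.Thread(target=worker, args=(i,)) for i in range(n_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return seen


if __name__ == "__main__":
    for i, cpus in enumerate(run_pinned(4, lambda i: None)):
        print(f"thread {i} pinned to CPU(s) {sorted(cpus)}")
```

Once a thread's affinity set holds a single vCPU, the guest scheduler can no longer migrate it to another vCPU, which is the effect the abstract credits with reducing emulated IPIs. When there are more threads than vCPUs, the round-robin plan shares vCPUs between threads, so a truly dedicated assignment needs at least as many vCPUs as threads.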
