首页> 外文期刊>ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages >Performance Implications of Extended Page Tables on Virtualized x86 Processors
【24h】

Performance Implications of Extended Page Tables on Virtualized x86 Processors

机译:Virtualized X86处理器上扩展页表的性能含义

获取原文
获取原文并翻译 | 示例
           

摘要

Managing virtual memory is an expensive operation, and becomes even more expensive on virtualized servers. Processing TLB misses on a virtualized x86 server requires a twodimensional page walk that can have 6x more page table lookups, hence 6x more memory references, than a native page table walk. Thus much of the recent research on the subject starts from the assumption that TLB miss processing in virtual environments is significantly more expensive than on native servers. However, we will show that with the latest software stack on modern x86 processors, most of these page table lookups are satisfied by internal paging structure caches and the L1/L2 data caches, and the actual virtualization overhead of TLB miss processing is a modest fraction of the overall time spent processing TLB misses. In this paper, we present a detailed accounting of the TLB miss processing costs on virtualized x86 servers for an exhaustive set of workloads, in particular, two very demanding industry standard workloads. We show that an implementation of the TPC-C workload that actively uses 475 GB of memory on a 72-CPU Haswell-EP server spends 20% of its time processing TLB misses when the application runs in a VM. Although this is a non-trivial amount, it is only 4.2% higher than the TLB miss processing costs on bare metal. The multi-VM VMmark benchmark sees 12.3% in TLB miss processing, but only 4.3% of that can be attributed to virtualization overheads. We show that even for the heaviest workloads, a well-tuned application that uses large pages on a recent OS release with a modern hypervisor running on the latest x86 processors sees only minimal degradation from the additional overhead of the two-dimensional page walks in a virtualized server.
机译:管理虚拟内存是昂贵的操作,在虚拟化服务器上​​变得更加昂贵。在虚拟化X86服务器上处理TLB未命中需要一个TwoDimensional Page Walk,可以具有6倍更多的页面表查找,因此6x更多的内存引用,而不是本机页面表。因此,最近关于对象的研究中的大部分研究从假设的假设是虚拟环境中的TLB未命中处理比本机服务器昂贵更昂贵。但是,我们将显示,在现代X86处理器上的最新软件堆栈中,大多数这些页面表查找都是由内部寻呼结构缓存和L1 / L2数据高速缓存满足,以及TLB未命中处理的实际虚拟化开销是一个适度的分数整个时间花费加工TLB未命中。在本文中,我们在虚拟化的X86服务器上提供了一个详细的X86服务器,用于详尽的工作负载,特别是两个非常苛刻的行业标准工作负载。我们表明,在72-CPU Haswell-EP服务器上积极使用475 GB内存的TPC-C工作负载的实现花费了20%的时间处理TLB在VM中运行时未命中。虽然这是一个非琐碎的金额,但它比裸金属上的TLB未命中处理成本高4.2%。多VM VMMark基准测试在TLB Miss处理中看到12.3%,但只有4.3%可以归因于虚拟化开销。我们展示即使对于最重的工作负载,即使是最近的操作系统版本使用大页面的良好调整的应用程序,在最新的X86处理器上运行的现代虚拟机管理程序也只能从二维页面的额外开销中获得最小的劣化。虚拟化服务器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号