Computer Architecture News

History-Based Arbitration for Fairness in Processor-Interconnect of NUMA Servers



Abstract

NUMA (non-uniform memory access) servers are commonly used in high-performance computing and datacenters. Within each server, a processor-interconnect (e.g., Intel QPI, AMD HyperTransport) is used to communicate between the different sockets or nodes. In this work, we explore the impact of the processor-interconnect on overall performance - in particular, the performance unfairness caused by processor-interconnect arbitration. It is well known that locally-fair arbitration does not guarantee globally-fair bandwidth sharing, as closer nodes receive more bandwidth in a multi-hop network. However, this work demonstrates that the opposite can occur in a commodity NUMA server, where remote nodes receive higher bandwidth (and perform better). We analyze this problem and identify that it occurs because of the external concentration used in router micro-architectures for processor-interconnects that lack globally-aware arbitration. While remote memory accesses occur in any NUMA system, performance unfairness (or performance variation) is more critical in cloud computing and virtual machines with shared resources. We demonstrate how this unfairness creates significant performance variation when a workload is executed on the Xen virtualization platform. We then provide analysis using synthetic workloads to better understand the source of the unfairness and to eliminate the impact of other shared resources, including the shared last-level cache and main memory. To provide fairness, we propose a novel history-based arbitration that tracks the arbitration grants made in the previous history window. A weighted arbitration is then performed based on this history to provide global fairness. Through simulations, we show our proposed history-based arbitration can provide global fairness and minimize processor-interconnect performance unfairness at low cost.

