首页> 外文期刊>Computer architecture news >Parallelism-aware Batch Scheduling: Enhancing Both Performance And Fairness Of Shared Dram Systems
【24h】

Parallelism-aware Batch Scheduling: Enhancing Both Performance And Fairness Of Shared Dram Systems

机译:并行感知批处理调度:增强共享Dram系统的性能和公平性

获取原文
获取原文并翻译 | 示例

摘要

In a chip-multiprocessor (CMP) system, the DRAM system is shared among cores. In a shared DRAM system, requests from a thread can not only delay requests from other threads by causing bank/bus/row-buffer conflicts but they can also destroy other threads' DRAM-bank-level parallelism. Requests whose latencies would otherwise have been overlapped could effectively become serialized. As a result both fairness and system throughput degrade, and some threads can starve for long time periods. This paper proposes a fundamentally new approach to designing a shared DRAM controller that provides quality of service to threads, while also improving system throughput. Our parallelism-aware batch scheduler (PAR-BS) design is based on two key ideas. First, PAR-BS processes DRAM requests in batches to provide fairness and to avoid starvation of requests. Second, to optimize system throughput, PAR-BS employs a parallelism-aware DRAM scheduling policy that aims to process requests from a thread in parallel in the DRAM banks, thereby reducing the memory-related stall-time experienced by the thread. PAR-BS seamlessly incorporates support for system-level thread priorities and can provide different service levels, including purely opportunistic service, to threads with different priorities. We evaluate the design trade-offs involved in PAR-BS and compare it to four previously proposed DRAM scheduler designs on 4-, 8-, and 16-core systems. Our evaluations show that, averaged over 100 4-core workloads, PAR-BS improves fairness by 1.11X and system throughput by 8.3% compared to the best previous scheduling technique, Stall-Time Fair Memory (STFM) scheduling. Based on simple request pri-oritization rules, PAR-BS is also simpler to implement than STFM.
机译:在芯片多处理器(CMP)系统中,DRAM系统在内核之间共享。在共享的DRAM系统中,来自线程的请求不仅可以通过引起存储体/总线/行缓冲区冲突来延迟来自其他线程的请求,而且还可以破坏其他线程的DRAM库级并行性。延迟本来会重叠的请求可以有效地序列化。结果,公平性和系统吞吐量都会降低,并且某些线程可能会饿死很长时间。本文提出了一种从根本上全新的方法来设计共享DRAM控制器,该控制器为线程提供服务质量,同时还提高了系统吞吐量。我们的并行感知批处理调度程序(PAR-BS)设计基于两个关键思想。首先,PAR-BS分批处理DRAM请求,以提供公平性并避免请求匮乏。其次,为了优化系统吞吐量,PAR-BS采用了可感知并行性的DRAM调度策略,该策略旨在处理DRAM库中并行处理来自线程的请求,从而减少了线程遇到的与内存相关的停顿时间。 PAR-BS无缝集成了对系统级线程优先级的支持,并可以为具有不同优先级的线程提供不同的服务级别,包括纯机会服务。我们评估了PAR-BS中涉及的设计权衡,并将其与4、8和16核系统上先前提出的四种DRAM调度程序设计进行了比较。我们的评估表明,与以前最好的调度技术(停顿时间公平内存(STFM)调度)相比,PAR-BS平均可处理100多个4核工作负载,从而将公平性提高了1.11倍,将系统吞吐量提高了8.3%。基于简单的请求优先级排序规则,PAR-BS的实现也比STFM更简单。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号