首页> 外文OA文献 >A Hardware and Software Integrated Approach for Adaptive Thread Management in Multicore Multithreaded Microprocessors
【2h】

A Hardware and Software Integrated Approach for Adaptive Thread Management in Multicore Multithreaded Microprocessors

机译:多核多线程微处理器中自适应线程管理的硬件和软件集成方法

摘要

The Multicore Multithreaded Microprocessor maximizes parallelism on a chip for the optimal system performance, such that its popularity is growing rapidly in high-performance computing. It increases the complexity in resource distribution on a chip by leading it to two directions: isolation and unification. On one hand, multiple cores are implemented to deliver the computation and memory accessing resources to more than one thread at the same time. Nevertheless, it limits the threads’ access to resources in different cores, even if extensively demanded. On the other hand, simultaneous multithreaded architectures unify the domestic execu- tion resources together for concurrently running threads. In such an environment, threads are greatly affected by the inter-thread interference. Moreover, the impacts of the complicated distribution are enlarged by variation in workload behaviors. As a result, the microprocessor requires an adaptive management scheme to schedule threads throughout different cores and coordinate them within cores.In this study, an adaptive thread management scheme was proposed, integrating both hardware and software approaches. The instruction fetch policy at the hardware level took the responsibility by prioritizing domestic threads, while the Operating System scheduler at the software level was used to pair threads dynami- vi cally to multiple cores. The tie between them was the proposed online linear model, which was dynamically constructed for every thread based on data misses by the regression algorithm. Consequently, the hardware part of the proposed scheme proactively granted higher priority to the threads with less predicted long-latency loads, expecting they would better utilize the shared execution resources. Mean- while, the software part was invoked by such a model upon significant changes in the execution phases and paired threads with different demands to the same core to minimize competition on the chip. The proposed scheme was compared to its peer designs and overall 43% speedup was achieved by the integrated approach over the combination of two baseline policies in hardware and software, respectively. The overhead was examined carefully regarding power, area, storage and latency, as well as the relationship between the overhead and the performance.
机译:多核多线程微处理器可最大程度地提高芯片上的并行性以实现最佳系统性能,从而使其在高性能计算中的普及迅速增长。通过将其引向两个方向,它增加了芯片上资源分配的复杂性:隔离和统一。一方面,实现了多个内核,以将计算和内存访问资源同时交付给多个线程。尽管如此,即使有大量需求,它也限制了线程对不同核心资源的访问。另一方面,同时的多线程体系结构将家用执行资源统一在一起,以同时运行线程。在这种环境下,线程受线程间干扰的影响很大。此外,工作负载行为的变化扩大了复杂分布的影响。因此,微处理器需要一种自适应管理方案来调度不同内核中的线程并在内核中进行协调。在本研究中,提出了一种自适应线程管理方案,该方案将硬件和软件方法进行了集成。硬件级别的指令提取策略通过优先处理本地线程来承担责任,而软件级别的操作系统调度程序则用于将线程动态配对到多个内核。它们之间的纽带是提出的在线线性模型,该模型是通过回归算法基于数据丢失为每个线程动态构建的。因此,提出的方案的硬件部分主动为具有较少预测的长延迟负载的线程授予了较高的优先级,期望它们可以更好地利用共享的执行资源。同时,在执行阶段的显着变化以及对同一个内核有不同需求的成对线程的配对下,此类模型会调用软件部分,以最大程度地减少芯片上的竞争。将该提议的方案与其同类设计进行了比较,并且通过在硬件和软件中两个基准策略的组合中分别采用集成方法,总体上实现了43%的加速。仔细检查了有关开销,面积,存储和延迟以及开销与性能之间的关系的开销。

著录项

  • 作者

    Weng Lichen;

  • 作者单位
  • 年度 2012
  • 总页数
  • 原文格式 PDF
  • 正文语种
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号