首页> 外文期刊>International Journal of Parallel Programming >Improving Adaptability and Per-Core Performance of Many-Core Processors Through Reconfiguration
【24h】

Improving Adaptability and Per-Core Performance of Many-Core Processors Through Reconfiguration

机译:通过重新配置提高多核处理器的适应性和每核性能

获取原文
获取原文并翻译 | 示例

摘要

Increasing the number of cores in a multi-core processor can only be achieved by reducing the resources available in each core, and hence sacrificing the per-core performance. Furthermore, having a large number of homogeneous cores may not be effective for all the applications. For instance, threads with high instruction level parallelism will under-perform considerably in the resource-constrained cores. In this paper, we propose a core architecture that can be adapted to improve a single thread’s performance or to execute multiple threads. In particular, we integrate Reconfigurable Hardware Unit (RHU) in the resource-constrained cores of a many-core processor. The RHU can be reconfigured to execute the frequently encountered instructions from a thread in order to increase the core’s overall execution bandwidth, thus improving its performance. On the other hand, if the core’s resources are sufficient for a thread, then the RHU can be configured to executed instructions from a different thread to increase the thread level parallelism. The RHU has low area overhead, and hence has minimal impact on scalability of the number of cores. To further limit the area overhead of this mechanism, generation of the reconfiguration bits for the RHUs of multiple cores is delegated to a single core. In this paper, we present the results for using the RHU to improve a single thread’s performance. Our experiments show that the proposed architecture improves the per-core performance by an average of about 23% across a wide range of applications.
机译:增加多核处理器中的核数只能通过减少每个核中可用的资源,从而牺牲每个核的性能来实现。此外,具有大量的均质芯可能并非对所有应用都是有效的。例如,具有高指令级并行性的线程在资源受限的内核中的性能将大大下降。在本文中,我们提出了一种核心体系结构,该体系结构可用于提高单个线程的性能或执行多个线程。特别是,我们将可重构硬件单元(RHU)集成到了多核处理器的资源受限的内核中。可以将RHU重新配置为执行线程中经常遇到的指令,以增加内核的整体执行带宽,从而提高其性能。另一方面,如果内核的资源足以容纳一个线程,则可以将RHU配置为执行来自不同线程的指令,以增加线程级别的并行性。 RHU的区域开销较低,因此对核心数量的可伸缩性影响最小。为了进一步限制该机制的区域开销,将多个内核的RHU的重新配置位的生成委派给单个内核。在本文中,我们介绍了使用RHU改善单线程性能的结果。我们的实验表明,所提出的体系结构可在各种应用中将每核性能平均提高约23%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号