首页> 外文会议>International Symposium on Parallel Distributed Processing >Using Hardware Multithreading to Overcome Broadcast/Reduction Latency in an Associative SIMD Processor
【24h】

Using Hardware Multithreading to Overcome Broadcast/Reduction Latency in an Associative SIMD Processor

机译:使用硬件多线程克服关联SIMD处理器中的广播/减少延迟

获取原文

摘要

The latency of broadcast/reduction operations has a significant impact on the performance of SIMD processors. This is especially true for associative programs, which make extensive use of global search operations. Previously, we developed a prototype associative SIMD processor that uses hardware multithreading to overcome the broadcast/reduction latency. In this paper we show, through simulations of the processor running an associative program, that hardware multithreading is able to improve performance by increasing system utilization, even for processors with hundreds or thousands of processing elements. However, the choice of thread scheduling policy used by the hardware is critical in determining the actual utilization achieved. We consider three thread scheduling policies and show that a thread scheduler that avoids issuing threads that will stall due to pipeline dependencies or thread synchronization operations is able to maintain system utilization independent of the number of threads.
机译:广播/减少操作的延迟对SIMD处理器的性能产生了重大影响。这对于关联计划尤其如此,这使得广泛使用全球搜索操作。以前,我们开发了一个原型关联SIMD处理器,它使用硬件多线程来克服广播/减少延迟。在本文中,我们通过运行关联程序的处理器模拟,该硬件多线程能够通过提高系统利用率来提高性能,即使对于具有数百或数千个处理元素的处理器,也能够提高性能。但是,硬件使用的线程调度策略选择对于确定所实现的实际利用来说是至关重要的。我们考虑了三个线程调度策略,并显示了一种避免由于管道依赖性或线程同步操作而迁移的线程的线程调度器能够维护与线程数无关的系统利用率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号