【24h】

Performance of a micro-threaded pipeline

机译:微线程管道的性能

获取原文
获取原文并翻译 | 示例

摘要

The micro-threaded microprocessor is a chip multi-processor, which uses a multi-threaded approach, where the threads are obtained from within a single context and exploit both vector and instruction level parallelism (ILP). This approach employs vertical and horizontal transfer in a simple pipeline. The horizontal transfer is referred to as the normal scalar pipeline processing used in most microprocessors. Vertical transfer is a context switch, which allows the code to tolerate any latency from undetermined data and control dependencies. The performance of the single pipeline is very important in the overall performance of the whole processor, which can distribute threads to any of the available processors. We have measured the influence of three crucial parameters - cache delay, cache miss rate, and number of registers - on the performance using our simulator. Even for a long cache delay (1000 processor cycles) we found that the micro-threaded pipeline can still achieves an IPC of 0.8 in the peak performance which is some 6 times better than a conventional scalar pipeline. If we further degrade cache performance by using an artificially small cache line size the performance of conventional scalar pipeline gives an IPC of 0.02, whereas with unlimited registers the micro-threaded pipeline still manages to achieve and IPC of 0.8 (a factor of 40 difference in performance).
机译:微线程微处理器是一种芯片多处理器,它使用多线程方法,其中线程是从单个上下文中获取的,并且同时利用了向量级和指令级并行性(ILP)。这种方法在简单的管道中采用垂直和水平传输。水平传输被称为大多数微处理器中使用的常规标量流水线处理。垂直传输是上下文切换,它允许代码容忍来自不确定数据和控制依赖项的任何延迟。单个管道的性能对于整个处理器的整体性能非常重要,该处理器可以将线程分配给任何可用的处理器。我们已经使用模拟器测量了三个关键参数(缓存延迟,缓存未命中率和寄存器数量)对性能的影响。即使在较长的缓存延迟(1000个处理器周期)下,我们也发现微线程管道仍然可以实现0.8的峰值性能IPC,这是传统标量管道的6倍。如果我们通过使用人为的较小的缓存行大小进一步降低缓存性能,则常规标量流水线的性能将提供0.02的IPC,而在无限制寄存器的情况下,微线程流水线仍可实现IPC为0.8(相差40倍)。性能)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号