【24h】

Out-of-order vector architectures

机译:无序向量架构

获取原文

摘要

Register renaming and out-of-order instruction issue are now commonly used in superscalar processors. These techniques can also be used to significant advantage in vector processors, as this paper shows. Performance is improved and available memory bandwidth is used more effectively. Using a trace driven simulation we compare a conventional vector implementation, based on the Convex C3400, with an out-of-order, register renaming, vector implementation. When the number of physical registers is above 12, out-of-order execution coupled with register renaming provides a speedup of 1.24--1.72 for realistic memory latencies. Out-of-order techniques also tolerate main memory latencies of 100 cycles with a performance degradation less than 6%. The mechanisms used for register renaming and out-of-order issue can be used to support precise interrupts -- generally a difficult problem in vector machines. When precise interrupts are implemented, there is typically less than a 10% degradation in performance. Anew technique based on register renaming is targeted at dynamically eliminating spill code; this technique is shown to provide an extra speedup ranging between 1.10 and 1.20 while reducing total memory traffic by an average of 15--20%.
机译:寄存器重命名和乱序指令问题现在通常在超标量处理器中使用。如本文所示,这些技术也可以在矢量处理器中发挥显着优势。性能得到改善,可用内存带宽得到更有效的利用。使用跟踪驱动的仿真,我们将基于Convex C3400的常规矢量实现与乱序的寄存器重命名矢量实现进行了比较。当物理寄存器的数量大于12时,乱序执行与寄存器重命名一起提供了1.24--1.72的加速,以实现实际的内存延迟。乱序技术还可以承受100个周期的主内存延迟,而性能降级不到6%。用于寄存器重命名和乱序问题的机制可用于支持精确的中断,这通常是向量机中的难题。当实施精确的中断时,性能通常会下降不到10%。一种基于寄存器重命名的新技术旨在动态消除溢出代码。该技术显示出可提供1.10到1.20的额外加速,同时将总内存流量平均减少15--20%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号