【24h】

Load Execution Latency Reduction

机译:减少执行负载延迟

获取原文
获取原文并翻译 | 示例

摘要

Load execution latency is dependent on memory access latency, pipeline depth, and data dependencies. Through load effective address prediction both data dependencies and deep pipeline effects can potentially be removed from the overall execution time. If a load effective address is correctly predicted, the data cache can be speculatively accessed prior to execution, thus effectively reducing the latency of load execution. A hybrid load effective address prediction technique is proposed, using three basic predictors: Last Address Predictor (LAP), Stride Predictor (SP), and Global Dynamic Predictor (GDP). In addition to improving load address prediction accuracy, this work explores the balance of data ports in the cache memory hierarchy, and the effects of load and store aliasing in wide superscalar machines.rnResults: Using a realistic hybrid load address predictor, load address prediction rates range from 32% to 77% averaging 51% for SPECint95 and 60% to 96% averaging 87% for SPECfp95. For a wide superscalar machine with a significant number of execution resources, this prediction rate increases IPC by 12% and 19% for SPECint95 and SPECfp95, respectively. It is also shown that load/ store aliasing decreases the average IPC by 33% for SPECint95 and 24% for SPECfp95.
机译:加载执行延迟取决于内存访问延迟,管道深度和数据依赖性。通过有效的地址预测,可以从整体执行时间中消除数据依赖性和深度流水线效应。如果正确预测了负载有效地址,则可以在执行之前以推测方式访问数据缓存,从而有效地减少了负载执行的延迟。提出了一种使用三种基本预测器的混合负载有效地址预测技术:Last Address Predictor(LAP),Stride Predictor(SP)和Global Dynamic Predictor(GDP)。除了提高加载地址预测的准确性之外,这项工作还探索了高速缓存存储器层次结构中数据端口的平衡,以及在大型超标量计算机中加载和存储别名的影响。rn结果:使用实际的混合加载地址预测器,可以实现加载地址预测率SPECint95的范围为32%至77%,平均为51%; SPECfp95的范围为60%至96%,平均为87%。对于具有大量执行资源的大型超标量计算机,对于SPECint95和SPECfp95,此预测率分别将IPC提升12%和19%。还显示,加载/存储别名使SPECint95的平均IPC降低了33%,对于SPECfp95的平均IPC降低了24%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号