首页> 外文学位 >Exploring speculative techniques to improve the memory system performance.
【24h】

Exploring speculative techniques to improve the memory system performance.

机译:探索推测性技术以提高内存系统性能。

获取原文
获取原文并翻译 | 示例

摘要

As processor clock speeds have increased along with microarchitectural innovations, the gap between processor and memory performance has become a greater bottleneck and improvements in memory system design have become more important. This dissertation focuses on improving memory performance through the addition of novel functionalities in the memory system. Specifically, we have proposed two different techniques to hide the latency for memory accesses: Incorrect Speculation and Address Correlation.; The speculated execution of threads in a multithreaded architecture, plus the branch prediction used in each thread execution units, allows many instructions to be executed speculatively, that is, before it is known whether they actually will be needed by the program. We have found that incorrect speculation (wrong execution) on the instruction- and thread-level provides an indirect prefetching effect for the later correct execution paths and threads. By continuing to execute the mispredicted load instructions even after the instruction- or thread-level control speculation is known to be incorrect, the cache misses observed on the correctly executed paths can be reduced. However, we also found that these extra loads can increase the amount of memory traffic and can pollute the cache. We introduce the small, fully-associative Wrong Execution Cache (WEC) to eliminate the potential pollution that can be caused by the execution of the mispredicted load instructions. Our simulation results show that the WEC can improve the performance of a concurrent multithreaded architecture due to the reductions in the number of cache misses.; In another approach, we investigate a program phenomenon, Address Correlation, which links addresses that reference the same data. This work shows that different addresses containing the same data can often be correlated at run-time to eliminate a load miss or a partial hit. For the programs tested, a great majority of the L1 data cache load misses and the partial hits, can be supplied from a correlated address already found in the cache. Our source code-level analysis shows that semantically equivalent information, duplicated references, and frequent values are the major causes of address correlations.
机译:随着处理器时钟速度的提高以及微体系结构的创新,处理器和内存性能之间的差距已成为更大的瓶颈,而内存系统设计的改进也变得越来越重要。本文致力于通过在存储系统中增加新功能来提高存储性能。具体来说,我们提出了两种不同的技术来隐藏内存访问的延迟:不正确的推测地址关联。在多线程体系结构中推测的线程执行,再加上在每个线程执行单元中使用的分支预测,可以推测性地执行许多指令,即在知道程序是否真正需要它们之前。我们发现,指令级和线程级的不正确的推测(错误的执行)为以后的正确执行路径和线程提供了间接的预取效果。即使在已知指令或线程级控制推测不正确之后,通过继续执行错误预测的加载指令,可以减少在正确执行的路径上观察到的缓存丢失。但是,我们还发现这些额外的负载会增加内存流量,并会污染缓存。我们引入了小型的,完全关联的错误执行缓存(WEC),以消除执行错误的加载指令可能导致的潜在污染。我们的仿真结果表明,由于减少了高速缓存未命中次数,因此WEC可以提高并发多线程体系结构的性能。在另一种方法中,我们研究了程序现象地址相关性,该现象链接引用了相同数据的地址。这项工作表明,包含相同数据的不同地址通常可以在运行时进行关联,以消除负载丢失或部分命中的情况。对于经过测试的程序,可以从缓存中已经找到的相关地址提供大部分L1数据缓存未命中和部分命中。我们的源代码级分析显示,语义上相等的信息,重复的引用和频繁的值是地址相关性的主要原因。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号