首页> 美国政府科技报告 >Effectiveness of Caches and Data Prefetch Buffers in Large-Scale Shared Memory Multiprocessors
【24h】

Effectiveness of Caches and Data Prefetch Buffers in Large-Scale Shared Memory Multiprocessors

机译:大规模共享存储器多处理器中高速缓存和数据预取缓冲区的有效性

获取原文

摘要

Large shared memory multiprocessors usually have difficulty in matching the memory speed with the processor speed due to the large interconnection network between processors and shared memory. In this thesis, we explore the effectiveness of caches and data prefetching by the processor in eliminating the memory access bottleneck. Using trace driven simulations of numerical subroutines that have been transformed into parallel form, we model the following multiprocessor characteristics in our performance evaluation, the cost of a cache coherence enforcement scheme, the effect of a high degree of overlap between cache miss service, the cost of a pin limited data path between shared memory and caches, the effect of a high degree of data prefetching, the program behavior of a scientific workload as represented by over 20 numerical subroutines, and the parallel execution of programs. The optimal cache block size is examined in detail and we found that a small cache block size is favored in the multiprocessors of interest. A compile-time cache coherence scheme that allows data to change cacheability is proposed and is shown to significantly perform better than schemes which do not allow cacheability changes. We show the relative effectiveness of caches and data prefetching for a range of network delays and bandwidths and propose techniques to overcome some of the performance degradations due to read/write access to shared memory. 47 refs., 40 figs., 8 tabs. (ERA citation 13:013829)

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号