首页> 外文会议>Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture >Comparing cache architectures and coherency protocols on x86-64 multicore SMP systems
【24h】

Comparing cache architectures and coherency protocols on x86-64 multicore SMP systems

机译:比较x86-64多核SMP系统上的缓存体系结构和一致性协议

获取原文

摘要

Across a broad range of applications, multicore technology is the most important factor that drives today's microprocessor performance improvements. Closely coupled is a growing complexity of the memory subsystems with several cache levels that need to be exploited efficiently to gain optimal application performance. Many important implementation details of these memory subsystems are undocumented. We therefore present a set of sophisticated benchmarks for latency and bandwidth measurements to arbitrary locations in the memory subsystem. We consider the coherency state of cache lines to analyze the cache coherency protocols and their performance impact. The potential of our approach is demonstrated with an in-depth comparison of ccNUMA multiprocessor systems with AMD (Shanghai) and Intel (Nehalem-EP) quad-core x86-64 processors that both feature integrated memory controllers and coherent point-to-point interconnects. Using our benchmarks we present fundamental memory performance data and architectural properties of both processors. Our comparison reveals in detail how the microarchitectural differences tremendously affect the performance of the memory subsystem.
机译:在广泛的应用中,多核技术是推动当今微处理器性能提高的最重要因素。紧密耦合的是,内存子系统的复杂性不断提高,需要有效利用多个缓存级别来获得最佳应用程序性能。这些内存子系统的许多重要实现细节均未记录。因此,我们提供了一组复杂的基准,用于对内存子系统中任意位置的延迟和带宽测量。我们考虑缓存行的一致性状态来分析缓存一致性协议及其性能影响。通过将ccNUMA多处理器系统与AMD(上海)和Intel(Nehalem-EP)四核x86-64处理器进行深度比较,展示了我们方法的潜力,这些处理器均具有集成的内存控制器和一致的点对点互连。使用我们的基准,我们展示了两个处理器的基本内存性能数据和体系结构属性。我们的比较详细揭示了微体系结构差异如何极大地影响内存子系统的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号