首页> 外文期刊>International journal of parallel programming >An experimental evaluation of the HP V-Class and SGI Origin 2000 multiprocessors using microbenchmarks and scientific applications
【24h】

An experimental evaluation of the HP V-Class and SGI Origin 2000 multiprocessors using microbenchmarks and scientific applications

机译:使用微基准测试和科学应用对HP V-Class和SGI Origin 2000多处理器进行的实验评估

获取原文
获取原文并翻译 | 示例

摘要

As processor technology continues to advance at a rapid pace, the principal performance bottleneck of shared memory systems has become the memory access latency. In order to understand the effects of cache and memory hierarchy on system latencies, performance analysts perform benchmark analysis on existing multiprocessors. In this study, we present a detailed comparison of two architectures, the HP V-Class and the SGI Origin 2000. Our goal is to compare and contrast design techniques used in these multiprocessors. We present the impact of processor design, cache/memory hierarchies and coherence protocol optimizations on the memory system performance of these multiprocessors. We also study the effect of parallelism overheads such as process creation and synchronization on the user-level performance of these multiprocessors. Our experimental methodology uses microbenchmarks as well as scientific applications to characterize the user-level performance. Our microbenchmark results show the impact of Ll/L2 cache size and TLB size on uniprocessor load/store latencies, the effect of coherence protocol design/optimizations and data sharing patterns on multiprocessor memory access latencies and finally the overhead of parallelism. Our application-based evaluation shows the impact of problem size, dominant sharing patterns and number of Processors used on speedup and raw execution time. Finally, we use hardware counter measurements to study the correlation of system-level performance metrics and the application's execution time performance.
机译:随着处理器技术的快速发展,共享内存系统的主要性能瓶颈已成为内存访问延迟。为了了解高速缓存和内存层次结构对系统延迟的影响,性能分析人员对现有的多处理器进行基准分析。在这项研究中,我们对HP V-Class和SGI Origin 2000这两种体系结构进行了详细的比较。我们的目标是比较和对比这些多处理器中使用的设计技术。我们介绍了处理器设计,高速缓存/内存层次结构和一致性协议优化对这些多处理器的内存系统性能的影响。我们还研究了并行开销(例如进程创建和同步)对这些多处理器用户级性能的影响。我们的实验方法使用微基准测试和科学应用来表征用户级别的性能。我们的微基准测试结果显示了L1 / L2缓存大小和TLB大小对单处理器加载/存储延迟的影响,一致性协议设计/优化和数据共享模式对多处理器内存访问延迟的影响,以及最终的并行性开销。我们基于应用程序的评估显示了问题大小,主要共享模式以及所使用的处理器数量对加速和原始执行时间的影响。最后,我们使用硬件计数器测量来研究系统级性能指标与应用程序执行时间性能的相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号