首页> 外文OA文献 >Reducing coherence overhead and boosting performance of high-end SMP multiprocessors running a DSS workload
【2h】

Reducing coherence overhead and boosting performance of high-end SMP multiprocessors running a DSS workload

机译:减少运行DSS工作负载的高端SMP多处理器的一致性开销并提高其性能

摘要

In this work, we characterized the memory performance-and in particular the impact of coherence overhead and process migration-of a shared-bus shared-memory multiprocessor running a DSS workload. When the number of processors is increased in order to achieve higher computational power, the bus becomes a major bottleneck of such architecture. We evaluated solutions that can greatly reduce that bottleneck. An area where this kind of optimization is important regards data base systems. For this reason, we considered a DSS workload and we setup the experiments following TPC-D specifications on the PostgreSQL DBMS in order to explore different optimizations on same kind of workloads as evaluated in the literature. In this scenario, we compare possible solutions to boost performance and we show the impact of process migration on coherence overhead. We found that the consequences of coherence overhead and process migration on performance are very important in machines with 16 or more processors. In this case, even little sharing, as in DSS applications, can become crucial for system performance. Another important result of our analysis regards the interaction between the coherence protocol and the scheduler. The basic cache affinity scheduling is useful in reducing migration, but it is not effective in every load condition. Specific coherence protocols can help reduce the effects of process migration, especially in situations when the scheduler cannot apply the affinity requirement. In these conditions, the use of a wiite-update protocol with a selective invalidation strategy for private data improves performance (and scalability) of about 20% with respect to a classical MESI-based solution. This advantage is about 50% in the case of high cache-to-cache transfer. (c) 2004 Elsevier Inc. All rights reserved.
机译:在这项工作中,我们描述了运行DSS工作负载的共享总线共享内存多处理器的内存性能,尤其是一致性开销和进程迁移的影响。当增加处理器的数量以实现更高的计算能力时,总线成为这种架构的主要瓶颈。我们评估了可以大大减少瓶颈的解决方案。这种优化很重要的领域是数据库系统。因此,我们考虑了DSS工作负载,并按照PostgreSQL DBMS上的TPC-D规范进行了实验,以便对文献中评估的同类工作负载进行不同的优化。在这种情况下,我们比较了可能的解决方案以提高性能,并展示了流程迁移对一致性开销的影响。我们发现,在具有16个或更多处理器的计算机中,一致性开销和进程迁移对性能的影响非常重要。在这种情况下,像DSS应用程序中那样很少的共享对于系统性能也变得至关重要。我们分析的另一个重要结果涉及一致性协议和调度程序之间的交互。基本的缓存相似性计划对于减少迁移很有用,但是并不是在所有负载情况下都有效。特定的一致性协议可以帮助减少进程迁移的影响,尤其是在调度程序无法应用关联性要求的情况下。在这些情况下,相对于传统的基于MESI的解决方案,将wiite更新协议与针对私有数据的选择性失效策略配合使用可将性能(和可伸缩性)提高约20%。在高速缓存到高速缓存传输的情况下,此优势约为50%。 (c)2004 Elsevier Inc.保留所有权利。

著录项

  • 作者

    Foglia P; Giorgi R; Prete C;

  • 作者单位
  • 年度 2005
  • 总页数
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 入库时间 2022-08-20 20:20:17

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号