...
首页> 外文期刊>Parallel Computing >Evaluating the SW26010 many-core processor with a micro-benchmark suite for performance optimizations
【24h】

Evaluating the SW26010 many-core processor with a micro-benchmark suite for performance optimizations

机译:使用微基准套件评估SW26010多核处理器以优化性能

获取原文
获取原文并翻译 | 示例
           

摘要

The inadequate public information of China’s SW26010 processor’s micro-architecture prevents global researchers from improving application performances on the TaihuLight supercomputer. This study aims to illuminate the uncharted area of SW26010 in order to provide important information for performance optimizations and modeling. First, we developed a micro-benchmark suite,swCandle, to evaluate the key micro-architectural features. The benchmark results revealed some unanticipated findings beyond the publicly available data. For instance, the broadcast mode of register communications has the same latency as the peer-to-peer mode. Second, we applied the roofline model, with the key parameters obtained withswCandle, to identify the key programming challenge of SW26010. Third, based on the micro-benchmark results and the roofline model analysis, we proposed a systematic guideline for performance optimizations on SW26010 and instantiated the guideline with two cases. The methodology we developed in this study, that infers a processor’s micro-architecture design from micro-benchmark results, can also be applied on other processors lacking of public information.
机译:中国SW26010处理器微体系结构的公开信息不足,使全球研究人员无法改善TaihuLight超级计算机上的应用程序性能。这项研究旨在阐明SW26010的未知领域,以便为性能优化和建模提供重要信息。首先,我们开发了一个微基准套件swCandle来评估关键的微体系结构功能。基准结果显示了一些超出公开数据的意外发现。例如,寄存器通信的广播模式具有与对等模式相同的等待时间。其次,我们将车顶线模型以及通过swCandle获得的关键参数应用于SW26010的关键编程挑战。第三,基于微基准测试结果和车顶线模型分析,我们针对SW26010的性能优化提出了系统性的指导原则,并用两种情况实例化了该指导原则。我们在这项研究中开发的方法可以从微基准测试结果推断处理器的微体系结构设计,也可以应用于缺乏公共信息的其他处理器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号