首页> 外文会议>International Multiconference on Computer Science and Information Technology >Empirically tuning LAPACK#x2019;s blocking factor for increased performance
【24h】

Empirically tuning LAPACK#x2019;s blocking factor for increased performance

机译:经验调整Lapack的阻塞因子增加性能

获取原文

摘要

LAPACK (Linear Algebra PACKage) is a statically cache-blocked library, where the blocking factor (NB) is determined by the service routine ILAENV. Users are encouraged to tune NB to maximize performance on their platform/BLAS (the BLAS are LAPACK’s computational engine), but in practice very few users do so (both because it is hard, and because its importance is not widely understood). In this paper we (1) Discuss our empirical tuning framework for discovering good NB settings, (2) quantify the performance boost that tuning NB can achieve on several LAPACK routines across multiple architectures and BLAS implementations, (3) compare the best performance of LAPACK’s statically blocked routines against state of the art recursively blocked routines, and vendor-optimized LAPACK implementations, to see how much performance loss is mandated by LAPACK’s present static blocking strategy, and finally (4) use results to determine how best to block nonsquare matrices once good square blocking factors are discovered.
机译:Lapack(线性代数包)是静态缓存阻止的库,其中阻塞因子(NB)由服务例程ILAENV确定。鼓励用户调整NB,以最大化其平台/ BLAS上的性能(BLAS是Lapack的计算引擎),但在实践中很少有用户(两者都是难的,因为它的重要性并没有被广泛理解)。在本文中,我们(1)讨论我们发现良好的NB设置的实证调整框架,(2)量化调谐NB在多个架构和BLAS实现中实现的调整NB可以实现的性能提升,(3)比较LAPACK的最佳性能静态阻止了用于递归封锁的例程的静态封锁例程,以及供应商优化的Lapack实现,以了解Lapack目前的静态阻断策略的性能损失,最后(4)使用结果来确定如何最好地阻止Nonsquare矩阵一次发现了良好的方形阻挡因子。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号