首页> 外文期刊>International Journal of High Performance Computing Applications >Sparse triangular solves for ILU revisited: data layout crucial to better performance
【24h】

Sparse triangular solves for ILU revisited: data layout crucial to better performance

机译:稀疏三角形解决了ILU的问题:数据布局对于提高性能至关重要

获取原文
获取原文并翻译 | 示例
           

摘要

A key to good processor utilization for sparse matrix computations is storing the data in the format that is most conducive to fast access by the memory system. In particular, for sparse matrix triangular solves the traditional compressed sparse matrix format is poor, and minor adjustments to the data structure can increase the processor utilization dramatically. Such adjustments involve storing the L and U factors separately and storing the U rows 'backwards' so that they are accessed in a simple streaming fashion during the triangular solves. Changes to the PETSc libraries to use this modified storage format resulted in over twice the floating-point rate for some matrices. This improvement can be accounted for by a decrease in the cache misses and TLB (transaction lookaside buffer) misses in the modified code.
机译:稀疏矩阵计算获得良好的处理器利用率的关键是,以最有利于内存系统快速访问的格式存储数据。特别是对于三角形的稀疏矩阵,解决了传统的压缩稀疏矩阵格式差的问题,对数据结构的细微调整可以显着提高处理器利用率。此类调整包括分别存储L和U因子,以及“向后”存储U行,以便在三角求解期间以简单的流方式访问它们。更改PETSc库以使用这种修改后的存储格式,导致某些矩阵的浮点速率增加了一倍以上。可以通过减少修改后的代码中的高速缓存未命中和TLB(事务后备缓冲区)未命中来解决此改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号