首页> 外文会议>International Conference on Parallel Processing and Applied Mathematics >New Data Structures for Matrices and Specialized Inner Kernels: Low Overhead for High Performance
【24h】

New Data Structures for Matrices and Specialized Inner Kernels: Low Overhead for High Performance

机译:用于矩阵和专用内核的新数据结构:高性能的低开销

获取原文

摘要

Dense linear algebra codes are often expressed and coded in terms of BLAS calls. This approach, however, achieves suboptimal performance due to the overheads associated to such calls. Taking as an example the dense Cholesky factorization of a symmetric positive definite matrix we show that the potential of non-canonical data structures for dense linear algebra can be better exploited with the use of specialized inner kernels. The use of non-canonical data structures together with specialized inner kernels has low overhead and can produce excellent performance.
机译:密集的线性代数代码通常以BLAS调用表示和编码。然而,这种方法由于与此类呼叫相关联的开销而实现了次优性能。作为一个例子,对称正定矩阵的密集孔基分解,我们表明,可以更好地利用专用内核来利用致密线性代数的非规范数据结构的潜力。使用非规范数据结构与专用内核一起具有低开销,可以产生出色的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号