...
首页> 外文期刊>IBM Journal of Research and Development >Minimal-storage high-performance Cholesky factorization via blocking and recursion
【24h】

Minimal-storage high-performance Cholesky factorization via blocking and recursion

机译:通过阻塞和递归实现最少存储的高性能Cholesky因式分解

获取原文

摘要

We present a novel practical algorithm for Cholesky factorization when the matrix is stored in packed format by combining blocking and recursion. The algorithm simultaneously obtains Level 3 performance, conserves about half the storage, and avoids the production of Level 3 BLAS for packed format. We use recursive packed format, which was first described by Andersen et al. [1] . Our algorithm uses only DGEMM and Level 3 kernel routines; it first transforms standard packed format to packed recursive lower row format. Our new algorithm outperforms the Level 3 LAPACK routine DPOTRF even when we include the cost of data transformation. (This is true for three IBM platforms—the POWER3, the POWER2, and the PowerPC 604e.) For large matrices, blocking is not required for acceptable Level 3 performance. However, for small matrices the overhead of pure recursion and/or data transformation is too high. We analyze these costs analytically and provide de tailed cost estimates. We show that blocking combined with recursion reduces all overheads to a tiny, acceptable level. However, a new problem of nonlinear addressing arises. We use two-dimensional mappings (tables) or data copying to overcome the high costs of directly computing addresses that are nonlinear functions of i and j.
机译:当矩阵通过结合分块和递归以压缩格式存储时,我们提出了一种新颖的实用的Cholesky分解算法。该算法可同时获得3级性能,节省大约一半的存储空间,并避免产生打包格式的3级BLAS。我们使用递归打包格式,这是由Andersen等人首先描述的。 [1]。我们的算法仅使用DGEMM和3级内核例程。它首先将标准打包格式转换为打包递归下排格式。即使算上数据转换的成本,我们的新算法也优于3级LAPACK例程DPOTRF。 (对于三个IBM平台(POWER3,POWER2和PowerPC 604e,这是正确的。)对于大型矩阵,对于可接受的3级性能,不需要阻塞。但是,对于较小的矩阵,纯递归和/或数据转换的开销太高。我们以分析方式分析这些成本,并提供详细的成本估算。我们证明了将阻塞与递归相结合可以将所有开销减少到很小的可接受水平。但是,出现了非线性寻址的新问题。我们使用二维映射(表)或数据复制来克服直接计算地址(i和j的非线性函数)的高额费用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号