首页> 外文期刊>Journal of Computational Physics >A fast algorithm for sparse matrix computations related to inversion
【24h】

A fast algorithm for sparse matrix computations related to inversion

机译:一种与反演有关的稀疏矩阵计算的快速算法

获取原文
获取原文并翻译 | 示例
           

摘要

We have developed a fast algorithm for computing certain entries of the inverse of a sparse matrix. Such computations are critical to many applications, such as the calculation of non-equilibrium Green's functions ~(Gr) and ~(G <) for nano-devices. The FIND (Fast Inverse using Nested Dissection) algorithm is optimal in the big-O sense. However, in practice, FIND suffers from two problems due to the width-2 separators used by its partitioning scheme. One problem is the presence of a large constant factor in the computational cost of FIND. The other problem is that the partitioning scheme used by FIND is incompatible with most existing partitioning methods and libraries for nested dissection, which all use width-1 separators. Our new algorithm resolves these problems by thoroughly decomposing the computation process such that width-1 separators can be used, resulting in a significant speedup over FIND for realistic devices - up to twelve-fold in simulation. The new algorithm also has the added advantage that desired off-diagonal entries can be computed for free. Consequently, our algorithm is faster than the current state-of-the-art recursive methods for meshes of any size. Furthermore, the framework used in the analysis of our algorithm is the first attempt to explicitly apply the widely-used relationship between mesh nodes and matrix computations to the problem of multiple eliminations with reuse of intermediate results. This framework makes our algorithm easier to generalize, and also easier to compare against other methods related to elimination trees. Finally, our accuracy analysis shows that the algorithms that require back-substitution are subject to significant extra round-off errors, which become extremely large even for some well-conditioned matrices or matrices with only moderately large condition numbers. When compared to these back-substitution algorithms, our algorithm is generally a few orders of magnitude more accurate, and our produced round-off errors stay at a reasonable level.
机译:我们已经开发了一种用于计算稀疏矩阵逆的某些项的快速算法。这样的计算对于许多应用都是至关重要的,例如纳米器件的非平衡格林函数〜(Gr)和〜(G <)的计算。从大O角度来看,FIND(使用嵌套解剖的快速逆算法)是最佳的。但是,实际上,由于其分区方案使用的宽度为2的分隔符,FIND遇到两个问题。一个问题是FIND的计算成本中存在较大的常数。另一个问题是FIND所使用的分区方案与大多数现有的用于嵌套解剖的分区方法和库不兼容,它们都使用width-1分隔符。我们的新算法通过彻底分解计算过程来解决这些问题,从而可以使用宽度为1的分隔符,从而使实际设备的FIND速度大大提高-在模拟中达到12倍。新算法还具有可以免费计算所需的非对角线条目的优点。因此,对于任何大小的网格,我们的算法都比当前最新的递归方法要快。此外,在我们的算法分析中使用的框架是首次尝试将网格节点和矩阵计算之间广泛使用的关系明确应用到具有中间结果重用的多重消除问题。该框架使我们的算法更易于推广,并且也易于与与消除树相关的其他方法进行比较。最后,我们的精度分析表明,要求反替换的算法会遭受明显的额外舍入误差,即使对于某些条件良好的矩阵或条件数仅中等的矩阵,该误差也会变得非常大。与这些替代算法相比,我们的算法通常更精确几个数量级,并且我们产生的舍入误差保持在合理的水平。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号