...
首页> 外文期刊>Advances in Engineering Software >An improved direct linear equation solver using multi-GPU in multi-body dynamics
【24h】

An improved direct linear equation solver using multi-GPU in multi-body dynamics

机译:在多体动力学中使用多GPU的改进的直接线性方程求解器

获取原文
获取原文并翻译 | 示例

摘要

This research proposes an implementation of effective direct linear equation solver for mechanical multi-body dynamics analysis. The proposed method focuses on the solvability for any size of GPU memory and scalability for any number of GPUs by using BFS-based traversal. A multi-level tree is divided into as many sub-trees as a GPU number by using the nested dissection, each of which is assigned to each GPU. Balanced graph bisection, additional sub-trees, and work stealing lead to minimum idle GPU computing time. Numerical experiments have been performed to decide the optimal maximum block size. Three mechanical models and the other three matrices from UF collection have been solved to show the effectiveness of the proposed method. Two different kinds of 4 GPUs, GeForce GTX 460 and GTX TITAN BLACK, are involved in this experiment. The proposed method shows a good solvability even when the test GPU memory is dozens of times smaller than the required data size for numerical factorization. The proposed optimization algorithm presents a good scalability on the number of GPUs. The performance results are compared with those obtained from CHOLMOD included in SuiteSparse library.
机译:这项研究提出了一种有效的直接线性方程求解器,用于机械多体动力学分析。所提出的方法着重于通过使用基于BFS的遍历来解决任意大小的GPU内存的可解性和任意数量的GPU的可扩展性。通过使用嵌套解剖,将多级树分为与GPU数量一样多的子树,每个子树都分配给每个GPU。平衡的图形二等分,额外的子树和工作窃取可将GPU的空闲计算时间降至最低。已经进行了数值实验,以确定最佳的最大块大小。 UF集合中的三个力学模型和其他三个矩阵已被求解,以证明该方法的有效性。本实验涉及两种不同的4种GPU,即GeForce GTX 460和GTX TITAN BLACK。即使当测试GPU内存比数值分解所需的数据大小小几十倍时,所提出的方法也显示出良好的可解性。所提出的优化算法在GPU数量上具有良好的可扩展性。将性能结果与SuiteSparse库中包含的CHOLMOD获得的结果进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号