首页> 外文会议>International Conference on Parallel and Distributed Processing Techniques and Applications >A Scalable Parallel Bisection Algorithm for Symmetric Tridiagonal Eigenvalue Problem
【24h】

A Scalable Parallel Bisection Algorithm for Symmetric Tridiagonal Eigenvalue Problem

机译:对称性三角形特征值问题的可伸缩并行分布算法

获取原文

摘要

Bisection method is a numerically stable algorithm used to find the eigenvalues of symmetric tridiagonal matrices. It is distinct from other methods in that it can be used to compute a subset of eigenvalues with high accuracy. However, the algorithm is significantly slow compared to other methods when a large number of eigenvalues are desired. Fortunately, the algorithm exhibits a high level of parallelism when it is implemented on various types of multiprocessors, including a single-GPU system. In this paper, we describe a highly scalable implementation using multi-GPU systems to accommodate large matrices. Our approach exploits the latest memory management features available on Nvidia Tesla K20c GPUs, including unified memory architecture, peer-to-peer data transfer, and dynamic parallelism. Our implementation was at least 60 times faster than a multi-core CPU system and exhibits a linear speedup with respect to the number of GPUs in the system.
机译:二分辨率是一种用于找到对称三角形矩阵的特征值的数值稳定的算法。它与其他方法不同,因为它可用于计算具有高精度的特征值的子集。然而,与需要大量特征值时的其他方法相比,该算法显着慢。幸运的是,当在各种类型的多处理器上实现时,该算法表现出高水平的并行性,包括单个GPU系统。在本文中,我们使用多GPU系统来容纳大矩阵来描述高度可扩展的实现。我们的方法利用NVIDIA Tesla K20c GPU上提供的最新内存管理功能,包括统一的内存架构,对等数据传输和动态并行性。我们的实现比多核CPU系统快至少60倍,并且相对于系统中GPU的数量展示线性加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号