首页> 外文会议>2012 7th IEEE International Conference on Networking, Architecture and Storage. >Implementing the Jacobi Algorithm for Solving Eigenvalues of Symmetric Matrices with CUDA
【24h】

Implementing the Jacobi Algorithm for Solving Eigenvalues of Symmetric Matrices with CUDA

机译:用CUDA实现Jacobi算法求解对称矩阵的特征值

获取原文
获取原文并翻译 | 示例

摘要

Solving the eigenvalues of matrices is an open problem which is often related to scientific computation. With the increasing of the order of matrices, traditional sequential algorithms are unable to meet the needs for the calculation time. Although people can use cluster systems in a short time to solve the eigenvalues of large-scale matrices, it will bring an increase in equipment costs and power consumption. This paper proposes a parallel algorithm named Jacobi on gpu which is implemented by CUDA (Computer Unified Device Architecture) on GPU (Graphic Process Unit) to solve the eigenvalues of symmetric matrices. In our experimental environment, we have Intel Core i5-760 quad-core CPU, NVIDIA GeForce GTX460 card, and Win7 64-bit operating system. When the size of matrix is 10240×10240, the number of iterations is 10000 times, the speedup ratio is 13.71. As the size of matrices increase, the speedup ratio increases correspondingly. Moreover, as the number of iterations increases, the speedup ratio is very stable. When the size of matrix is 8192×8192, the number of iterations are 1000, 2000, 4000, 8000 and 16000 respectively, the standard deviation of the speedup ratio is 0.1161. The experimental results show that the Jacobi on gpu algorithm can save more running time than traditional sequential algorithms and the speedup ratio is 3.02~13.71. Therefore, the computing time of traditional sequential algorithms to solve the eigenvalues of matrices is reduced significantly.
机译:解决矩阵的特征值是一个开放的问题,通常与科学计算有关。随着矩阵阶数的增加,传统的顺序算法无法满足计算时间的需求。尽管人们可以在短时间内使用集群系统来解决大规模矩阵的特征值,但这将带来设备成本和功耗的增加。本文提出了一种在gpu上名为Jacobi的并行算法,该算法由GPU(图形处理单元)上的CUDA(计算机统一设备架构)实现,用于求解对称矩阵的特征值。在我们的实验环境中,我们拥有Intel Core i5-760四核CPU,NVIDIA GeForce GTX460卡和Win7 64位操作系统。当矩阵大小为10240×10240时,迭代次数为10000次,加速比为13.71。随着矩阵大小的增加,加速比也相应增加。此外,随着迭代次数的增加,加速比非常稳定。当矩阵的大小为8192×8192时,迭代次数分别为1000、2000、4000、8000和16000,则加速比的标准偏差为0.1161。实验结果表明,Jacobi on gpu算法比传统的顺序算法可节省更多的运行时间,加速比为3.02〜13.71。因此,大大减少了传统顺序算法求解矩阵特征值的时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号