...
首页> 外文期刊>Journal of supercomputing >GPU parallelization of the sequential matrix diagonalization algorithm and its application to high-dimensional data
【24h】

GPU parallelization of the sequential matrix diagonalization algorithm and its application to high-dimensional data

机译:序列矩阵对角化算法的GPU并行化及其在高维数据中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents the parallelization on a GPU of the sequential matrix diagonalization (SMD) algorithm, a method for diagonalizing polynomial covariance matrices, which is the most recent technique for polynomial eigenvalue decomposition. We first parallelize with CUDA the calculation of the polynomial covariance matrix. Then, following a formal transformation of the polynomial matrix multiplication code-extensively used by SMD-we insert in this code the cublasDgemm function of CUBLAS library. Furthermore, a specialized cache memory system is implemented within the GPU to greatly limit the PC-to-GPU transfers of slices of polynomial matrices. The resulting SMD code can be applied efficiently over high-dimensional data. The proposed method is verified using sequences of images of airplanes with varying spatial orientation. The performance of the parallel codes for polynomial covariance matrix generation and SMD is evaluated and reveals speedups of up to 161 and 67, respectively, relative to sequential execution on a PC.
机译:本文介绍了对数矩阵对角化(SMD)算法在GPU上的并行化,该算法是对多项式协方差矩阵进行对角化的方法,这是多项式特征值分解的最新技术。我们首先将CUDA的多项式协方差矩阵的计算并行化。然后,在对SMD广泛使用的多项式矩阵乘法代码进行形式转换之后,我们在此代码中插入CUBLAS库的cublasDgemm函数。此外,在GPU内实现了专门的缓存存储系统,以极大地限制多项式矩阵的切片在PC到GPU之间的传输。生成的SMD代码可以有效地应用于高维数据。使用具有变化的空间方向的飞机的图像序列来验证所提出的方法。评估了用于多项式协方差矩阵生成和SMD的并行代码的性能,并揭示了相对于PC上的顺序执行速度,分别提高了161和67。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号