...
首页> 外文期刊>Neural computing & applications >CPU versus GPU: which can perform matrix computation faster-performance comparison for basic linear algebra subprograms
【24h】

CPU versus GPU: which can perform matrix computation faster-performance comparison for basic linear algebra subprograms

机译:CPU与GPU:它可以执行基本线性代数子程序更快的性能比较矩阵计算

获取原文
获取原文并翻译 | 示例

摘要

Matrix computing is the core component of machine learning and artificial intelligence. Fast matrix computations can facilitate many large-scale computational projects greatly. Basic linear algebra subprograms (BLAS) are proposed, which classify different matrices and provide a standardized interface. Currently, the most commonly used heterogeneous computing platforms are central processing unit (CPU) and graphics processing unit (GPU). At present, BLAS has been implemented on both CPU and GPU. However, due to the different characteristics of algorithms and hardware, a particular matrix method should be designed for a particular processor. It is important to choose the right processor for a particular matrix computation. This paper first briefly reviews the BLAS, and then introduces architecture and optimization methods of CPU and GPU. The effect of different subroutines in BLAS is studied through experiments. Finally, we discuss the reasons and the processor selection scheme of matrix computations.
机译:矩阵计算是机器学习和人工智能的核心分量。快速矩阵计算可以很大程度上促进许多大型计算项目。提出了基本线性代数子程序(BLA),其分类不同的矩阵并提供标准化接口。目前,最常用的异构计算平台是中央处理单元(CPU)和图形处理单元(GPU)。目前,BLA已经在CPU和GPU上实现。然而,由于算法和硬件的特点不同,应为特定处理器设计特定的矩阵方法。为特定矩阵计算选择正确的处理器非常重要。本文首先介绍了BLA,然后介绍了CPU和GPU的架构和优化方法。通过实验研究了不同子程序在BLAS中的影响。最后,我们讨论了矩阵计算的原因和处理器选择方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号