Journal: Computing Reviews

Programming matrix algorithms-by-blocks for thread-level parallelism


Abstract

Over the last few years, processors have gained power by being divided into multiple cores that operate independently and in parallel within a shared address space. This paper presents a new method for programming dense linear algebra algorithms that, on these multicore architectures, delivers better performance than the traditional approach of relying on libraries such as the Linear Algebra PACKage (LAPACK).
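The abstract does not describe the method itself, so the following is only a minimal sketch of the general idea the title suggests: a matrix is viewed as a grid of blocks, and operations on whole blocks become the units of work handed to threads. It uses matrix multiplication as the example operation. All names here (`split_into_blocks`, `compute_output_block`, `matmul_by_blocks`) are hypothetical and are not taken from the paper or from LAPACK, and the matrix size is assumed to be a multiple of the block size.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_blocks(matrix, block_size):
    """View a square matrix (list of lists) as a grid of square blocks.

    Assumption: len(matrix) is a multiple of block_size.
    """
    n = len(matrix)
    return [
        [[row[j:j + block_size] for row in matrix[i:i + block_size]]
         for j in range(0, n, block_size)]
        for i in range(0, n, block_size)
    ]


def compute_output_block(a_blocks, b_blocks, i, j, block_size):
    """Compute block (i, j) of C = A * B from block row i of A and block column j of B."""
    c_block = [[0.0] * block_size for _ in range(block_size)]
    for k in range(len(a_blocks)):
        a_blk, b_blk = a_blocks[i][k], b_blocks[k][j]
        for r in range(block_size):
            for c in range(block_size):
                c_block[r][c] += sum(
                    a_blk[r][t] * b_blk[t][c] for t in range(block_size)
                )
    return c_block


def matmul_by_blocks(a, b, block_size, workers=4):
    """Multiply two square matrices, one thread task per output block."""
    a_blocks = split_into_blocks(a, block_size)
    b_blocks = split_into_blocks(b, block_size)
    grid = len(a_blocks)

    # Each output block depends only on read-only inputs, so the tasks
    # are independent and can run in any order.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = {
            (i, j): pool.submit(
                compute_output_block, a_blocks, b_blocks, i, j, block_size
            )
            for i in range(grid)
            for j in range(grid)
        }
        c_blocks = {key: fut.result() for key, fut in futures.items()}

    # Stitch the computed blocks back into one flat matrix.
    n = len(a)
    result = [[0.0] * n for _ in range(n)]
    for (i, j), blk in c_blocks.items():
        for r in range(block_size):
            start = j * block_size
            result[i * block_size + r][start:start + block_size] = blk[r]
    return result
```

In CPython, the global interpreter lock means these threads will not speed up pure-Python arithmetic; the sketch is meant to show the task structure (independent block operations in a shared address space), not to reproduce the performance results the paper reports.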
