首页> 外文会议>International Conference on Information Science and Engineering >Parallel Matrix Algorithm Autotuner on Multi-core Architecture
【24h】

Parallel Matrix Algorithm Autotuner on Multi-core Architecture

机译:多核架构上的并行矩阵算法自动箱

获取原文

摘要

This paper develops a performance autotuner of the parallel sparse matrix multiplication algorithm. This autotuner can utilize the cache characters of multicore computers. It can optimize the value of performance parameters in multicore and achieve the better performance of parallel matrix multiplication algorithm. This paper proposes a parallel matrix multiplication algorithm by Using the CSR compression and cache blocking technology. The cache blocking parameters are optimized to get the better performance. From the matrix experiments, we can consult that the autotuner system can achieve the sparse matrix multiplication algorithm better performance on the multicore hardware.
机译:本文开发了并行稀疏矩阵乘法算法的性能自动箱。此AutoTuner可以利用多核计算机的缓存字符。它可以优化多核性能参数的值,实现了并行矩阵乘法算法的更好性能。本文通过使用CSR压缩和缓存阻塞技术提出了一种并行矩阵乘法算法。高速缓存阻塞参数被优化以获得更好的性能。从Matrix实验中,我们可以咨询Autico手持系统可以在多核硬件上实现更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号