首页> 外文期刊>Concurrency and Computation >Increasing data reuse of sparse algebra codes on simultaneous multithreading architectures
【24h】

Increasing data reuse of sparse algebra codes on simultaneous multithreading architectures

机译:同时多线程体系结构上稀疏代数代码的数据重用

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In this paper the problem of the locality of sparse algebra codes on simultaneous multithreading (SMT) architectures is studied. In these kind of architectures many hardware structures are dynamically shared among the running threads. This puts a lot of stress on the memory hierarchy, and a poor locality, both inter-thread and intra-thread, may become a major bottleneck in the performance of a code. This behavior is even more pronounced when the code is irregular, which is the case of sparse matrix ones. Therefore, techniques that increase the locality of irregular codes on SMT architectures are important to achieve high performance. This paper proposes a data reordering technique specially tuned for these kind of architectures and codes. It is based on a locality model developed by the authors in previous works. The technique has been tested, first, using a simulator of a SMT architecture, and subsequently, on a real architecture as Intel's Hyper-Threading. Important reductions in the number of cache misses have been achieved, even when the number of running threads grows. When applying the locality improvement technique, we also decrease the total execution time and improve the scalability of the code.
机译:本文研究了同时代多线程(SMT)体系结构中稀疏代数代码的局部性问题。在这些类型的体系结构中,许多硬件结构在运行的线程之间动态共享。这给内存层次结构带来了很大压力,并且线程间和线程内的局域性差可能成为代码性能的主要瓶颈。当代码不规则时,这种行为更加明显,这是稀疏矩阵代码的情况。因此,增加SMT体系结构上不规则代码的局部性的技术对于实现高性能很重要。本文提出了一种针对此类架构和代码进行了特别调整的数据重新排序技术。它基于作者在以前的工作中开发的位置模型。首先使用SMT架构的仿真器对该技术进行了测试,然后使用英特尔的超线程技术在真实的架构上进行了测试。即使正在运行的线程数增加,也可以大大减少高速缓存未命中的次数。当应用局部性改进技术时,我们还减少了总执行时间并提高了代码的可伸缩性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号