Foreign-language journal: システム/制御/情報

Clustered Pipelined Multithreading on Commodity Multi-Core Processors


Abstract

Recently proposed pipelined multithreading (PMT) techniques have shown great applicability to parallelizing general programs on multi-core processors. However, the potential performance of these techniques is limited by large inter-core communication overheads, which become a performance bottleneck. This paper addresses this problem and presents a novel clustered pipelined multithreading (CPMT) technique that can construct efficient pipeline parallelism on commodity multi-core processors. The technique incorporates a clustered communication mechanism that can greatly reduce average communication overheads (ACOs) using a software-only approach. We quantitatively demonstrate that the performance of CPMT can be improved by reducing the ACOs, and we show its performance characteristics. We also give the stage decomposition procedure and provide a stage execution framework that can execute multiple stages within one procedure. The effectiveness of the CPMT technique has been evaluated on commodity AMD Phenom four-core processors. Experimental results show that our CPMT technique achieves speedup ranging from 116.8% to 219.8% on some typical loops extracted from SPEC CPU 2000 benchmark programs.
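The abstract does not describe how the clustered communication mechanism works, so the following is only an illustrative sketch of the general idea of amortizing inter-stage communication by sending items in groups. It assumes a two-stage pipeline over a loop, where stage 1 squares each value and stage 2 sums the results. The names `run_pipeline`, `batch_size`, and `_DONE` are hypothetical and are not taken from the paper. Python threads are used for brevity, so the sketch shows the reduction in the number of transfers, not a real multi-core speedup.

```python
import queue
import threading

_DONE = object()  # sentinel marking the end of the stream (hypothetical name)


def run_pipeline(items, batch_size):
    """Run a two-stage pipeline; return (sum_of_squares, queue_transfers)."""
    channel = queue.Queue()   # inter-stage communication channel
    transfers = 0             # number of batches sent through the channel
    result = []

    def stage1():
        # Producer stage: do per-item work, then send items in groups.
        nonlocal transfers
        batch = []
        for x in items:
            batch.append(x * x)
            if len(batch) == batch_size:
                channel.put(batch)   # one synchronization per batch
                transfers += 1
                batch = []
        if batch:                    # flush the final partial batch
            channel.put(batch)
            transfers += 1
        channel.put(_DONE)

    def stage2():
        # Consumer stage: receive a batch, then process its items locally.
        total = 0
        while True:
            batch = channel.get()
            if batch is _DONE:
                break
            for y in batch:
                total += y
        result.append(total)

    threads = [threading.Thread(target=stage1),
               threading.Thread(target=stage2)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return result[0], transfers


if __name__ == "__main__":
    for size in (1, 64):
        total, sent = run_pipeline(range(1000), size)
        print(f"batch_size={size}: total={total}, transfers={sent}")
```

With 1000 loop iterations, a batch size of 1 needs 1000 transfers between the stages, while a batch size of 64 needs 16, and both runs compute the same sum. The fixed cost of each transfer is therefore spread over more items, which is the sense in which grouping can lower the average communication overhead per item.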
