Published in: International Conference on Supercomputing

Synchronization optimizations for efficient execution on multi-cores



Abstract

Multi-cores are becoming ubiquitous, as exemplified by Sun's Niagara-2, Intel's Nehalem and AMD's Sao Paulo octal cores. The number of cores per chip is expected to rise in the foreseeable future, as evidenced by Intel's recently announced 80-core Teraflops Research Chip. Exploiting the parallelism of multi-cores necessitates concurrent software. One way to parallelize programs that are not amenable to auto-parallelization is via explicit synchronization. The placement of the synchronization primitives has a large bearing on how much thread-level parallelism (TLP) can be achieved. In this paper, we propose novel predication-based and other adjunct synchronization optimizations which facilitate exploitation of a higher level of TLP than can be achieved using the state-of-the-art. We demonstrate the efficacy of our techniques on a real machine, using real codes, specifically from the industry-standard SPEC CPU benchmarks and other widely used open source codes such as PostgreSQL. Our results show that the proposed techniques yield significantly higher levels of TLP than the state-of-the-art.
