【24h】

Exploiting Speculative Thread-Level Parallelism in Data Compression Applications

机译:在数据压缩应用程序中利用推测性线程级并行性

获取原文
获取原文并翻译 | 示例

摘要

Although hardware support for Thread-Level Speculation (TLS) can ease the compiler's tasks in creating parallel programs by allowing the compiler to create potentially dependent parallel threads, advanced compiler optimization techniques must be developed and judiciously applied to achieve the desired performance. In this paper, we take a close examination on two data compression benchmarks, GZIP and BZIP2, propose, implement and evaluate new compiler optimization techniques to eliminate performance bottlenecks in their parallel execution and improve their performance. The proposed techniques (ⅰ) remove the critical forwarding path created by synchronizing memory-resident values; (ⅱ) identify and categorize reduction-like variables whose intermediate results are used within loops, and propose code transformation to remove the inter-thread data dependences caused by these variables; and (ⅲ) transform the program to eliminate stalls caused by variations in thread size. While no previous work has reported significant performance improvement on parallelizing these two benchmarks, we are able to achieve up to 36% performance improvement for GZIP and 37% for BZIP2.
机译:尽管对线程级推测(TLS)的硬件支持可以通过允许编译器创建可能依赖的并行线程来简化编译器创建并行程序的任务,但必须开发并明智地使用高级编译器优化技术,以实现所需的性能。在本文中,我们仔细研究了两个数据压缩基准GZIP和BZIP2,提出,实施和评估了新的编译器优化技术,以消除并行执行时的性能瓶颈并提高其性能。所提出的技术(ⅰ)删除了通过同步内存驻留值创建的关键转发路径; (ⅱ)识别并归类于循环中使用中间结果的约简类变量,并提出代码转换以消除由这些变量引起的线程间数据依赖性; (ⅲ)转换程序以消除由于线程大小变化而导致的停顿。尽管没有以前的工作报道过将这两个基准进行并行处理可以显着提高性能,但是我们可以将GZIP和BZIP2的性能分别提高36%和37%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号