首页> 外文OA文献 >2-D wavelet transform enhancement on general-purpose microprocessors: memory hierarchy and SIMD parallelism exploitation
【2h】

2-D wavelet transform enhancement on general-purpose microprocessors: memory hierarchy and SIMD parallelism exploitation

机译:通用微处理器上的二维小波变换增强:存储器层次结构和SIMD并行性开发

摘要

This paper addresses the implementation of a 2-D Discrete Wavelet Transform on general-purpose microprocessors, focusing on both memory hierarchy and SIMD parallelization issues. Both topics are somewhat related, since SIMD extensions are only useful if the memory hierarchy is efficiently exploited. In this work, locality has been significantly improved by means of a novel approach called pipelined computation, which complements previous techniques based on loop tiling and non-linear layouts. As experimental platforms we have employed a Pentium-III (P-III) and a Pentium-4 (P-4) microprocessor. However, our SIMD-oriented tuning has been exclusively performed at source code level. Basically, we have reordered some loops and introduced some modifications that allow automatic vectorization. Taking into account the abstraction level at which the optimizations are carried out, the speedups obtained on the investigated platforms are quite satisfactory, even though further improvement can be obtained by dropping the level of abstraction (compiler intrinsics or assembly code).
机译:本文着眼于通用微处理器上的二维离散小波变换的实现,重点关注存储器层次结构和SIMD并行化问题。这两个主题都有些相关,因为SIMD扩展仅在有效利用内存层次结构时才有用。在这项工作中,通过一种称为流水线计算的新方法大大改善了局部性,该方法补充了基于循环平铺和非线性布局的先前技术。作为实验平台,我们采用了Pentium-III(P-III)和Pentium-4(P-4)微处理器。但是,我们的面向SIMD的调整仅在源代码级别执行。基本上,我们对一些循环进行了重新排序,并引入了一些允许自动矢量化的修改。考虑到执行优化的抽象级别,尽管可以通过降低抽象级别(编译器内在函数或汇编代码)获得进一步的改进,但在研究平台上获得的加速效果还是令人满意的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号