首页> 外文会议>European Conference on Parallel Computing >Memory Bandwidth: The True Bottleneck of SIMD Multimedia Performance on a Superscalar Processor
【24h】

Memory Bandwidth: The True Bottleneck of SIMD Multimedia Performance on a Superscalar Processor

机译:内存带宽:Superscalar处理器上SIMD多媒体性能的真实瓶颈

获取原文

摘要

This paper presents the performance of DSP, image and 3D applications on recent general-purpose microprocessors using streaming SIMD ISA extensions (integer and floating point). The 9 benchmarks benchmark we use for this evaluation have been optimized for DLP and caches use with SIMD extensions and data prefetch. The result of these cumulated optimizations is a speedup that ranges from 1.9 to 7.1. All the benchmarks were originaly computation bound and 7 becomes memory bandwidth bound with the addition of SIMD and data prefetch. Quadrupling the memory bandwidth has no effect on original kernels but improves the performance of SIMD kernels by 15-55%.
机译:本文在近期通用微处理器上使用流SIMD ISA扩展(整数和浮点)提出了DSP,图像和3D应用的性能。我们用于此评估的9个基准测试基准已针对DLP和缓存使用SIMD扩展和数据预取优化。这些累积优化的结果是加速,范围为1.9到7.1。所有基准测试都是最初的计算绑定,7个以添加SIMD和数据预取的内存带宽绑定。数据内存带宽对原始内核没有影响,但提高了SIMD内核的性能15-55%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号