首页> 外文会议>Performance, Computing and Communications, 1998. IPCCC '98., IEEE International >Modeling the performance of general purpose instruction level parallel architectures in image processing
【24h】

Modeling the performance of general purpose instruction level parallel architectures in image processing

机译:在图像处理中对通用指令级并行架构的性能进行建模

获取原文

摘要

RISC Instruction Level Parallel systems are today the most commonly used high performance computing platform. On such systems, Image Processing and Pattern Recognition (IPPR) tasks, if not thoroughly optimized to fit each architecture, exhibit a performance level up to one order of magnitude lower than expected. In this paper we identify the sources of such behavior and we model them defining a set of indices to measure their influence. Our model allows planning program optimizations, assessing the results of such optimizations as well as evaluating the efficiency of the CPUs architectural solutions in IPPR tasks. A case study using a combination of a specific IPPR task and a RISC workstation is used to demonstrate these capabilities. We analyze the sources of inefficiency of the task, we plan some source level program optimizations, namely data type optimization and loop unrolling, and we assess the impact of these transformations on the task performance. The results of our study allow us to obtain an eight times performance improvement and to conclude that, in low-medium level IPPR tasks, it is more difficult to efficiently exploit superscalarity than pipelining.
机译:RISC指令级并行系统是当今最常用的高性能计算平台。在这样的系统上,图像处理和模式识别(IPPR)任务,如果没有完全优化以适合每种体系结构,则其性能水平将比预期降低多达一个数量级。在本文中,我们确定了此类行为的来源,并对它们进行建模,定义了一组指标以衡量其影响。我们的模型允许规划程序优化,评估此类优化的结果以及评估IPPR任务中CPU体系结构解决方案的效率。通过结合使用特定IPPR任务和RISC工作站的案例研究来演示这些功能。我们分析了任务效率低下的根源,计划了一些源代码级程序优化,即数据类型优化和循环展开,并评估了这些转换对任务性能的影响。我们的研究结果使我们获得了八倍的性能提升,并得出结论,在中低级IPPR任务中,有效利用超标量比流水线更困难。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号