首页> 外文期刊>IEEE Transactions on Computers >Exploiting parallelism in geometry processing with general purpose processors and floating-point SIMD instructions
【24h】

Exploiting parallelism in geometry processing with general purpose processors and floating-point SIMD instructions

机译:利用通用处理器和浮点SIMD指令在几何处理中利用并行性

获取原文
获取原文并翻译 | 示例

摘要

Three-dimensional (3D) graphics applications have become very important workloads running on today's computer systems. A cost-effective graphics solution is to perform geometry processing of 3D graphics on the host CPU and have specialized hardware handle the rendering task. In this paper, we analyze microarchitecture and SIMD instruction set enhancements to a RISC superscalar processor for exploiting parallelism in geometry processing for 3D computer graphics. Our results show that 3D geometry processing has inherent parallelism. Adding SIMD operations improves performance from 8 percent to 28 percent on a 4-issue dynamically scheduled processor that can issue at most two floating-point operations. In comparison, an 8-issue processor, ignoring cycle time effects, can achieve 20 to 60 percent performance improvement over a 4-issue. If processor cycle time scales with the number of ports to the register file, then doubling only the floating-point issue width of a 4-issue processor with SIMD instructions gives the best performance among the architectural configurations that we examine (the most aggressive configuration is an 8-issue processor with SIMD instructions).
机译:三维(3D)图形应用程序已成为当今计算机系统上运行的非常重要的工作负载。一种经济高效的图形解决方案是在主机CPU上执行3D图形的几何处理,并由专门的硬件处理渲染任务。在本文中,我们分析了RISC超标量处理器的微体系结构和SIMD指令集增强功能,以利用3D计算机图形的几何处理中的并行性。我们的结果表明3D几何处理具有固有的并行性。在可以发出最多两个浮点运算的4个问题的动态调度处理器上,添加SIMD运算可以将性能从8%提高到28%。相比之下,忽略周期时间影响的8问题处理器可以比4问题获得20%到60%的性能提升。如果处理器周期时间与寄存器文件的端口数量成比例,那么使用SIMD指令仅将4问题处理器的浮点问题宽度加倍,就可以在我们研究的架构配置中获得最佳性能(最积极的配置是具有SIMD指令的8问题处理器)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号