Journal: Concurrency and Computation: Practice and Experience

Enabling heterogeneous ray-tracing acceleration in edge/cloud architectures

Abstract

The ray-tracing algorithm is very costly in terms of time complexity, and among the many techniques conceived over the years to accelerate its execution, one stands out: exploiting the parallelism of ray-triangle intersection operations. Field-programmable gate arrays (FPGAs) have plenty of resources to run specialized accelerators that execute multiple operations in parallel. Moreover, modern FPGAs embed multiprocessor systems-on-chip based on the ARM architecture, which can be used simultaneously with the FPGA programmable logic to further accelerate application execution. In this work, we present and analyze a reconfigurable ray-tracing accelerator specialized in computing ray-triangle intersections at the network edge of a heterogeneous cloud computing environment. The accelerator is specified using Xilinx high-level synthesis and is implemented in a Xilinx Zynq FPGA (XC7Z020-1CLG400C). We also present an execution model that enables the exploitation of the available computing elements of the heterogeneous system: ARM Cortex-A53, FPGA programmable logic, and cloud machines. Experimental performance and synthesis results show that the heterogeneous system can efficiently render a simplified version of the Stanford Bunny model when using the hardware accelerator with up to six instances of a ray-triangle intersection unit together with the other computing resources.
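To make the ray-triangle intersection operation mentioned in the abstract concrete, here is a minimal sketch in Python. The abstract does not say which intersection test the accelerator implements; the widely used Möller-Trumbore test is assumed here purely for illustration. The names `intersect`, `nearest_hit`, and the `units` parameter are hypothetical, and the round-robin split only mimics in software how triangles might be divided among up to six intersection units. It is not the authors' HLS design or execution model.

```python
from typing import Optional, Sequence, Tuple

Vec3 = Tuple[float, float, float]
Triangle = Tuple[Vec3, Vec3, Vec3]

EPSILON = 1e-9  # tolerance for parallel rays and self-intersection


def sub(a: Vec3, b: Vec3) -> Vec3:
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def dot(a: Vec3, b: Vec3) -> float:
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def cross(a: Vec3, b: Vec3) -> Vec3:
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def intersect(origin: Vec3, direction: Vec3, tri: Triangle) -> Optional[float]:
    """Moller-Trumbore test: return the ray parameter t of the hit, or None."""
    v0, v1, v2 = tri
    e1, e2 = sub(v1, v0), sub(v2, v0)
    p = cross(direction, e2)
    det = dot(e1, p)
    if abs(det) < EPSILON:           # ray is parallel to the triangle plane
        return None
    inv_det = 1.0 / det
    t_vec = sub(origin, v0)
    u = dot(t_vec, p) * inv_det      # first barycentric coordinate
    if u < 0.0 or u > 1.0:
        return None
    q = cross(t_vec, e1)
    v = dot(direction, q) * inv_det  # second barycentric coordinate
    if v < 0.0 or u + v > 1.0:
        return None
    t = dot(e2, q) * inv_det
    return t if t > EPSILON else None  # ignore hits behind the origin


def nearest_hit(origin: Vec3, direction: Vec3,
                triangles: Sequence[Triangle],
                units: int = 6) -> Optional[Tuple[float, int]]:
    """Split triangles round-robin over `units` hypothetical intersection
    units and reduce their results to the closest hit (t, triangle index).
    The units run one after another here; hardware would run them in parallel."""
    best: Optional[Tuple[float, int]] = None
    for unit in range(units):
        for idx in range(unit, len(triangles), units):
            t = intersect(origin, direction, triangles[idx])
            if t is not None and (best is None or t < best[0]):
                best = (t, idx)
    return best
```

Each triangle test is independent of the others, which is why this operation lends itself to replication across several hardware units: only the final reduction to the nearest hit needs to combine results.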
