首页> 外文会议>IEEE Symposium on Computer Arithmetic >Single Precision Natural Logarithm Architecture for Hard Floating-Point and DSP-Enabled FPGAs
【24h】

Single Precision Natural Logarithm Architecture for Hard Floating-Point and DSP-Enabled FPGAs

机译:用于硬浮点和启用DSP的FPGA的单精度自然对数架构

获取原文

摘要

In this paper we will present a novel method for implementing floating point (FP) elementary functions using the new FP single precision addition and multiplication features of the Altera Arria~10 DSP Block architecture. Our application example will use log(x), one of the most commonly required functions for emerging datacenter and computing FPGA targets. We will explain why the combination of new FPGA technology, and at the same time, a massive increase in computing performance requirement, fuels the need for this work. We show a comprehensive error analysis, both for the overall function, and each subsection of the architecture, demonstrating that the hard FP (HFP) Blocks, in conjunction with the traditional flexibility and connectivity of the FPGA, can provide a robust and high performance solution. These methods create a highly accurate single precision IEEE754 function, which is OpenCL conformant. Our methods map directly to almost exclusively embedded structures, and therefore result in significant reduction in logic resources and routing stress compared to current methods, and demonstrate that newly introduced FPGA routing architectures can be leveraged to use almost no soft resources. We also show that the latency of the log(x) function can be changed independently of the architecture and function, allowing the performance of the function to be adjusted directly to the system clock rate.
机译:在本文中,我们将介绍一种使用Altera Arria〜10 DSP模块架构的新FP单精度加法和乘法功能来实现浮点(FP)基本功能的新颖方法。我们的应用示例将使用log(x),这是新兴数据中心和计算FPGA目标最常用的功能之一。我们将解释为什么新的FPGA技术的结合以及同时对计算性能要求的大幅提高推动了这项工作的需求。我们针对架构的整体功能和每个子部分都展示了全面的错误分析,表明硬FP(HFP)模块与FPGA的传统灵活性和连通性一起可以提供可靠且高性能的解决方案。这些方法创建了符合OpenCL的高精度单精度IEEE754函数。我们的方法直接映射到几乎专有的嵌入式结构,因此与当前方法相比,显着减少了逻辑资源和布线压力,并证明了可以利用新引入的FPGA布线架构来几乎不使用任何软资源。我们还表明,可以独立于体系结构和功能来更改log(x)函数的延迟,从而可以将函数的性能直接调整为系统时钟速率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号