首页> 外文期刊>IEEE transactions on very large scale integration (VLSI) systems >Area-Efficient Arithmetic Expression Evaluation Using Deeply Pipelined Floating-Point Cores
【24h】

Area-Efficient Arithmetic Expression Evaluation Using Deeply Pipelined Floating-Point Cores

机译:使用深度流水线浮点核心的面积有效算术表达式评估

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Recently, it has become possible to implement floating-point cores on field-programmable gate arrays (FPGAs) to provide acceleration for the myriad applications that require high-performance floating-point arithmetic. To achieve high clock rates, floating-point cores for FPGAs must be deeply pipelined. This deep pipelining makes it difficult to reuse the same floating-point core for a series of dependent computations. However, floating-point cores use a great deal of area, so it is important to use as few of them in an architecture as possible. In this paper, we describe area-efficient architectures and algorithms for arithmetic expression evaluation. Such expression evaluation is necessary in applications from a wide variety of fields, including scientific computing and cognition. The proposed designs effectively hide the pipeline latency of the floating-point cores and use at most two floating-point cores for each type of operator in the expression. While best-suited for particular classes of expressions, the proposed designs can evaluate general expressions as well. Additionally, multiple expressions can be evaluated without reconfiguration. Experimental results show that the areas of our designs increase linearly with the number of types of operations in the expression and that our designs occupy less area and achieve higher throughput than designs generated by a commercial hardware compiler.
机译:最近,在现场可编程门阵列(FPGA)上实现浮点内核已成为可能,从而为需要高性能浮点算法的各种应用提供了加速。为了实现高时钟速率,必须对FPGA的浮点内核进行深度流水线处理。这种深层次的流水线化使得很难将相同的浮点核心重用于一系列相关计算。但是,浮点核心占用大量空间,因此在架构中使用尽可能少的核心非常重要。在本文中,我们描述了用于算术表达式评估的高效区域结构和算法。这种表达评估在包括科学计算和认知在内的广泛领域的应用中是必需的。所提出的设计有效地隐藏了浮点核心的流水线延迟,并为表达式中的每种类型的运算符最多使用了两个浮点核心。虽然最适合特定类别的表达式,但建议的设计也可以评估通用表达式。此外,无需重新配置即可评估多个表达式。实验结果表明,我们设计的面积随着表达式中操作类型的增加而线性增加,并且与商用硬件编译器生成的设计相比,我们的设计占用的面积更少,并且吞吐量更高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号