首页> 外文会议>International Conference on Application-specific Systems, Architectures and Processors >Precision Adaptation for Fast and Accurate Polynomial Evaluation Generation
【24h】

Precision Adaptation for Fast and Accurate Polynomial Evaluation Generation

机译:精确匹配,可快速准确地生成多项式求值

获取原文

摘要

Polynomial evaluation is a critical part of the efficient floating-point approximation of elementary functions, in software as well as in FPGA-based systems. Designing an optimized polynomial evaluation scheme is a complex and tedious task, due to multitudes of choices in numerous dimensions: the evaluation scheme, like Horner or Estrin, needs to be selected based on implementation goals (latency, throughput, accuracy. . . ) and be adapted to a given architecture, for example by adapting the level of parallelism to the architecture capabilities. For each operation, a fixed-point or floating-point format needs to be chosen, e.g. between formats such as binary32, binary64. Furthermore some schemes and formats induce compromises, in particular when it comes to vectorized evaluation schemes. As part of a longer automated code generation toolchain, polynomial evaluation gains to be used repeatedly. Several aspects of polynomial evaluation have been presented before, such as code generation for Horner schemes with floating-point expansions or optimization of polynomial evaluation schemes. In this work we study both combination and extension of these techniques, striving for their integration in a code generator. In particular, we present an algorithm within the Metalibm-ludgdunum code generation framework, based on input by Metalibm-lutetia. Our intent is to offer state of the art multi-word evaluation with polynomial scheme space exploration with CGPE, Gappa correctness proof and advanced code generation, suited for High-Level Synthesis.
机译:在软件以及基于FPGA的系统中,多项式求值是基本函数有效浮点逼近的关键部分。由于在多个维度上有众多选择,因此设计优化的多项式评估方案是一项繁琐而繁琐的任务:需要根据实现目标(延迟,吞吐量,准确性...)来选择评估方案,例如Horner或Estrin。例如通过使并行度适应体系结构能力来适应给定的体系结构。对于每个操作,需要选择定点或浮点格式,例如之间的格式,例如binary32,binary64。此外,某些方案和格式会引起折衷,特别是在向量化评估方案方面。作为较长的自动代码生成工具链的一部分,多项式评估得到了重复使用。以前已经介绍了多项式评估的几个方面,例如具有浮点扩展的Horner方案的代码生成或多项式评估方案的优化。在这项工作中,我们将研究这些技术的组合和扩展,并努力将其集成到代码生成器中。特别是,我们根据Metalibm-lutetia的输入,在Metalibm-ludgdunum代码生成框架中提出了一种算法。我们的意图是通过CGPE,Gappa正确性证明和高级代码生成功能,为多项综合评估提供最先进的多词评估,适用于高级综合的功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号