首页> 外文期刊>Advances in Radio Science >Area- and energy-efficient CORDIC accelerators in deep sub-micron CMOS technologies
【24h】

Area- and energy-efficient CORDIC accelerators in deep sub-micron CMOS technologies

机译:深亚微米CMOS技术中面积和能源效率高的CORDIC加速器

获取原文
           

摘要

The COordinate Rotate DIgital Computer (CORDIC) algorithm is a well knownversatile approach and is widely applied in today's SoCs for especially butnot restricted to digital communications. Dedicated CORDIC blocks can beimplemented in deep sub-micron CMOS technologies at very low area and energycosts and are attractive to be used as hardware accelerators for ApplicationSpecific Instruction Processors (ASIPs). Thereby, overcoming the well knownenergy vs. flexibility conflict. Optimizing Global Navigation SatelliteSystem (GNSS) receivers to reduce the hardware complexity is an importantresearch topic at present. In such receivers CORDIC accelerators can be usedfor digital baseband processing (fixed-point) and in Position-Velocity-Timeestimation (floating-point). A micro architecture well suited to suchapplications is presented. This architecture is parameterized according tothe wordlengths as well as the number of iterations and can be easilyextended for floating point data format. Moreover, area can be traded forthroughput by partially or even fully unrolling the iterations, whereby thedegree of pipelining is organized with one CORDIC iteration per cycle. Fromthe architectural description, the macro layout can be generated fullyautomatically using an in-house datapath generator tool. Since the addersand shifters play an important role in optimizing the CORDIC block, theymust be carefully optimized for high area and energy efficiency in theunderlying technology. So, for this purpose carry-select adders andlogarithmic shifters have been chosen. Device dimensioning was automaticallyoptimized with respect to dynamic and static power, area and performanceusing the in-house tool. The fully sequential CORDIC block for fixed-pointdigital baseband processing features a wordlength of 16 bits, requires 5232transistors, which is implemented in a 40-nm CMOS technology and occupies asilicon area of 1560 μm2 only. Maximum clock frequency fromcircuit simulation of extracted netlist is 768 MHz under typical, and 463 MHz under worst case technology and application corner conditions,respectively. Simulated dynamic power dissipation is 0.24 uW MHz?1 at 0.9 V; static power is 38 uW in slow corner, 65 uW in typical corner and 518 uW infast corner, respectively. The latter can be reduced by 43% in a 40-nmCMOS technology using 0.5 V reverse-backbias. These features are comparedwith the results from different design styles as well as with animplementation in 28-nm CMOS technology. It is interesting that in thelatter case area scales as expected, but worst case performance and energydo not scale well anymore.
机译:坐标旋转数字计算机(CORDIC)算法是一种众所周知的通用方法,已广泛应用于当今的SoC中,特别是但不限于数字通信。专用的CORDIC块可以以非常低的面积和较低的能源成本在深亚微米CMOS技术中实现,并且吸引人用作专用指令处理器(ASIP)的硬件加速器。因此,克服了众所周知的能源与灵活性的冲突。优化全球导航卫星系统(GNSS)接收器以降低硬件复杂性是当前的重要研究课题。在此类接收机中,CORDIC加速器可用于数字基带处理(定点)和位置速度时间估计(浮点)。提出了一种非常适合此类应用的微架构。该体系结构根据字长以及迭代次数进行了参数化,可以很容易地扩展为浮点数据格式。此外,可以通过部分或什至完全展开迭代来交换区域的吞吐量,从而流水线化程度按每个周期一个CORDIC迭代进行组织。根据体系结构描述,可以使用内部数据路径生成器工具自动生成宏布局。由于加法器和移位器在优化CORDIC块中起着重要作用,因此必须对基础技术中的高面积和高能效进行仔细优化。因此,为此目的选择了进位选择加法器和对数移位器。使用内部工具可针对动态和静态功耗,面积和性能自动优化设备尺寸。用于定点数字基带处理的全序列CORDIC模块的字长为16位,需要5232个晶体管,该晶体管采用40 nm CMOS技术实现,仅占用1560μm 2 的硅面积。在典型情况下,从提取的网表进行电路仿真得到的最大时钟频率通常为768 MHz,在最坏情况下的技术和应用转折条件下则为463 MHz。在0.9 V时仿真的动态功耗为0.24 uW MHz ?1 ;静态功率在慢转角为38 uW,在典型转角为65 uW,在快速转角为518 uW。在使用0.5 V反向偏置的40 nmCMOS技术中,后者可以减少43%。将这些功能与不同设计风格的结果以及28 nm CMOS技术的实现进行了比较。有趣的是,在其他情况下,该区域按预期比例缩放,但最坏情况下的性能和能量却无法很好地缩放。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号