...
首页> 外文期刊>Journal of Low Power Electronics and Applications >VLSI Architecture for 8-Point AI-based Arai DCT having Low Area-Time Complexity and Power at Improved Accuracy
【24h】

VLSI Architecture for 8-Point AI-based Arai DCT having Low Area-Time Complexity and Power at Improved Accuracy

机译:具有低区域时间复杂度和低功耗且精度提高的基于8点AI的Arai DCT的VLSI架构

获取原文
           

摘要

A low complexity digital VLSI architecture for the computation of an algebraic integer (AI) based 8-point Arai DCT algorithm is proposed. AI encoding schemes for exact representation of the Arai DCT transform based on a particularly sparse 2-D AI representation is reviewed, leading to the proposed novel architecture based on a new final reconstruction step (FRS) having lower complexity and higher accuracy compared to the state-of-the-art. This FRS is based on an optimization derived from expansion factors that leads to small integer constant-coefficient multiplications, which are realized with common sub-expression elimination (CSE) and Booth encoding. The reference circuit [1] as well as the proposed architectures for two expansion factors α? = 4.5958 and α′ = 167.2309 are implemented. The proposed circuits show 150% and 300% improvements in the number of DCT coefficients having error ≤ 0:1% compared to [1]. The three designs were realized using both 40 nm CMOS Xilinx Virtex-6 FPGAs and synthesized using 65 nm CMOS general purpose standard cells from TSMC. Post synthesis timing analysis of 65 nm CMOS realizations at 900 mV for all three designs of the 8-point DCT core for 8-bit inputs show potential real-time operation at 2.083 GHz clock frequency leading to a combined throughput of 2.083 billion 8-point Arai DCTs per second. The expansion-factor designs show a 43% reduction in area (A) and 29% reduction in dynamic power (PD) for FPGA realizations. An 11% reduction in area is observed for the ASIC design for α? = 4.5958 for an 8% reduction in total power (PT ). Our second ASIC design having α′ = 167.2309 shows marginal improvements in area and power compared to our reference design but at significantly better accuracy.
机译:提出了一种低复杂度的数字VLSI架构,用于计算基于代数整数(AI)的8点Arai DCT算法。审查了基于特别稀疏的2D AI表示的用于Arai DCT变换的精确表示的AI编码方案,从而导致了基于新的最终重构步骤(FRS)提出的新颖架构,与状态相比,它具有更低的复杂度和更高的准确性最先进的。此FRS基于从扩展因子得出的优化,该优化导致小整数常数系数乘法,这可以通过通用子表达式消除(CSE)和Booth编码来实现。参考电路[1]以及针对两个扩展因子α?的建议架构。 = 4.5958和α'= 167.2309被实现。与[1]相比,所提出的电路在误差≤0:1%的DCT系数数量上显示出150%和300%的改善。这三种设计均使用40 nm CMOS Xilinx Virtex-6 FPGA来实现,并使用TSMC的65 nm CMOS通用标准单元进行了合成。对于三种用于8位输入的8点DCT内核的所有三种设计,在900 mV下对65 nm CMOS实现的合成后时序分析显示,潜在的实时操作在2.083 GHz时钟频率下,总吞吐量为20.83亿个8点每秒Arai DCT。扩展因子设计显示,FPGA实现的面积(A)减少了43%,动态功耗(P D )减少了29%。对于ASIC设计,发现面积减小了11%。 = 4.5958,总功率降低了8%(P T )。与参考设计相比,我们的第二个ASIC设计具有α'= 167.2309,显示出面积和功耗的微小改进,但准确性更高。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号