首页> 外文期刊>IEEE Transactions on Signal Processing >Hardware Efficient Fast DCT Based on Novel Cyclic Convolution Structures
【24h】

Hardware Efficient Fast DCT Based on Novel Cyclic Convolution Structures

机译:基于新型循环卷积结构的硬件高效快速DCT

获取原文
获取原文并翻译 | 示例

摘要

Cyclic convolution is a widely used operation in signal processing. In very large-scale integration (VLSI) design, it is usually implemented with systolic array and distributed arithmetic; however, these implementation designs may not be fast enough or use too much hardware cost when the convolution length is large. This paper presents a new fast cyclic convolution algorithm, which is hardware efficient and suitable for high-speed VLSI implementation, especially when the convolution length is large. For example, when the proposed fast cyclic convolution algorithm is applied to the implementation of prime length discrete cosine transform (DCT), the proposed high-throughput implementation of 1297-length DCT design saves 1216 (94%) multiplications, 282 (22%) additions, and 4792 (74%) delay elements compared with those of recently proposed systolic array based algorithms. Furthermore, the proposed algorithm can run at a speed that is 1.5 times that of previous designs and requires less I/O cost as long as the wordlength$ L$is less than 20 bits.
机译:循环卷积是信号处理中广泛使用的操作。在超大规模集成(VLSI)设计中,通常使用脉动阵列和分布式算法来实现。但是,当卷积长度较大时,这些实现设计可能不够快,或使用了过多的硬件成本。本文提出了一种新的快速循环卷积算法,该算法硬件效率高,适用于高速VLSI实现,尤其是在卷积长度较大时。例如,将拟议的快速循环卷积算法应用于素数长度离散余弦变换(DCT)的实现时,拟议的1297长度DCT设计的高吞吐量实现可节省1216(94%)的乘法,节省282(22%)的乘法与最近提出的基于脉动阵列的算法相比,增加了4792个(74%)延迟元素。此外,所提出的算法可以以先前设计的1.5倍的速度运行,并且只要字长L L小于20位,就需要更少的I / O成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号