...
首页> 外文期刊>Journal of VLSI signal processing systems >A Compact DSP Core with Static Floating-Point Arithmetic
【24h】

A Compact DSP Core with Static Floating-Point Arithmetic

机译:具有静态浮点算法的紧凑型DSP内核

获取原文
获取原文并翻译 | 示例
           

摘要

A multimedia system-on-a-chip (SoC) usually contains one or more programmable digital signal processors (DSP) to accelerate data-intensive computations. But most of these DSP cores are designed originally for standalone applications, and they must have some overlapped (and redundant) components with the host microprocessor. This paper presents a compact DSP for multi-core systems, which is fully programmable and has been optimized to execute a set of signal processing kernels very efficiently. The DSP core was designed concurrently with its automatic software generator based on high-level synthesis. Moreover, it performs lightweight arithmetic-the static floating-point (SFP), which approximates the quality of floating-point (FP) operations with the hardware similar to that of the integer arithmetic. In our simulations, the compact DSP and its auto-generated software can achieve 3X performance (estimated in cycles) of those DSP cores in the dual-core baseband processors with similar computing resources. Besides, the 16-bit SFP has above 40 dB signal to round-off noise ratio over the IEEE single-precision FP, and it even outperforms the hand-optimized programs based on the 32-bit integer arithmetic. The 24-bit SFP has above 64 dB quality, of which the maximum precision is identical to that of the single-precision FP. Finally, the DSP core has been implemented and fabricated in the UMC 0.18μm 1P6M CMOS technology. It can operate at 314.5 MHz while consuming 52mW average power. The core size is only 1.5 mm x 1.5 mm including the 16 KB on-chip memory and the AMBA AHB interface.
机译:多媒体片上系统(SoC)通常包含一个或多个可编程数字信号处理器(DSP),以加速数据密集型计算。但是,大多数这些DSP内核最初都是为独立应用程序设计的,它们必须与主机微处理器有一些重叠(和冗余)的组件。本文提出了一种用于多核系统的紧凑型DSP,该DSP是完全可编程的,并且已经过优化,可以非常高效地执行一组信号处理内核。 DSP内核与基于高级综合的自动软件生成器同时设计。此外,它执行轻量级算术-静态浮点(SFP),使用类似于整数算术的硬件,可以近似浮点(FP)操作的质量。在我们的仿真中,紧凑型DSP及其自动生成的软件可以在具有相似计算资源的双核基带处理器中实现那些DSP核的3倍性能(按周期估算)。此外,与IEEE单精度FP相比,16位SFP的信号与舍入噪声比超过40 dB,甚至优于基于32位整数算法的手动优化程序。 24位SFP的质量高于64 dB,其最大精度与单精度FP相同。最后,DSP内核已通过UMC0.18μm1P6M CMOS技术实现和制造。它可以在314.5 MHz上运行,同时消耗52mW的平均功率。内核尺寸仅为1.5 mm x 1.5 mm,包括16 KB片上存储器和AMBA AHB接口。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号