首页> 外文会议>IASTED international conference on signal and image processing >AN EFFICIENT VECTORIZATION OF FIR FILTER FOR VECTOR PROCESSOR
【24h】

AN EFFICIENT VECTORIZATION OF FIR FILTER FOR VECTOR PROCESSOR

机译:矢量处理器的FIR滤波器的有效矢量化

获取原文

摘要

The Vectorization of algorithms mapping for vectorprocessor is a critical issues. An efficient vectorization ofFIR filter for vector processor is proposed, in which the FIRfilter computation is divided into N-step vectormultiplication, each vector multiplication is executed inparallel by sixteen vector processing elements, thecalculation of sixteen output is completed at once.Compared with existing methods, this method can fullyexploit the instruction level and data level parallelism ofvector processor, it can be applied to the FIR filter withdifferent length of coefficients, it is not limited to vectorprocessors whether to support the addition of reduction, andsupports 8-bit, 16-bit fixed-point real, fixed-point complex,32-bit floating-point real and complex data types.Experimental results show that the execution time forcalculating 1024-points with a 50-tap fixed-point real FIRfilter based on YHFT-Matrix is only 7.4 us, thevectorization of floating-point complex FIR filter achievesnearly 8x speedup over sequential algorithm ofTMS320C67x, the vectorization of fixed-point complexFIR filter achieves nearly 16x speedup over sequentialalgorithm of TMS320C64x.
机译:向量映射算法的向量化 处理器是一个关键问题。的有效矢量化 提出了一种用于矢量处理器的FIR滤波器,其中FIR 滤波计算分为N步向量 乘法,每个向量乘法都在 由16个矢量处理元素并行 一次完成16个输出的计算。 与现有方法相比,该方法可以完全 利用指令级和数据级的并行性 向量处理器,可以将其应用于FIR滤波器 系数的长度不同,不限于矢量 处理器是否支持加减,以及 支持8位,16位定点实数,定点复数, 32位浮点实数和复数数据类型。 实验结果表明,执行时间为 用50抽头定点实数FIR计算1024点 基于YHFT-Matrix的滤波器仅为7.4 us, 浮点复数FIR滤波器的矢量化实现 比顺序算法的速度提高近8倍 TMS320C67x,定点复合体的矢量化 FIR滤波器实现了连续序列近16倍的加速 TMS320C64x的算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号